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: PRIMACY EFFECTS IN PERSONALITY IMPRESSION 


FORMATION USING A GENERALIZED ORDER ` 
EFFECT PARADIGM * 


NORMAN H. ANDERSON 
University of California, San Diego 


Ss were read a set of personality trait adjectives, and judged how much they 
would like a person so described, A generalized order effect paradigm was used, 
Each set consisted of a sequence of high (or low) value adjectives; into this 
sequence a block of 3 low (or high) adjectives was interpolated at all possible 
ordinal positions, The results showed a straight-line primacy effect: the net 
influence of an adjective decreased linearly with ordinal position in the set. It 
was shown that the response to a set could be described as a weighted average 
of the scale values of the separate adjectives. This model was related to a 
previously employed linear model for opinion change. An interpretation of the 


results in terms of shift in meaning was also discussed. 


In a task introduced by Asch (1946), the 
subject is presented with a set of adjectives 
that describe a person, and gives his impres- 
sion of the person so described. Tf the adjec- 
tives are presented in serial order, the response 
to a given set may depend on the particular 
order in which they are given. In fact, a 
primacy, or first impression, effect is usually 
obtained (e.g., Anderson & Barrios, 1961; 
Asch, 1946). Its cause is unknown. 

It is useful to consider this personality 
adjective task as serial information processing: 
as each successive item of information is re- 
ceived, it is, presumably, integrated into the 
impression built up from the previous items. 
As a consequence, the impression will change 
continuously and systematically over the series 
‘of items. 
~ The theoretical analysis of impression for- 
mation is handicapped by lack of knowledge 
about this presumed serial buildup of the im- 
pression. The fact of primacy shows that, in 
some as yet poorly defined sense, the later 
adjectives have less influence than the earlier 
adjectives, but this conclusion is lacking in 
precision and detail. 

The purpose of the present experiments was 

_ to get a more detailed picture of the influence 
of early and late adjectives. For this purpose, 


11 wish to thank Stephen Hubert and Ralph 
Stewart for their help in running subjects, and Rhoda 
C. Lindner for a thorough, critical reading of the 
manuscript. The work was done at the University of 
California, Los Angeles, and was supported by Na- 
tional Science Foundation Grant G-12986. 


a probe stimulus was interpolated at various 
points within a set of adjectives. Tt was an- 
ticipated that the relative influence of the 
probe, as a function of its position of interpo- 
lation, would yield the desired picture, 

The paradigm employed can most readily 
be explained by reference to Table 1 which 
lists two of the four experimental conditions. 
The symbols, L and H, stand for adjectives 
of low and high scale value. In Condition 
(3L)6H, a set of 3 Ls is interpolated at all 
possible positions within a set of 6 Hs. There 
are seven possible interpolation positions, and 
the seven corresponding orderings are shown 
in the right-hand column of Table 1. 

In Condition (3L)3H, a set of 3 Ls is in- 
terpolated within a set of 3 Hs. In this case, 


TABLE 1 
ADJECTIVE ARRANORMENTS or Conprrions (3L) 3H 
anp (3L)6H ) 
Condition 
Interpolation 
position B E me- 
OLIH OL)óH 
1. IP, LLLHHH LLLHHHHHH 
2. IPs HLLLHH HLELHHHHH 
3. IP, HHLLLH HHLLLHHHH 
4. IP; HHHLLL HHHLLLHHH 
5. IPs HHHHLLLHH 
6. IP; HHHHHLLLH 
7. IP; HHHHHHLLL 


Note.—Adjectives read to the subject in order listed, H and 
L stand for adjectives of high and low scale value, The three 
Ls are interpolated as a unit at various positions within the Hs, 


eo: 
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there are only four interpolation positions, as 
listed in the left-hand column of Table 1. 

In both conditions, the set of 3 Ls is the 
probe stimulus. If later adjectives influence 
the impression less than earlier adjectives, the 
probe will have less influence the later it ap- 
pears. Thus, the curve relating the impression 
to the probe position will quantify the influ- 

„ence of an adjective as a function of its ordinal 

position in the set. As will be seen below, the 
scheme of Table 1 may also be considered as 
a generalized order effect paradigm. 

Of the two experiments reported here, Ex- 
periment I indicated that the generalized 
order effect paradigm yielded a straight-line 
primacy effect (Figure 1). Although this re- 
sult was formally consistent with previous 
work, it was somewhat surprising. Accordingly, 
a second experiment was performed, to repli- 
cate the first, and to test the applicability of 
a more precise experimental design. 


METHOD 
Instructions and Procedure 


Individual subjects were read sets of six, or nine, 
adjectives. They were told that each set of adjectives 
described a different hypothetical person, their task 
being to form an impression of that person and to 
judge how much they would like that person. 

Judgments were made verbally on an 8-point 
rating scale, ranging from 1 (highly unfavorable) to 
8 (highly favorable). Each step of the scale was 
indexed by a verbal label formed from the various 
combinations of favorable and unfavorable with 
highly, considerably, moderately, and slightly as 
modifiers. A card containing the scale lay before the 
subject throughout the experiment. 

Subjects were instructed to think of each adjective 
of a given set as having been contributed by a 
different acquaintance of the person being described. 
Thus, each set might contain both good and bad 
traits, and the adjectives might occasionally seem 
inconsistent. It was emphasized that there was no 
right or wrong response, that the subject’s own feel- 
ings were what mattered. 

The adjectives of each set were read once through 
at a steady rate allowing slightly over 2 seconds per 
adjective. The total time per set, including stimulus 
and response, was 20 seconds for six-adjective sets, 
and 30 seconds for nine-adjective sets. In Experi- 
ment I, four practice sets were given, but no, prac- 
tice was given in Experiment II. 


Adjectives 


The adjectives used in constructing the sets were 
the same as had been roughly scaled for use in 
previous work (Anderson & Barrios, 1961). A list of 
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48 H adjectives and a list of 48 L adjectives were 
used. Typical H adjectives were: kind, creative, 1. 
brilliant; typical L adjectives were: shallow, opin- 
ionated, unattractive. 


Subjects 


Subjects were female volunteers from introductory 
psychology who were fulfilling a course requirement: 
The Ns for Experiments I and II were 60 and $21. 
respectively. Random assignment was used within 
each experiment. 


Design 


The complete procedure for constructing the sets of 
adjectives was somewhat complicated. However, the |® 
specific details, given under the headings Experiment 
I and Experiment II below, are not necessary for an 
understanding of the basic design and results. The 
general idea of the design is given here and in the)” 
two following subsections. i 

The basic idea of the design can be illustrated by # 
considering Condition (3L)6H of Experiment IL” 
First, 3 L and 6 H adjectives were chosen, From fy 
these, 7 sets were formed by interpolating the 3 Ls” 
as a unit within the sequence of 6 Hs, at each of the 
seven possible interpolation positions as illustrated 
in Table 1. This step was then replicated six more 
times with newly chosen adjectives, thus yielding 49 
sets altogether. 

These 49 sets were arranged in seven blocks of 7/7 
sets using a Greco-Latin square so that, within each f 
block, each set of adjectives occurred once, and 
each interpolation position occurred once. The tri 
order of the blocks was then balanced over subjects: 

In general, the sets received by each subject were 
chosen to give a reasonable sample of different ad- 
jectives, and to balance adjectives and interpolation | 
positions over trials and over subjects. Although the 5 
design details are somewhat tedious, they are straight- 
forward and give perhaps the simplest method of 
satisfying the desired criteria. 

Conditions. Both experiments employed the same 
four main experimental conditions. Conditions 
(3L)3H and (3L)6H have been illustrated in Table” 
1 and described in the introduction. 

The remaining two conditions were the comple- 
ments of those in Table 1. In Condition (3H)3L, # 
sequence of 3 H adjectives was interpolated as & 
unit at each of the four possible positions within & 
sequence of 3 L adjectives. In Condition (3H) 61, 41 
sequence of 3 H adjectives was interpolated as a unit 
at each of the seven possible positions within a sè- 
quence of 6 L adjectives. 

Condition assignment. In Experiment I, each of 3 
subjects judged 32 sets from Condition (3L)3H andi 
32 sets from Condition (3H)3L. Thus, each of thes 
subjects judged 8 sets at each interpolation position 
for each condition. 

Similarly, each of 28 subjects judged 28 sets from 
Condition (3L)6H and 28 sets from Condition 
(3H)6L. Thus, each of these subjects judged 4 setsi 
of each interpolation position for each condition. 


> 
> 


IMPRESSION FORMATION 3 


© e Jn Experiment II, each of 24 subjects judged 16 


sets from Condition (3L)3H and 16. sets from Con- 
dition (3H)3L, In contrast to Experiment I, separate 
‘groups of subjects were used in Conditions (3L)6H 
and (3H)6L; there were 14 subjects in each, and 
‘each subject judged 49 sets. 


Experiment I ` 


“To illustrate the design, consider first Condition 
(3L)6H. Here there are seven possible interpolation 
positions; accordingly, the construction unit was a 
7X 7 Greco-Latin square. 

_ A square was first chosen. Then seven 6-H sets 
were chosen with the 6 H adjectives arranged in 
fixed serial order. These seven 6-H sets constituted 
the columns of the square. Next seven 3-L sets were 
chosen with the 3-L adjectives arranged in fixed 
"serial order. These 3-L sets constituted the Latin 
letters of the square. In this way, each cell of the 
“square was assigned a 6-H set and a 3-L set; the 
latter was then interpolated within the former at 
the ordinal position indicated by the Greek letter in 
that cell. A similar square was constructed for Condi- 
‘ion (3H)6L, and the columns of these two squares 
‘were then interlaced to form a rectangle of 7 rows 
and 14 columns. 

‘Each row of the rectangle was assigned a dif- 
-ferent subject. Over a group of seven subjects, each 
3-L set was interpolated once into each 6-H set at 
each interpolation position; similarly each 3-H set 
was interpolated once into each 6-L set at each in- 
terpolation position. No subject judged the same set 
of nine adjectives twice. 

Each row of the rectangle just described contrib- 

_ uted one block of 14 trials for each of seven subjects. 
Three more such rectangles were constructed in the 
same way, thus yielding a total of four blocks of 14 
trials, These four blocks were then arranged in a 
4X4 Latin square to balance blocks of sets over 
trials. 

For ‘Conditions (3L)3H and (3H)3L, the method 
of construction was similar. The unit of construc- 
tion was a 4X4 Greco-Latin square since there 
were only four interpolation positions. As above, 
each subject served under both conditions. Sets were 
constructed in blocks of 16, each block being formed 
from two unit squares from each condition. There 
were again four such blocks, and these were bal- 
anced over trials by a 4X 4 Latin square. 


Experiment IT 


‘This experiment was essentially a replication of 


Experiment I. However, the design details differed 


* somewhat since it was required that each subject 


judge the same adjectives in each of the interpola- 
tion orderings. The design may be illustrated by 
considering Condition (3L)6H. The basic unit in the 
construction was again a 7 X 7 Greco-Latin square. 

First, seven 6-H sets were chosen with the 6 ad- 
jectives arranged in fixed serial order. For each 6-H 
set, a paired 3-L set was also chosen, with the 3 
adjectives arranged in fixed serial order. There were 
thus seven pairs of sets, each pair consisting of a 3-L 


set and a 6-H set. These pairs were assigned to the 
Latin letters of the square. Then, in ‘each cell of the 
square, the 3-L set was interpolated within the 6-H 
set at the ordinal position indicated by the Greek 
letter in that cell. The full 7 X 7 square thus in- 
corporated a sevenfold replication of the patadigm 
illustrated in the right-hand column of Table 1, each 
replication based on different adjectives. 

To balance sets over trials, the rows of the Greco- 
Latin square were then considered as the letters of a 
new 7 X 7 Latin square. One subject was assigned to 
each row of this Latin square. Each of these seven 
subjects thus judged all 49 sets represented by the 
Greco-Latin square. This procedure was replicated 
with a new choice of adjectives and seven additional 
subjects served in this replication. The same pro- 
cedure was employed for Condition (3H) 6L. 

For Conditions (3L)3H and (3H)3L, the unit of 
construction was a 4X4 Greco-Latin square since 
there were only four interpolation positions. As in 
Experiment I, each subject served in both condi- 
tions. Sets were constructed in blocks of eight, each 
block being formed from one unit square from each 
condition, There were four such blocks, and these 
were balanced over trials by a 4 X 4 Latin square. 


Randomization 


Subject to the above conditions, each step in the 
construction of the sequences of sets was independ- 
ently randomized whenever a choice was required. In 
particular, adjectives for each set were chosen at 
random, with replacement, so that each set was 
drawn from the same pool of adjectives. 


t RESULTS 


The results are shown in Figure 1 which 
plots the interpolation curves for each condi- 
tion. Each curve gives the mean response as a 
function of the ordinal position at which the 
probe stimulus was interpolated. 

The results are simply summarized. Disre- 
garding some apparent irregularity in Experi- 
ment I, the interpolation curves are essentially 
straight lines, and each shows a continuous 
primacy effect. The four upward sloping 
curves represent the LLL probe; the four 
downward sloping curves represent the HHH 
probe. Thus, the net influence of the probe 
stimulus on the overt response decreases 
linearly as a function of interpolation position. 

The primacy may be exhibited in more de- 
tail by considering the interpolation curve for 
Condition (3L)6H. The first two points on 
this curve correspond to Rows 1 and 2 of 
Table 1, The two set orders are identical in 
their last five positions; they differ in the 
presentation order of the first four adjectives. 
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EXP. 1 
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(3H)6L 


MEAN IMPRESSION RESPONSE 
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SUCCESSIVE INTERPOLATION POSITIONS 


Fic. 1, Mean impression response as a function of interpolation position of the probe stimulus. 
(Probe stimulus indicated by the notation (3L) or (3H) in the condition designations by each 


curve, See also Table 1.) 


Thus, the difference between the first two 
points on the curve arises because the first 
four adjectives at the first point are given in 
the order, LLLH, whereas the first four ad- 
jectives at the second point are given in the 
order, HLLL. Since the HLLL order produces 
the higher response, the first two points on the 
curve display a primacy effect. 

The same reasoning applies to the compari- 
son of the second and third points. In Rows 
2 and 3 of Table 1, it is seen that the first, 
and the last four, positions are identical. At 
the four intermediate positions, the second 
point is based on the order, LLLH, whereas 
the third point is based on the order, HLLL. 
Thus, the second and third points on the curve 
also display a primacy effect. 

This reasoning may be extended to the 
comparison of any two points on an interpo- 
lation curve. In fact, the scheme of Table 1 
may be considered as a generalized order 
effect paradigm. In either column, the first 
and last rows represent the ordinary, low-high 
versus high-low, order effect paradigm. Any 
two successive rows also embed an order 

effect paradigm of the form, LLLH versus 


O——O HHH INTERPOLATION 
O——oO LLL INTERPOLATION 


EXP. 2 
(3L)6H 


(3L)3H 


(SH)6L 
(3H)3L 
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HLLL. For example, the last two rows of 
Condition (3L)6H are identical in the first 
five positions, and differ in the stated way 
over the last four positions. Similarly, Rows 
and 6 are identical in their first and last post 
tions; the seven intermediate positions giv 
the comparison, LLLHHHH versus HHH 
LLL, 


The interpretation of these results will be 
resumed in the Discussion section. At th 
point, certain irregularities in Figure 1, and 
the statistical analysis, require attention. 


Statistical Analysis 


For the most part, the curves of Figure 1 spea 
for themselves. Accordingly, the statistical analyst 
will be mainly directed to certain methodological 
problems, and will be restricted to the data of E; 
periment II. 

To illustrate the method of analysis, consider first 
a given subject in Condition (3L)6H of Expeth 
ment II. The sets judged by this subject formed 
a 7X7 Greco-Latin square. The sums of squares fon 
Interpolation Position, Set, and Trial Block we! 


mean square was taken as an estimate of the within- 
subject variability for that subject although, 
course, it would be expected to overestimate that! 
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wariability somewhat. This procedure was followed 
for each subject; the residual mean square, averaged 
over subjects, is given in the last row of Table 2. 

Each Greco-Latin square was based on seven dif- 
ferent sets of adjectives, and seven subjects served 
in each square. These sets and subjects were consid- 
ered as a 7 X 7 factorial design, and this was ana- 
lyzed in the standard way to obtain the sums of 
squares for Sets, Subjects, and the Sets X Subjects 
interaction. These sums of squares were pooled over 
the four Greco-Latin squares, and resulting pooled 
mean squares are given in Table 2. Similar analyses 
‘were made for Conditions (3L)3H and (3H)3L. 

The statistical significance of the various sources 
in Table 2 is, of course, barely relevant. The impor- 
tant question concerns the magnitudes of the vari- 
abilities associated with the various sources. For 

. present purposes it suffices to assert, somewhat 
‘loosely, that the relative sizes of the mean squares of 
Table 2 indicate that Sets, Subjects, and Sets X Sub- 
jects are all substantial sources of variability. 

Two irregularities in Figure 1 may now be con- 
sidered. Comparison of the experiments shows that 
in certain cases corresponding curves have consider- 
ably different elevations. However, the corresponding 
curves are based on different sets and different sub- 
jects and their differences in elevations are reasonably 
attributable to Set and Subject variabilities. 

It will also be seen that the interpolation curves 
of Experiment I are considerably less regular than 
those of Experiment II, In both experiments, the 
Set and Subject variabilities are excluded from the 
between-point variability of each curve. The Set X 
Subject variability is also excluded in Experiment II, 


' but not, however, in Experiment I. Necessarily, 


therefore, the Experiment I curves are expected to 
be less regular than the curves of Experiment II. 

The above analysis, together with the general 
agreement in curve shape across the two experiments, 
lead to a useful recommendation about method. 
Previous experiments in this series, up to and in- 
cluding the present Experiment I, were designed so 
that each subject judged a given set of adjectives in 
dhe order only, This was done to avoid possible 
carry-over effects from previous judgments. 

In Experiment II, however, each subject judged a 
given set in all orders. This procedure seems to have 
yielded essentially the same results as in Experiment 
I, thus suggesting that memory of previous judg- 
ments may not be a serious problem in this task. The 
advantages of this procedure, in terms of data reli- 
ability, have already been pointed out. In addition, 
the design details tend to be simpler. This procedure 
is thus the more desirable, and the present results 


_ Sive it some justification, 


Discussion 
Primacy 


As has been noted, the interpolation curves 
of Figure 1 represent a generalized order effect 
paradigm. Any two points on a curve repre- 
sent an order effect comparison and, disregard- 
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TABLE 2 
Summary ANALysis OF VARIANCE, EXPERIMENT II 


Number adjectives in set 


Source Nine Six 
df MS df MS 
Sets (A) 24 | 23.84 21 | 15.86 
Subjects (B) 24 | 11.53 21 9.23 
AXB 144 4.01 147 3.37 
Within subjects 
Variability 840 0.87 288 1,29 


ing the irregularities in the data of Experiment 
I, all such comparisons show primacy. More- 
over, the primacy appears to be a straight-line 
function of the interpolation position of the 
probe stimulus. In other words, the net influ- 
ence of an adjective decreases linearly with its 
ordinal position in the set. 

For illustrative purposes in the remaining 
discussion, it will be helpful to consider the 
last two interpolation positions of Condition 
(3L)6H in detail. From Table 1, these are: 


IP,: HHHHH,,LLL,,H uJ 
IP;: HHHHH,;HpLp;LLps 


The p’s denote positions within the sets that 
will be referred to later. 

The response to IP; was higher than that to 
IPs. Thus, the order effect paradigm defined 
by the last four adjectives, LLLH versus 
HLLL, shows a primacy effect. Since both sets 
have the same first five adjectives, this pri- 
macy is not to be explained by any simple 
appeal to the importance of first impressions. 

This result, and more generally, the linear- 
ity of the interpolation curves, had not been 
expected initially. The rationale of that ex- 
pectation helps illuminate the data, and may 
be illustrated as follows. 

At Position ps in either set of Array 1, the 
subject has integrated five Hs. At Position ps 
in IP;, a sixth H has been integrated. By a 
law of diminishing returns, this sixth H should 
have relatively little effect. Thus, the state of 
the subject should be approximately the same 
at Position ps in IPs and Position pe in IP;. 
Three Ls follow either position. Hence the 
state of the subject should be approximately 
the same at Position ps in IP, and Position 
Po in IP. 
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At Position po in IP;, the subject responds. 
But at Position ps in IPe, there remains one 
more adjective to be integrated, and this is an 
H. This terminal H could be expected to have 
an appreciable effect on any of several 
grounds: it is fresher in the subject’s memory 
when the response is made since it is received 
last; its isolation from the other Hs by the 
intervening Ls could draw extra attention to 
it; it is to be integrated into an impression 
depressed by the just preceding Ls and so 
should be less subject to diminished return 
than the sixth H of IP;. 

If the terminal H in IP, did have an ap- 
preciable effect, and if the subject’s state were 
indeed about the same at Position ps in IP. 
and ps in IP;, then the response to IPs would 
be higher than to IP;. In other words, the 
comparison of Array 1 would show a recency 
effect. 

That primacy is in fact obtained shows, of 
course, that the above rationale is wrong. 
However, it is thought that this rationale 
helps set out the nature of the problem, posed 
by the data. Three interpretations of the data 
will now be taken up. 


Weighted-Average Model 


The data can be described by a weighted- 
average model, in which the response to any 
set is simply a weighted mean of the scale 
values of the adjectives of that set. The scale 
value would index the location of the adjective 
on a favorableness dimension; the weight 
would represent the influence or importance 
of the adjective in the total impression. 

Let A; denote the scale value of the adjec- 
tive in the Ath ordinal position in a given set, 
and let ws denote the weight. The equation 
for the response is then 


See L wrd 


[2] Equation 2 to obtain the algebraic expression 
E wr for the predicted response listed in Table 3.1 
TABLE 3 


PREDICTED Impressions FOR CONDITION (3L)3H FROM WEIGHTED-AVERAGE MODEL 
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In general, the subject may have an initial 
impression prior to presentation of the first | 
adjective. This possibility would be included 
in Equation 2 by letting Ao and wo denote th 
scale value and weight of the initial impres 
sion. Unless otherwise stated, however, it w 


the weights decrease linearly with ordinal po 
sition: 
w = a— (k— 1)b, 


where & is the ordinal position index. 
constants, @ and b, whose exact values aré nol 
of present concern, represent, respectively, the 
weight of the first adjective, and the changi 
in weight per ordinal position. For a set of 
adjectives, the weights for the successively 
presented adjectives would then be: 


a,a—b,a— 2b, 
a—3b,a—4b,a—5b. [4 
It will also be assumed that all H adjectives 
have the same scale value which, for preset 
convenience, will be denoted by H; that all L 
adjectives have the same scale value, L; and 
that all adjectives have the same weight excep 
as weight depends on ordinal position. Anal 
ogous simplifying assumptions will be made 
without explicit statement in the two othet 
interpretations as well. These simplifying as 
sumptions are appropriate here because of tht 
balancing in the design, and because only the 
shape of the interpolation curve will be coni 
sidered. 
This weighted-average model is applied tú 
Condition (3L)3H in Table 3. The weights 
are given by Array 4, and their sum is Swi 
= 6a — 15b. These weights are then used il 


Set Order Predicted response 
tedPr LLLHHH [aL+ (a—6)L+ (a—26)L+ (a—3b)H+ (a—4b) H+ (a—5b)H]/ (6a—15b) 
2. IP; HLLLHH [eH-+ (a—b)L+ (a—26)L+ (a—3b) L+ (a—4b) H+ (a—5b)H]/(6a— 15b) 
3. IP; HHLLLH [eH+ (a—b)H+ (a—2b)L+ (a—3b) L+ (a—4b) L+ (a—5b)H]/ (6a—15b) 
4, IPs HHHLLL [cH-+ (a—b)H-+ (a—26)H+ (a—3b)L+ (a—4b)L+ (a—5b)L]/(6a—15b) 


Note,—H and L denote scale values of H and L adjectives, respectively, 
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In Rows 1 and 2 of Table 3, the second, 
third, fifth, and sixth terms are the same in 
both expressions. These terms cancel when 
the difference is taken. The difference between 
the expressions in Rows 1 and 2 then simpli- 
fies to 3b(H — L)/(6a — 15b). The difference 
_ between the predicted response for any two 
consecutive interpolation positions in Table 3 
is found to be the same, 3b(H — L)/(6a— 
_ 155). Thus, the model implies that the dif- 
* ference between any two successive interpola- 

tion points in the curves of Figure 1 is a con- 

stant, or, in other words, that the interpolation 
curves are straight lines. A parallel argument 

. holds for each of the other experimental con- 

` ditions. In general, therefore, the weighted- 
average model, together with the assumption 
of Equation 3 concerning the change in 
weights, predicts that the interpolation curves 
are straight lines. 

It should be explicitly noted that the above 
analysis is ad hoc, and that the present data 
do not adequately test the model. However, it 
has at least been shown that the weighted- 
average model can account for the results. 

Moreover, once the model is stated explic- 
itly, it becomes apparent that it makes addi- 
tional predictions. There are 20 possible or- 
derings of three Ls and three Hs, and 84 
possible orderings of three Ls and six Hs. If 
the model is correct, then it will account for 
the response to all possible orderings, not just 
the few used here. These predictions can easily 
be derived in the manner illustrated in Table 
3. 

œ Of the various ways to test a model, 
parameter-free tests, which do not require 
estimation of the parameters of the model, 
are the simplest and most useful in many re- 
spects. The above prediction of straight-line 
interpolation curves is one such parameter- 
free test. Others may be obtained by appropri- 
ate selection among the possible presentation 
orders. For example, it is straightforward to 
show that HLHLLH and HLLHHL yield the 
same predicted response, and this prediction 
can, of course, be tested directly from the raw 
data. Similarly, the difference in response to 
HLHLHL and LHLHLH is 30(H —L)/ 
(6a — 15b), which is equal to the difference 
between successive points on the interpolation 
curves. Since these predictions depend on 
Equation 3, they may, of course, be specific 
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to the present task even though Equation 2 
held more. generally. 


Linear Model 


A second interpretation of the data can be 
made within the framework of a proportional- 
change, or linear, model that has been applied 
to opinion change (Anderson, 1959, 1964; 
Anderson & Hovland, 1957). The basic sup- 
position of this model, as applied to the pres- 
ent task, would be that the impression devel- 
ops step by step as each successive adjective 
is presented. More specifically, it is assumed 
that the impression produced by the Ath ad- 
jective of a set is given by 


Xp = Xua + Cy (Ar~ Xs). [5] 


Here Xx is the impression just before, and 
Xr the impression just after, presentation of 
the th adjective. A; denotes the scale value 
of the kth adjective, and cy, is here used 
to denote the proportionality coefficient, or 
change parameter. 

According to Equation 5, the change in im- 
pression produced by the kth adjective is 
cx(Ax — Xr). The amount of change is pro- 
portional to the distance, (Ar — Xi), be- 
tween the adjective presented and the just 
previous impression. The change parameter, 
Cr, is a measure of the influence of the kth 
adjective: the smaller is cz, the less is the 
change in the impression. 

The main conceptual difference between 
the weighted-average and the proportional- 
change models is that the weighted-average 
formulation considers only the final response, 
whereas the proportional-change model en- 
visages the final response as the end result of 
a step-by-step buildup of the impression. In 
the present task, of course, only the final re- 
sponse is actually observed. Formally, how- 
ever, Equation 5 may be applied to describe 
the presumed buildup of the impression. The 
proportional-change model could then account 
qualitatively for primacy by assuming that 
the change parameter showed a sufficiently 
large decrease over the successive adjectives 
(Anderson & Hovland, 1957). Here however, 
the question is whether this model is quanti- 
tatively consistent with the primacy effects 
of the generalized order effect paradigm. 

This analysis is not complicated, but will 
only be outlined here. By successive applica- 
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tion of Equation 5, the final impression can 
be expressed as a function of the initial im- 
pression (if any) and of the scale values of 
the adjectives of the set. This function is in 
fact a’weighted average, with the weights be- 
ing expressed in terms of the cx. In other 
words, the proportional-change model implies 
the weighted-average model and, since the 
latter is consistent with the data, so also is 
the former. 

This correspondence between the models 
can be pushed one step further. In general, it 
can be shown that 


[6] 


If the wx are known, then Equation 6 allows 
one to solve for the cx. 

Equation 6 thus opens up the interesting 
possibility of tracing out the step-by-step 
buildup of the impression even though only 
the final impression is observed. For the 
present data, with the w+ given by Equation 
3, the expression for the cx obtained from 
Equation 6 is 


Weick = w(i E Cr) Ch—1- 


i a— (k—1)b 
wo + ka — 3(k — 1)b] 


Ck 


[7] 


Change of Meaning Interpretation 


The last theoretical interpretation to be 
considered is one suggested by Asch (1946). 
In Asch’s formulation, the initial adjectives 
set up a “directed impression” that apparently 
is considered to change the effective meanings 
of the later adjectives. Striving for a uni- 
fied impression, the subject selects out those 
shades of meaning of the later adjectives that 
fit in with the directed impression established 
by the initial adjectives. 

For an HHHLLL set, then, the more favor- 
able shades of meaning of the Ls would be 
selected; for an LLLHHH set, the less favor- 
able shades of meaning of the Hs would be 
selected. Presumably, therefore, the impres- 
sion produced by HHHLLL would be more 
favorable than that produced by LLLHHH; 
that is, there would be a primacy effect. 

For the ordinary order effect paradigm, 
this qualitative reasoning may suffice. For the 

generalized order effect paradigm, a more 
explicit statement of the theory is necessary. 
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A possible attack could be made beginning 
with a simple (unweighted) average model 
(Anderson, 1962) in which the predicted re) 
sponse is just the mean of the effective scale 
values of the adjectives. 
To illustrate how such a model might be 
applied, consider Array 1. Assume first : 
each of the first several Hs has the same scale 
value, H. Next, the directed impression pro- 
duced by these Hs will effectively shift the 
Ls toward more favorable meanings, thus) 
increasing their effective scale values. 


of the terminal H in IP,. Since the directed 
impression set up by the preceding five Hs 
and three Ls will decrease the effective scale 
value of this terminal H, H’ is less than Hi 
The response to IP¢ is then just the mean of 
(SH + 3L' + H’); the response to IP; is they 
mean of (6H + 3L’). Since H’ is less tha 
H, the predicted response to IP, is less than 
that to IP;. 

With the given assumptions, therefore, the) 
model could account for the primacy over the 
last two interpolation points. However, con: 
siderably more work would be needed to pu 
the model into a realistic and testable forn 
Tn particular, the amount of shift in meaning 
of a given adjective will depend, presumably, 
both on the solidity and on the scale valu 
of the impression set up by the preceding ad- 
jectives. Consequently, the change in scale 
value of the first interpolated L will depend 
on the number of preceding Hs; and the three 
Ls would themselves suffer unequal changes in | 
scale value. A complete analysis will evidently | 
not be simple, but it is questionable whethet 
any reasonable set of assumptions would lead) 
to a prediction of linearity in the interpolation 
curves. } 

This shift of meaning interpretation willl 
not be pursued further here, but it should be) 
noted that other treatments of this interpreta- 
tion are possible, and that Asch favors 4 
rather different orientation (Asch, 1946, pp: 
286-287). Nevertheless, the question raised 
by Asch, whether the meanings of the words | 
change in context, is important in any analysis 
of impression formation. 
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« Comparisons and Comments 


A basic difference between the foregoing 
interpretations is in the origin ascribed to the 
primacy effect. The shift of meaning expla- 
nation attributes primacy to changes in scale 
value, whereas the other two explanations at- 
‘tribute primacy to changes in weight or im- 
portance of the later adjectives. These two, 
the weighted-average model and the propor- 


`- tional-change model, have been shown to be 


consistent with the data, but whether the 
same can be said of the shift of meaning 
explanation is problematical. 

There is, moreover, independent evidence to 


pi support the change in weight interpretation. 


If the subjects recall the adjectives, in addi- 
tion to giving the impression response, pri- 
macy disappears, and a recency effect may 
take its place (Anderson & Hubert, 1963). 
This result was taken to support the idea 
that the primacy obtained in the standard 
version of the task results from decreased 
attention to, and hence lower weights for, the 
later adjectives, Further support for this 
idea is given by the results of Stewart 
(1965) who requested the subject to respond 
(cumulatively) after each successive adjective 
of the set. This procedure, which presumably 
would tend to equalize attention to all the 
adjectives, also eliminated primacy or induced 
recency. These results decrease the force of 
Asch’s (1946) argument that primacy implies 
shift of meaning. 

At the same time, there is evidence that 
‘argues more directly against the shift in 
meaning explanation. Primacy is obtained in 
an impression of meals task, otherwise similar 
to that used here, and this primacy seems 
difficult to explain in terms of shift of mean- 
ing (Anderson & Norman, 1964), Moreover, 
some success has been obtained with a simple 
average model in the personality adjective 
task with simultaneous presentation (Ander- 
son, 1962) although this model would be 
incorrect if the adjectives changed meaning 
from one context to another. 

Present evidence, therefore, indicates that 
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the weighted-average model and the propor- 
tional-change model formulations are consid- 
erably more satisfactory than the change in 
meaning interpretation. This conclusion is 
provisional, of course, and certainly cómmon 
sense indicates that context will affect mean- 
ing, at least in certain cases. Nevertheless, 
there is as yet no satisfactory evidence for 
such contextual shifts of meaning in impres- 
sion formation (Anderson & Norman, 1964). 

If indeed primacy is caused by decreases 
in the weights of the later adjectives, the 
problem then arises of explaining this weight 
decrease. The attention decrement explana- 
tion, although it receives some support from 
the cited data, is not entirely satisfying and 
may itself stand in need of further analysis. 
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COMMITMENT TO EXPOSURE AS A DETERMINANT 
OF INFORMATION RECEPTIVITY * 


k TIMOTHY C. BROCK * 
Ohio State University 


College males, 82 regular smokers and 90 nonsmokers, indicated their interest 
in reading an article which denied or asserted a relationship between smoking 
and lung cancer. Degree of commitment to exposure affected information 
receptivity but, even when smokers expected immediate self-exposure to the 
information, they did not avoid the message linking cancer to smoking, thus 
stringently reproducing a result from prior studies by Feather. Dissonance 
theory’s core assumption, that persons actively avoid information which would 
likely increase dissonance, was reappraised and a modification was proposed. 


Selective self-exposure by prospective re- 
cipients, the cardinal “fact of life” in com- 
munication processes, exhibits a disquieting 
lack of fit between theory and evidence. The 
controversy concerns the assumption that 
persons will actively avoid information which 
opposes their beliefs or behavior. Although 
published verifications are few, the avoidance 
assumption is shared by many writers (Cooper 
& Jahoda, 1947; Festinger, 1957; Hovland, 
1959; Hyman & Sheatsley, 1947; Janis, 1957; 
Klapper, 1949, 1960; Lazarsfeld, Berelson, & 
Gaudet, 1944; Lipset, Lazarsfeld, Barton, & 
Linz, 1954; Maccoby, Romney, Adams, & 
Maccoby, 1959; McGuire & Papageorgis, 
1961). 


Recent experimental? studies have been 


1 Supported by grants from the National Science 
Foundation (GS-343) and the Alumni Achievement 
Fund of Iowa State University. I thank N. T. 
Feather and W. J. McGuire for criticisms and sug- 
gestions, 

2 The data collection was completed in May 1963 
_while the author was at Towa State University. 

3 Relevant correlational studies are typically equiv- 
ocal. Klapper (1949) wrote: 


that persons will deliberately or unconsciously 
avoid material which they know questions their 
existing opinions has been so clearly demonstrated 
as to be now axiomatic in the literature on com- 
munication [p. 29]. 


Klapper’s “axiom” was based primarily on the clas- 
sic study of 1940 voting behavior by Lazarsfeld et al. 
(1944), However, Klapper (1960) has noted that 


approximately one third of the “constant parti- 
sans” [in the 1940 campaign] did not practice 
selective exposure and about one fifth were in fact 
exposed more to communication opposing their 
party position [p: 25]. 
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ordered to two derivations from dissonan 
theory (Festinger, 1957): 


If a source of information is viewed as potentially 
decreasing dissonance or providing new elements 
consonant with (their) behavior, persons sho 

expose themselves to this source. If the source of in- 
formation is viewed as potentially increasing di 
sonance, there should be active avoidance of ex- 
posure to the information [p. 163]. 


With the exception of a laboratory experiment 
by Festinger (1957, pp. 162-176) and i 
replication by Cohen, Brehm, and Latané 
(1959), the bulk of the evidence has not sup- 
ported the second derivation concerning avoid- 
ance of dissonance-increasing information 
(Adams, 1961; Ehrlich, Guttman, Schönbach; 
& Mills, 1957; Feather, 1962, 1963; Mills, 
Aronson, & Robinson, 1959; Rosen, 1961): 
Further failures might be seen in studies 


Preferential exposure to Propaganda from one’s ow! 
side was not reobtained when Lazarsfeld and Berel- 
son (Berelson, Lazarsfeld, & McPhee, 1954) fol- 
lowed the 1948 campaign: 


avoided material actually discussed or was suspected 


by the respondents of discussing the cancer-smoking 
theme. 


INFORMATION 


° | (Brodbeck, 1956; Festinger & Thibaut, 1951; 

Gerard, 1953; Steiner, 1962). 

Repeated failures to demonstrate avoidance 
of dissonant information suggest acknowledg- 
ing that Festinger’s theory is wrong in this re- 
spect. However, since the avoidance assump- 

. tion was announced by Festinger (1957, p. 3) 

ds the second of his two “basic hypotheses,” a 

more constructive alternative was sought in 
. , Brehm and Cohen’s (1962) recent revision of 
dissonance theory. Central to their revision is 
an insistence that commitment is a necessary 
condition for the arousal of dissonance and 
that derivations from the theory are equivo- 
_ cal where commitment is not clearly specified 
by the experimental procedure. Brehm and 
Cohen usually applied the “commitment test” 
to the independent variable by asking whether 
a person has clearly and/or actively chosen 
one alternative (behavior, belief, object) and 
rejected another. The commitment require- 
ment can be extended to the dependent varia- 
ble by certifying that the subject’s response 
implied a commitment to subsequent behavior 
or action (Feather, 1959). What degree of 
commitment to subsequent exposure was felt 
by the subjects as they indicated preferences 
for information? 

Only in the Ehrlich et al. (1957) study was 
commitment to exposure entirely unambig- 
uous: subjects indicated preferences for com- 
munications which they thought they would 
immediately read. Commitment to exposure 
was absent in Feather’s (1962, 1963) studies 
and its presence was at least doubtful in 
“Adams’ (1961) investigation. In the latter 
study “very few” (p. 76) subjects attended 
“talks” although 89% expressed “some de- 
sire” to attend “in about 2 weeks.” In the 
Mills et al. (1959) and Rosen (1961) studies 
it was questionable that the subjects seriously 
expected to read articles from scientific jour- 
nals about objective and essay examinations, 
the alternatives in a prior decision. Doubt 
„about commitment to exposure may be raised 
for these studies chiefly because the major 
manipulation, although seemingly powerful, 
did not elicit differential self-involvement. 
Some subjects were told their chosen exam 
would count 70% and others, 5%, toward 
their grade. Since this information was some- 
how discounted (there was no effect on the 
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dependent variable) perhaps the preference 
ratings of the articles were also treated “hypo- 
thetically” by the subjects. At any rate, clari- 
fication could be achieved by varying com- 
mitment to exposure within the same experi- 
mental context and examining the effects on 
receptivity to dissonant and consonant infor- 
mation. Consequently Feather’s (1962, 1963) 
method was replicated together with manipu- 
lation of commitment to exposure. The hy- 
potheses followed from dissonance theory and 
were the same as Feather’s (1963, p. 157); 
smokers, more than nonsmokers, will seek out 
information denying a relationship between 
smoking and lung cancer and will avoid in- 
formation asserting that relationship. 


METHOD 
Design 


There were two types of dependent variables in a 
2X2X2 factorial arrangement: preference ranking 
of magazine-article titles and measures of attitudes. 
The independent variables were: a classification of 
subjects into smokers and nonsmokers; the stand 
taken for, or against, the smoking-cancer link as 
revealed by the critical title, in a list of 13 titles; 
and the variation of commitment to exposure. Order 
of appearance of the critical title, fifth or tenth, in 
the list was balanced in each of the eight cells. 


Procedure 


Exposure manipulation. As in Feather (1963) the 
instruments were administered in classrooms.* In 
the no-exposure condition, the experimenter entered 
and distributed the following printed form. 


Iowa State University Reading Survey’ 


Titles of several magazine articles appear below. 
Indicate your preferences among these articles by 
ranking the titles in their order of interest to you. 
Write the number “1” next to the title that you 
would be most interested in reading today, BL. 
next to the title that would be your second choice, 
and so on. Write “13” next to the title that you 
would be least interested in reading today. 


In the exposure condition the experimenter made his 
entrance conspicuously encumbered by a load of 
magazines of the same dimensions as Saturday Eve- 
ning Post. Six-inch white tab slips protruded promi- 
nently from each magazine. The experimenter placed 
his burden on a table in front of the room in such a 
way that a large pile of magazines was clearly dis- 
played to all subjects. The following printed form 
was then distributed. 


4I am indebted to N. T. Feather for generously 
providing materials, raw data, and further informa- 
tion about his studies (1962, 1963). 
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TABLE 1 
MEAN AcE, MEAN TIME OF SMOKING, AND MEAN NuMBER OF CIGARETTES SMOKED PER Day 


No exposure 
: Smokers Nonsmokers Smokers Nonsmokers 

Caper | Nolin | Cancer | Nolnk | Cancer | Notink | Cancer 
20 24 23 18 20 18 24 25g 

Mean age (years) 20.30 22.17 22.28 20.16 20.05 20.11 20.21 20.88 

iod smoked 
Gk ae 3.39 3.25 3.20 3.58 
Mean number ciga- 
ee per fat 17.30 19.75 17.70 18.83 


Note.—"Cancer link” identifies title “Smoking Leads to Lung Cancer”; “No link" identifies “Smoking Does Not Lead t 


Lung Cancer," 


Towa State University Reading Survey 


Students read magazine articles about topics in 
which they are very interested. The articles are 
read in classrooms so that group reading time and 
reading enjoyment scores can be obtained. Titles 
of several magazine articles appear below. After 
indicating your preferences among these articles, 
you will be handed a magazine by a survey as- 
sistant. The magazine given to you will contain 
one article which was one of your highest pref- 
erences. [All articles are the same standard length.] 


The remaining instructions were the same as in the 
no-exposure condition except that “is” and “are,” 
where appropriate, were substituted for “would be.” 
Ranking of titles. The critical titles were Feather’s 
(1962, 1963). The list of titles ranked by the subject 
included either Smoking Leads to Lung Cancer or 
Smoking Does Not Lead to Lung Cancer. The two 
lists and the order of appearance of the critical title 
on the list, fifth or tenth, were alternated before the 
forms were distributed. Several of Feather’s titles 
were replaced because they seemed inappropriate, 
for example, The White Australia Policy, Who Will 
be Governor-General? The list, not including the 
critical titles, was Cure of Illness by Hypnotism, 
Sex and the Beatniks, How to Lose Weight, How to 
Rise Above Fear, Advertising Abuses the Public, 
Religion and Life, The Way We Look to Foreign 
Tourists, Living With Your Neighbors, Memorizing 
—A Useful Art, Culture’s Bargain Basement, Relax- 
ation and Reading, Tax Rebates for Students. 
Questionnaire. The inventory which followed the 
ranking included Feather’s (1962, 1963, p. 158) atti- 
tude items. The first item, “smoking leads to lung 
cancer only when the person is a heavy smoker,” 
was scored from 1 (strongly disagree) to 5 (strongly 
agree), The second item concerned the convincingness 
of “the evidence that smoking leads to lung cancer” 
and it was scored from 1 (not convincing at all) to 
5 (highly convincing). The remainder of the inven- 
tory consisted of items measuring attitude toward 
smoking and the magnitude, duration, and continuity 


of the subject’s smoking habit. After the question 
naires were collected, the exposure subjects were told 
that the articles would be read at a later session “$ 
that enough magazines can be assembled to giv 
everyone his first choice.” When the data collection 
was completed, instructors gave the subjects a brie 
explanation of the study. 


Sample Selection, Characteristics, and Repre- 
sentativeness 


The procedure was administered to 274 underclas 
men of whom 172 could be classified either as non 
smokers (N = 90), persons who reported they nevét 
started to smoke, or smokers (N = 82), persons 
who, for at least a year, smoked a minimum of 10 
cigarettes per day and had not attempted to stop) 
smoking in the previous 5 months. Ninety-eight sub 
jects were unassignable to these groups and four 
other subjects were discarded for not following the 
instructions. Table 1 shows the mean ages, mea 
period smoked, and mean number of cigarett 
smoked per day for smokers and nonsmokers rating 
cancer link (Smoking Leads to Lung Cancer) or n0 
link (Smoking Does Not Lead to Lung Cancer) un 
der exposure and no-exposure treatments. 


a probability sample (total 
N =1986) of the United States adult population obs 
tained in January and February of 1962.5 Five 

ikert labels (strongly agree, agree, etc.) accompa 
nied each item which was scored so that 1 indicat a 
the most favorable feeling toward smoking and 5) 
the least. The differences between smokers and non- 
smokers’ mean scores on each item were computed) 


® These data and permission to reproduce them 
were kindly supplied by Daniel Horn, 
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TABLE 2 


MEAN Scores on ÅTTITUDE Items RELEVANT TO SMOKING: PRESENT SAMPLE AND A 
NATIONAL PROBABILITY SAMPLE (MALES) 


Smokers Nonsmokers 
Item = 
Towa State National Towa State National 
“There is nothing wrong with smoking as long as a 
person smokes moderately 2.13 2.31 3.30 3.33 
Smoking is.a dirty habit 3.18 3.16 3.95 3.92 
_ There is nothing wrong with smoking 2.61 3.29 3.47 3.94 
* I hope my children never smoke 3.89 4.04 4.39 4.48 
Smoking costs more than the pleasure is worth 3.29 3.66 4.29 4.45 


Note.—Scores range 1-5. Low score indicates favorable attitude toward smoking. 


. correlation. In both samples, smokers and nonsmokers 
* differed least on the issue of children smoking and 
differed most regarding the statement that there is 
nothing wrong with moderate smoking. Hence, the 
present sample was representative of the national 
population on a relevant attitude dimension. 


RESULTS 


The effectiveness of the exposure—no-expo- 
sure variation was assessed by interviewing 20 
randomly selected exposure subjects and 10 
no-exposure subjects. It was clear that all 
exposure subjects expected to read articles, at 
least at the time they were ranking the titles. 
Each of the no-exposure subjects rejected the 
suggestion that “you may have had an ink- 
ling while ranking the titles that they referred 
to articles you might be asked actually to 
read,” List position of the critical title, 
whether fifth or tenth, was involved exclusively 
in main and interaction effects yielding Fs 
‘Yess than unity. 


Ranking of the Critical Titles 


The principal data for the present sample 
are given in Row 1 of Table 3. The eight cells 
have unequal frequencies and therefore an 
unweighted-means analysis was employed 
(Walker & Lev, 1953, pp. 381-382; Winer, 
1962, pp. 222-224).° The triple interaction of 
borderline significance (F = 3.30, p < .075) 
was clarified by computing separate analyses 
of variance for the exposure and no-exposure 
groups (see Table 4). Under no exposure 
there were no reliable effects, all Fs less than 
unity. Under exposure the main effect of 
smokers versus nonsmokers was reliable 
(p < .001) as well as the interaction of this 
variable with cancer link versus no link 
(p < 01). These analyses showed that under 
exposure smokers preferred no link more than 


6 The method of unweighted means was used in 
the analyses of variance in the present study; for 
each reported analysis of variance, a Bartlett’s test 
produced a chi-square with p > .05. 


TABLE 3 
MEAN RANKING OF CRITICAL TITLES 


No exposure Exposure 
Smokers Nonsmokers Smokers Nonsmokers 
4 Cancer | Notink | Cig" | No link Cancer | Notink | Cancer | No link 
1. Present sample 7.50 7.08 7.43 744, 6.05 3.44 6.96 8.72 
2. Feather I, Males 
: eda: 37) 4,91(23) | 6.00(30) | 8.31(26) 7.00 (28) 
. Feather IT, 
(1963, p. 180) (6.64022) | 6.4707) | 9.41(22) | 9.06017) 


Note.—The smaller the score, the greater the interest in reading the article entitled cancer link (“Smoking Leads to Lung 


Cancer”) or no link (“Smoking Does Not Lead to Lung Cancer’’). 


1,0-13.0. 


Cell Ws for Feather's data are given in parentheses. Scores range 
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TABLE 4 


ANALYSIS OF VARIANCE OF RANKINGS 
IN PRESENT SAMPLE 


Source df | MS F 

Smokers versus 

nonsmokers (A) 1| 5.249] 8.92*** 
Cancer link versus 

no link (B) 1 | 0.198 | <1 
Exposure versus 

no exposure (C) 1|2.290| 3.89 
AXB 1|2.880| 4.89* 
AXC 1|4.351| 7.40** 
BXC 1 | 0.024 | <1 
AXBXC 1|1.940| 3.30 
Error 164 | 0.588 

* 
p Sor 
eK D < 001. 


cancer link while nonsmokers did the opposite, 
that is, preferred cancer link to no link. 
Hence, differential ranking of the articles was 
not obtained unless the subjects expected they 
would have to read them. 

The first hypothesis, smokers prefer no link 
more than nonsmokers, was supported under 
exposure and under no exposure, but in the 
latter treatment the difference was unreliable. 
The second hypothesis implied that smokers 
would avoid cancer link more than non- 
smokers. A corrollary expectation was that 
avoidance of cancer link by smokers would be 
greater under exposure than under no ex- 
posure. Both predictions were unsupported. 
Smokers did not display less interest than 
nonsmokers in cancer link and, under ex- 
posure, the direction of the difference, though 
unreliable, was contrary to the hypothesis. 
Thus, analysis of the data in Row 1 of Table 
3 produced two clear findings. The exposure 
variation had a powerful effect, particularly on 
the attractiveness of the no-link message for 
smokers; under both treatments the data 
failed to show the theoretically postulated 
avoidance of discrepant information. 


Comparison of Present Data with Feather’s 
Data 


The no-exposure data were compared with 
results from Feather I (1962) and Feather II 
(1963), Rows 2 and 3 of Table 3. Both 
analyses of variance produced a reliable first- 
order interaction (Fs = 4.51, 5.40, p< .05, 
df = 1/184, 1/155). The interaction reflected 


the presence of a smokers versus nonsmokers 
main effect in Feather’s data and its absence 
in the present no-exposure group. When each’ 
no-exposure mean was compared with it 
counterpart in Feather’s data, only one of the 
eight comparisons yielded a two-tailed 
p<.05: present smokers preferred cancer 
link less than Feather’s 1962 males. Hence 
the present no-exposure data did not differ 
greatly from Feather’s results. 

The exposure data were compared with the 
results from Feather I and II. Both analyses 
of variance showed, in addition to a reliable 
main effect for smokers versus nonsmokers 
($ < .001), a significant second-order inter 
action (Fs = 11.17, 4.57, p< .01 and 05 
df=1/186, 1/157). Two effects, present 
under exposure but not in Feather’s data, con 
tributed most heavily to the triple interaction: 
the heightened valence of the no-link message 
for exposure smokers; the greater interes! 
shown by exposure nonsmokers in cancer link 
than in no link. 

In sum, comparisons of the present data 
with Feather’s results showed again that de 
gree of commitment to exposure affects in 
formation receptivity. With respect to ag 
hypotheses, all the data in Table 3 agreed in 
showing at least trends which supported the 
first hypothesis. The second hypothesis, 
avoidance of discrepant cognition, was unsup 
ported across the board. 


Attitudes toward the Smoking-Cancer Link 


Attitude mean scores for the present sam 
ple and for Feather’s (1962, 1963) studies 
are presented in Table 5. Analysis of variance 
of scores for Item 1 yielded a single reliable 
effect: under exposure there was greater agree 
ment by nonsmokers than by smokers (F =) 
4.70, p< 05). Nonsmokers’ restriction of 
lung cancer to heavy smokers was puzzling” 
and, indeed, Feather’s data (Rows 2 and 3 of. 
Table 5) were also unamenable to a clea 


ward. For Item 2, the only significant effect 
was in accord with dissonance theory and with 
the previous findings by Festinger (1957, P: 
155) and Feather (1962, 1963): smokers 
found the evidence for the smoking-cancer link 
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Fe TABLE 5 
Mean SCORES ror Two ATTITUDE ITeMs FOR PRESENT SAMPLE AND FEATHER’s SAMPLES 
No exposure Exposure 
i Smokers Nonsmokers Smokers Nonsmokers 
Cancer | Notink | Cancer | Nolnk | CARE | Notink | Cag | No link 
Item 1 
. . 1, Present sample | 3.00(20) | 3.08(24) | 3.00(23) | 3.47(18) | 3.06(18) | 3.00(18) | 3.46(24) | 3.44(25) 
2. Feather I, Males| 3.02 (53) 3.19 (54) 
3. Feather II 3.14(22) | 2.82(17) | 3.09(22) | 3.29(17) 
Item 2 
4. Present sample | 2.10(20) | 2.42(24) | 3.13(23) | 2.00(18) | 2.65(17) | 2.31(16) | 3.00(22) | 2.46(24) 
5. Feather I, Males| 2.68 (53)* 3.19 (54) 
6, Feather II 2.23 (22) | 2.82(17) | 3.18(22) | 3.24 (17) 


Note.—For Item 1, high score indicates agreement “that smoking leads to lung cancer only when a person is a heavy smoker"; 
for Item 2, high score indicates person is convinced by ‘‘the evidence that smoking leads to lung cancer.” Possible score range is 
1.0-5.0. Two smokers did not respond to Item 1. Five smokers and three nonsmokers did not respond to Item 2. Cell Ns in paren- 


theses. 
a Combined data for cancer-link and no-link conditions. 


less convincing than did nonsmokers (F = 
10.91, p < .005). 


Relationships between Rankings of the Criti- 
cal Titles, Attitudes toward the Smoking- 
Cancer Link, and Smoking Behavior 


Internal correlational analyses were per- 
formed to assess four explanations for the 
. lack of avoidance of the cancer-link message 
by smokers. In terms of the first possibility, a 
portion of the present smokers may not have 
been fully committed to the smoking habit‘ 
and hence their receptivity to cancer link re- 
flected predecisional nonselectivity. A second 
possibility was that a portion of the present 
smokers had been smoking for so long and/or 
so heavily that they could expose themselves 
to cancer link without increasing psychic 
malaise. Present smokers were split at the 
median into those who smoked more than 2.5 
years and less than 2.5 years; those who 
smoked an average of more than 20 cigarettes 
per day and less than 20 cigarettes a day. 
Those smoking more and longer were less con- 
vinced of the evidence for the smoking-cancer 
. link (second attitude item) but the differences 
were unreliable; #’s were 1.42 and 1.40. The 
first possibility implied that preference for 
cancer link would decrease as smoking dura- 


TThere was less similar reason for doubt about 
commitment to smoking by Feather’s (1963) sub- 
jects because they had been smoking for at least 9 
years, 


tion and consumption increased. The second 
possibility implied a direct relationship be- 
tween smoking duration and consumption, on 
the one hand, and preference for cancer link, 
on the other. Length of time of smoking and 
mean daily consumption of cigarettes were 
reliably correlated with each other (7’s from 
51 to .55, p< .05) but were unrelated to 
ranking of both critical titles (7’s less than 
30). 

Another test of predecisional nonselectivity 
was made by comparing present no-exposure 
subjects with Feather’s older smokers. If pres- 
ent smokers, whose habit was only 3 years old, 
were, for that reason, still receptive to any 
pertinent information, then their interest in 
the critical titles should be greater than per- 
sons who have been smoking for 14 years 
(Feather’s, 1963, subjects). However, Rows 
1 and 3 of Table 3 show an opposite trend: 
more interest by smokers with the longer com- 
mitment to the habit. 

A third account stems from the reasonable 
assumption that many smokers are afraid 
because dangers of smoking are now widely 
publicized. Reading frightening material could 
justify smokers’ fear (Feather, 1963, p. 161; 
Festinger, 1957, p. 149). Finally, it is possible 
that most smokers are beset by extreme dis- 
sonance and will seize any means of reducing 
it, including changing their behavior. As 
Festinger (1957) suggested: 
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If the dissonance is so large that it is almost suffi- 
cient to overcome the resistance to changing the 
behavior, one may expect that the easiest way to 
eliminate the dissonance is temporarily to increase it 
sufficiently so as to change the behavior. Under these 
circumstances one would expect persons to expose 
themselves to dissonance-increasing information [p. 
163]. 


Comparisons pertinent to the above ac- 
counts were sought in the data of 36 subjects 
who were previously excluded because they 
quit smoking. Both the “fear” and “extreme 
dissonance” views predict that quitters would 
relish reading cancer link, more than no link, 
because the former title supported their de- 
cision. For both exposure groups, the dif- 
ferences were in the same direction and con- 
trary to the prediction, but not reliably so. 
Combined quitters mean ranking of cancer 
link was 8.30 and no link, 6.40. 

A second set of comparisons involved ap- 
proximate median splits of smokers into those 
who reported one or more attempts to stop 
smoking (prior to 5 months before the data 
collection) and those who never tried to stop. 
If fear and/or extreme dissonance were more 
characteristic of persons who attempted to 
stop smoking than of persons never so at- 
tempting, then the persons who attempted to 
stop smoking might prefer cancer link more 
than the persons who never tried to quit, 
while the latter group should prefer no link 
more than the former. For both exposure 
treatments the obtained differences were un- 
reliably opposed to the first prediction and 
unreliably in support of the second. 

In sum, internal analyses did not provide 
any clear evidence that smokers were deterred 
from avoidance of “Smoking Leads to Lung 
Cancer” by predecisional nonselectivity, deep 
commitment to the habit, fear, or extreme 
dissonance. 


Discussion 


The two principal results were a demonstra- 
tion that degree of commitment to exposure 
influences information receptivity and a fail- 
ure to support the hypothesized avoidance of 
dissonance-increasing information. The results 
showed that when smokers expected to ex- 
pose themselves to communications, informa- 
tion denying the link between smoking and 
cancer was much preferred in comparison to 


information asserting a smoking-cancer link. ` 


Differential preference for the cancer-link and 
no-link messages, however, was not obtained 
in three separate data collections (present 
study; Feather, 1962, 1963) in which the 
subjects did not expect to expose themselves. 
The effectiveness of the exposure variation in 
influencing smokers’ interest in no link was 
expected to carry over to smokers’ avoidance 
of the cancer-link message. Lack of evidence 
for such avoidance here, and in most previous 
tests, makes it necessary to reevaluate the 
avoidance postulate, per se, and as a com- 
ponent of dissonance theory. 

Brehm (1962, p. 93) has suggested that, 
immediately after a decision, discrepant in- 
formation may be highly salient. Although 
differential salience may account for findings 
in recall experiments, salience seemed less 
pertinent when recipients control their future 
exposure. Information receptivity in retro- 
spect differs from receptivity in prospect: in 
the former, avoidance or suppression of dis- 
sonant cognition has been satisfactorily docu- 
mented (Brehm, 1962; Buss & Brock, 1963; 
Levine & Murphy, 1943; Wallen, 1942). 

Akin to salience in its possible effect on 
receptivity may be the novelty value of in- 


formation. College-level subjects might be ` 


receptive to discrepant information (Feather, 
1962, p. 62) because their curiosity and/or 
exploratory appetites were whetted. One may 
safely say, however, that the no-link message 
should have had the greater novelty value. 
Novelty, of course, could not account for the 
Ehrlich et al. (1957) results. Steiner (1962,"p. 
267) has suggested that persons who are free 
to change their behavior will be especially re- 
ceptive to adverse information. The previously 
described data from quitters, and from those 
attempting to stop smoking, contain trends 
opposed to the outcomes implied by Steiner. 

According to Mills et al. (1959, p. 262) 
and Feather (1963, p. 162) persons seek in- 
formation about their chosen alternatives “ir- 


respective of whether the information sup- ` 


ports or does not support” the choice. For 
their data, Mills et al. suggested that recep- 
tivity to personally relevant information can- 
celed the tendency to avoid discrepant infor- 
mation. This interpretation could not account 
for portions of the data in Table 3, For smok- 
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“ws, both tendencies (seek relevant informa- 
tion, seek decision support) worked in the 
same direction for no link and in opposed 
directions for cancer link. Hence, smokers 
should have preferred no link more than can- 


e cer link. For nonsmokers, the consonant can- 


cer-link messages should have been preferred 
to no link. Both deductions from the Mills 
et al. account were uniformly unsupported in 
the three no-exposure samples. Under ex- 
posure, the direction of the difference between 
smokers and nonsmokers’ rating of cancer link 
was also incompatible with Mills et al., unless 
the assumption could be defended that the 
critical titles were perceived as less relevant 
by nonsmokers than smokers. This assumption 
was at the core of Feather’s (1963, p. 162) 
interpretation. However, to one familiar with 
contemporary student subcultures, not smok- 
ing seems to be just as clearly a “course of 
action,” involving effortful commitment, as 
does smoking. The male nonsmoker has com- 
mitted himself to not smoking in spite of con- 
trary social pressure, for example, freshman 
year “rushing” parties; the majority of his 
associates and friends smoke and their wel- 
fare, and hazards, are not matters of indif- 
ference to him; lung cancer and health are 
of interest to him. Since it was by no means 
obvious that the cancer-link message could be 
regarded with indifference by nonsmokers, 
Feather’s explanation was also not completely 
satisfying. 

If the avoidance postulate were merely a 
peripheral assumption, doubts about its va- 
lidity would be only slightly disturbing. How- 
ever, the avoidance of “situations and infor- 
mation which would likely increase dis- 
sonance” was one of Festinger’s (1957, p. 3) 
two “basic hypotheses.” It is true that when 
evaluation or attitude has been the dependent 
measure, much support for dissonance theory 
has been obtained. Nevertheless, for Festinger 
and other dissonance theorists (Brehm & Co- 
hen, 1962) “selective exposure is a more di- 
‘rect reflection of avoidance of dissonant cog- 


. nitions than attitude change [p. 93].” Revi- 


sion of the avoidance postulate was indicated, 
a revision which would encompass not only 
the failures to support the postulate but the 
confirmations as well (Cohen et al., 1959; 
Festinger, 1957, pp. 162-176). 


The reformulation stresses several aspects 
of postdecision behavior and the drive char- 
acter of dissonance. Following a decision, a 
person will always attempt to marshal cog- 
nition justifying his chosen alternative. -The 
first assumption concerns the sources of justi- 
fying cognitions. If a great many of the 
person’s prior experiences were relevant to 
the decision or, if the alternatives were very 
familiar, then he would have many sources 
from which to draw cognition justifying his 
choice. On the other hand, if few of his prior 
experiences could be associated to the decision 
and/or if the alternatives were quite novel, 
then, necessarily, fewer justifying cognitions 
would be available. The possession of justify- 
ing cognition is crucial when a person en- 
counters information opposing his choice. The 
justification-impoverished person may then try 
to evade further exposure but not necessarily 
the person whose choice has been bolstered 
(Kendall & Wolf, 1949, p. 178). The latter 
individual might even seek out discrepant 
information, depending on the validity of a 
second assumption. To the extent that dis- 
sonance shares the drive properties of hunger, 
sex, etc., dissonance reduction may be ac- 
companied by positive affect. To experience 
the positive affect accompanying dissonance 
reduction it may be necessary first to expose 
oneself to dissonance-increasing stimulation. 
Persons with ample resources for achieving 
dissonance reduction, that is, those able to 
marshal much postchoice justification, might 
actively seek out dissonance-increasing infor- 
mation in order to reduce the dissonance thus 
aroused and obtain the concomitant gratifica- 
tion. Few other motives are usually satisfied 
via so little energetic commerce with the en- 
vironment. 

The preceding suggestions can be brought 
together in a tentative reformulation: (a) 
persons will expose themselves to cognition 
dissonant with their choice (of behavior, 
opinion, object, etc.) in direct proportion to 
the number and importance of prior cogni- 
tions ‘consonant with the choice; (b) the 
gratification of dissonance reduction is propor- 
tional to the magnitude of dissonance arousal; 
(c) when dissonant cognition is not avoided, 
receptivity to it will be proportional to the 
magnitude of the anticipated discrepancy. 
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The proposed reformulation has not yet re- 
ceived direct tests. The first proposition, a, 
calls attention to a difference between Fest- 
inger’s (1957, pp. 162-176) “gambling” ex- 
periment and the other pertinent studies. The 
type of situation confronting the subjects in 
the “gambling” experiment is infrequent in 
real life and the choice alternatives were novel. 
The selected alternative was therefore not 
capable of support to the same degree as the 
alternatives involved in studies showing no 
avoidance of dissonance increase: automobiles, 
smoking or nonsmoking, course examinations, 
and lectures on child development. The sec- 
ond proposition, b, emphasizes the gratifica- 
tion of dissonance reduction. Exposure to dis- 
sonance-increasing information, when the dis- 
sonance can readily be reduced, might provide 
the kinds of motivational satisfactions which 
have been reviewed and united by White 
(1959) under his concept of competence. Per- 
haps exposure to choice-discrepant informa- 
tion immunizes (McGuire & Papageorgis, 
1961) the choice against further contradic- 
tion. Whether persons frequently undertake 
self-immunizing exposure to dissonant infor- 
mation is not yet known but there are condi- 
tions (Davids, 1963) under which some per- 
sons (MacKinnon, 1962) will be open to 
ambiguity and inconsistency arising from 
their receptivity to discrepant information. 
The third proposition, c, follows from a and b 
for persons who have made a cognitively sup- 
ported choice and who may seek the gratifica- 
tion of dissonance reduction. It may be as- 
sumed that present smokers had many occa- 
sions and much instigation to amass cognitions 
justifying their habitual behavior. When an 
opportunity was presented, particularly in the 
exposure treatment, to admit and then dispel 
dissonance, it was not avoided. 
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The study dealt with the effects of (a) level 


of fear and (b) specific plans for 


action vs. general recommendations on attitudes toward tetanus inoculations 


and actually getting 


tetanus shots, The arousal of fear resulted in more favor- 


able attitudes toward inoculation and the expression of stronger intentions 


to get shots. However, 


for Ss receiving a specific plan for action. 
fear level some level of arousal was necessary 


actually getting shots occurred significantly more often 


Although action was unaffected by 
for action to occur. A specific 


plan was not sufficient for action to appear. Although the 2 dependent measures 
were affected by different independent variables, those people getting shots 


were also more favorable toward doing so. 
and speculations were presented on the 
the translation of attitudes into actions. 


studies on fear arousal and actions, 
role of specific action plans in 


Information alone seldom provides suffi- 
cient impetus to change attitudes or actions 
toward a given object (Cohen, 1957; Klap- 
per, 1960; Rosenberg, 1956). The information 
must not only instruct the audience but must 
create motivating forces which induce attitude 
and behavioral change. Janis and Feshbach 
(1953, 1954) were among the first to explore 
the effects of information which arouses fear 
or avoidant motivation on the changing of at- 
titudes. Their results indicated that high fear 
arousal produced less adherence to recom- 
mendations, presumably because high fear 
produced responses of defensive avoidance. 
Support for the finding of less persuasion 
with high- than with low-fear communications 
has also been presented by Goldstein (1959) 
and by Janis and Terwilliger (1962). How- 
ever, in other recent studies evidence has ac- 
cumulated which suggests the need to re- 
evaluate the relationship between fear arousal 
and persuasion. 

First, Berkowitz and Cottingham (1960) 
have demonstrated that, at relatively low 
levels, increments in fear may produce in- 
creased attitude change especially for subjects 
for whom the communication was less rele- 
vant. Leventhal and Niles (1964), and Niles 
(1964) have also found that fear arousal in- 


1This study was conducted under United States 
Public Health Service Grant CH00077-02. We would 
like to thank John S. Hathaway and James S. Davie 
of the Department of University Health for the co- 
operation and help they gave to this study. 
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The results are compared with other i 


creases persuasion. They obtained a positive 
correlation between reported fear and inten- 
tions to act (Leventhal & Niles, 1964), and 
increases in intentions with increasingly pow- 
erful communications (Niles, 1964). These ef- 
fects were found using stimuli considerably 
more vivid and frightening than those used in 
any of the earlier investigations. Thus, these 
experiments suggest that fear functions as a 
drive which promotes the acceptance of rec- 
ommended actions, and, regardless of the ab- 
solute level of fear arousal used in any study, 
the communication which arouses more fear 
will be more persuasive. 

There are a number of incidental factors 
that may account for the different results in 
these studies: for example, Janis and Fesh- 
bach’s topic was dental health while ling 
cancer was the topic for Leventhal and Niles 
(1964) and Niles (1964); Janis and Feshbach 
(1953) used high-school students, Leventhal 
and Niles used people attending the New York 
City health exposition, and Niles used college 
students. However, while these factors could 
be responsible for the different outcomes, one 
variable which seems of particular importance 
is the availability of the recommended action. 
In their study, Janis and Feshbach (1953) 
suggested that fear arousal could lead to in- 
creased persuasion if the action was immedi- 
ately available. In the Leventhal and Niles 
study, action was immediately available to all 
groups of subjects; that is, they could get an 
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X “X ray, and, while stopping smoking may re- 


quire concerted effort over a long period of 
time, it can be initiated immediately. In the 


` Niles experiment the arousal of fear increased 


desire to take action principally for subjects 


ewho do mot see themselves as vulnerable to 


disease. Subjects who feel vulnerable to dis- 
easé showed relatively small increases in will- 
ingness to take preventive action when made 


fearful. Their greater resistance to persuasion 


seemed to be related to their tendency to judge 
the recommendations to prevent lung cancer 
as ineffective. In addition, subjects high in 
vulnerability scored low on a scale of self- 
esteem that relates to seeing oneself as able 
to cope with the environment (Dabbs, 1962; 
Leventhal & Perloe, 1962). The findings sug- 
gest that when environmental conditions or 
the subject’s dispositional characteristics make 
action seem highly possible and effective, fear 
will promote action and attitude change. 
The present study was designed to provide 
additional data on this question. Fear-arousing 
and non-fear-arousing communications were 
used recommending a cleat action (taking a 
tetanus shot) which is for all intents and pur- 
poses 100% effective. Thus, in line with our 
earlier findings (Leventhal & Niles, 1964; 


“Niles, 1964), it was predicted that more atti- 


tude change and more action would be pro- 
duced by the high-fear conditions. Second, an 
attempt was made to experimentally manipu- 
late the perceived availability of the recom- 
mended action by giving some subjects a 
specific plan to guide their action. It was hy- 
pothesized that adherence to the recom- 
mended act would be greater among subjects 
possessing a specific plan. Finally, an inter- 
action was expected between fear and spe- 
cificity: highly motivated subjects, that is, 
those exposed to the fear-arousing materials, 
were expected to show the greatest attitude 
and behavioral compliance when a clear plan 
for action was given to them. 

Another question which was raised with re- 


‘gard to these divergent findings was the kind 
_ of emotion evoked by the stimulus. Careful 


attention has been given to discriminating 
levels of fear arousal, and to the possibility 
that fear-arousing communications arouse ag- 
gression as well as anxiety (Janis & Feshbach, 
1953; Robbins, 1962). Other studies in the 


current program (Leventhal, Jacobs, & Trem- 
bly, 1963 unpublished*) suggest that fear 
may be experienced with many other emo- 
tions. Therefore, several items were used to 
assess emotional arousal with the hope that 
these would provide added information on the 
nature of the fear associated with persuasion, 


METHOD 


Design and Subjects 


The experimental design incorporated two levels of 
fear and two levels of information on the availability 
of the recommended action, Additional control 
groups were run to clarify questions unanswered by 
the factorial design. These are described in the 
Results section, Booklets were used to present the 
fear-arousing stimuli and to deliver the recommenda- 
tion for inoculation, A questionnaire was completed 
after reading the booklet. 

All subjects were seniors at Yale University and 
were selected by taking every other name from the 
class list. Initial contacts were by mail, and specific 
appointments for the experimental session were made 
by phone. No inducements were offered for partici- 
pating in the study and subjects only knew that they 
were to evaluate a public health pamphlet. All con- 
tacts with the subjects were made by using the name 
of the John Slade Ely Center and the University 
Department of Health. 

Subjects were run individually and in a building 
2.5 blocks away from the University Health Service. 
Conducting the study in the University Health Build- 
ing would have made it far too simple for subjects 
to get shots. When a student entered the experiment, 
he was given a pamphlet and told: “Would you 
please read this pamphlet carefully. When you are 
finished, please bring it back to me and I will give 
you a questionnaire to fill out about it” After 
reading the pamphlet and filling out the question- 
naire the students departed. There was minimal 
conversation with the experimenters. 


Experimental Manipulations 


The booklets were composed of two sections; a 
“fear section,” dealing with the causes of tetanus 
and including a case history of a tetanus patient; 
and a “recommendation section,” dealing with the 
importance of shots in preventing the disease. There 
were two forms of each section: high fear and low 
fear, specific recommendation and nonspecific recom- 
mendation, making four pamphlets. 

Fear manipulation. The same facts about the dis- 
ease were present in both fear levels. Three devices 
were used to manipulate fear: coupling frightening 
or nonfrightening facts with basic information on 
tetanus; emotion-provoking or emotion-nonprovoking 
adjectives to describe the causes of tetanus, the 
tetanus case, and the treatment of tetanus; and 


2 Unpublished study entitled “Negative Emotions 
and Persuasion.” 
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including different kinds of photographs to illustrate 
the pamphlet. For example, in the high-fear booklet 
the incidence was described as being as high as that 
for polio, and the bacteria were described as “under 
your finger-nails, in your mouth” etc, and as 
literally surrounding the reader. The low treatment 
simply stated these facts in a nondramatic way. The 
aim of the high-fear booklet was to create a strong 
feeling of personal vulnerability. 

‘A case history, constructed from reports in medical 
journals, was presented to make vivid the severity 
of the disease. In the high-fear condition, the word- 
ing was constructed to create a clear image of the 
patient's symptoms (convulsions; his back arched 
upwards, his head whipped back, mouth slammed 
shut, etc.). Photographs were also included which 
showed a child in a tetanic convulsion and bedridden 
patients. Three of the photographs were in color. 
One illustrated a gaping tracheotomy wound, the 
others depicted patients with urinary catheters, 
tracheotomy drainage, and nasal tubes, The treat- 
ments illustrated are actually used in the therapy 
of severe cases of tetanus, They proved to be quite 
startling to the subjects. 

In the low-fear condition, colored photographs 
were omitted as were the pictures of the hospital 
patients and equipment, Two photographic copies 
of drawings of the facial expressions found in tetanus 
were included, The case history was described in 
unemotional terms and, whereas the patient died in 
the high-fear booklet, he survived in the low-fear 
case. Otherwise, the booklets were factually identical 
and were approximately of equal length (7 mimeo- 
graphed pages). 

Plan for action. After the presentation of the case 
history, all pamphlets contained identical paragraphs 
on the importance of controlling tetanus by inocula- 
tion. This point was illustrated by statistics which 
clearly demonstrated that shots are the only powerful 
and fully adequate protection against the disease. 
In addition, it was stated that the University was 
making shots available free of charge to all interested 
students. 

For the high availability treatment additional 
material was included urging the students to take a 
shot and providing a detailed set of suggestions as 
to how he could do this within the context of his 
daily activities. The points made can be paraphrased 
as follows: The University Health Service expressed 
the hope that all students would take the necessary 
action to protect themselves, the location of the 
University Health Service was described and the 
times that shots were available were listed, precisely 
where to go and what to do to get a shot was indi- 
cated, a map was presented of the campus with the 
University Health Building clearly circled, and a 
request was made that each student review his 
weekly schedule to locate a time when he would 

pass by the University Health so that he could stop 
in to be inoculated. The specific recommendation, 
then, is essentially a detailed plan to make the sub- 
jects rehearse the various steps needed to take the 
suggested action. Thus, the low availability groups 
are told of the effectiveness of shots and that shots 


are available. The high availability groups have 
information plus additional material helping them 
to plan and to review the specific steps needed to 
take shots. It should be made clear, however, that 
since the subjects were seniors, they all knew the 
location of student health, and it is extremely likely 
that they had all visited the building at some time 
in the past. The plan, therefore, would simply make : 
salient that which is already known rather than 
providing new information. 


Response Measures 


Two types of responses were observed for all 
subjects participating in this experiment, Immediately 
after the communication all subjects completed a 
questionnaire on which they reported their attitudes, 
feelings, and reactions to the experimental setting. 
In addition, a record was obtained of all subjects 
taking a tetanus inoculation. 

Questionnaire measures. The questionnaire included 
items on: prior inoculation against tetanus; inten- 
tions to be inoculated; attitude regarding the impor- 
tance of inoculation; judgments of the likelihood of 
contracting tetanus and its severity if contracted; 
emotions experienced while reading the communica- 
tions; and reactions to, and interest in, the com- 
munications. The items used will be reported in the 
Results section. 

Behavioral measures. The records of all partici- 

nts were checked by student health authorities 
and a count was made of the subjects in each condi- 
tion who were inoculated. The dates for inoculation 
were also obtained. Those students who were inocu- 
lated at the close of the semester, more than 1 month, 
after the study termination, were not included in the 
inoculation count. It is common practice for students 
taking trips abroad to receive inoculations at the 
end of the semester. 

The questionnaire also included a variety of items 
on many diseases besides tetanus. The items were 
included principally to suggest to the subject that 
the investigation was on health, rather than an 
attempt to coerce him into taking an inoculation. 
Thus, items asked about prior shots for polio, 
typhoid, and flu, and feelings of susceptibility to 
and the severity of six other diseases, 


RESULTS 
Fear Arousal 


As can be seen from Table 1, the fear 
manipulation was highly successful. Subjects 
report feeling greater fright, tension, nervous- 
ness, anxiety, discomfort, anger, and nausea 
in the high- than in the low-fear treatment.® 
All differences were significant at p < .005. 


The self-reports of emotion were obtained by 
asking “While you were reading the pamphlet did 
you find that you had any of the following feelings?” 
A series of adjectives with 21-point scales followed 
this statement. 
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Se; TABLE 1 
MEAN REPORTED EMOTIONS BY TREATMENT AND PRIOR INOCULATION OF SUBJECT 
High fear Low fear Significance data 
: Emotion* > 
o prior rior 
© Nops Shot Dome Shot (high-low) af 
Fear 9.25 8.47 4.51 3.21 43.19* 1/139 
. Base line (n = 11) 2.61 
Recommendations only (n= 29) 3.00 
V Tension 9,28 9.07 4.43 3.58 35.28* 1/139 
~ Base line 1.35 
Recommendations only 2.83 
Nervousness 7.85 7.79 3.93 2.70 36.40* 1/139 
Base line 1.53 
Recommendations only 3.10 
N Ankiety ; 9.51 9,06 5.43 3.26 32.30" 1/139 
* "| Base line 2.97 
has + Recommendations only 4.03 
4 Discomfort 11.43 10.40 4.47 3.36 63.84* 1/139 
Base line 3.87 
Recommendations only 3.52 
Anger 4.56 3.40 2.19 1.59 15.54* 1/140 
Base line 5.94 
Recommendations only 3.41 
Nausea 7.34 5.22 2.61 2.19 27.81* 1/140 
Base line 3.87 
Recommendations only 1.59 
Ne 30 44 29 44 


nificant interactions, 


figures, 
*p = 005, 


Incidental observations indicated that the 
high-fear booklets were indeed distressing. 
__All subjects were extremely intent and focused 
on the materials; some appeared pale, others 
haken and many made other sounds and 
gestures indicating distress. These treatment 
effects were significant whether or not sub- 
jects had been inoculated against the dis- 
ease. (Means are also presented for two 
control groups. One was not exposed to any 
communication and simply filled out the 
“Health Practices Questionnaire.” The other 
“received only the specific recommendation 
material prior to completing the question- 
naire.)* 


"4The control subjects completed essentially the 
same questionnaire as the experimental subjects. 
However, all references to the communication were 
“omitted for the questionnaire-only control. The rela- 
_tively high mean for anger in the questionnaire-only 
" groups reflects the irritation felt by some of these 
subjects upon having to answer questions (on emo- 
tion) which did not seem to have a clear referent. 
On the other hand, a few of the recommendation- 


Note,—The means are averages of the means for specific-nonspecific for each fear level. This was done for ease in presentation. 
The analyses were conducted on the complete table using Walker and Levy's (1953) technique for unequal n's. There were no sig- 


* Twenty-one points on scale. Higher numbers on all scales indicate higher reported affect. 
b Nausea was also effected by specificity. This will be reported in a later table. 
© Ns are the number of cases for the averaged pair of means. The N in the second column was actually higher than 44 for two 


Attitudes. A general question (“How im- 
portant do you think it is to get a tetanus 
shot?”—13-point scale) was used to assess 
the degree of importance which subjects at- 
tached to tetanus shots. Another question was 
used to assess his intentions to avail himself 
of inoculation (“Do you intend to get a 
tetanus shot?”—13-point scale). As can be 
seen in Table 2, regardless of prior inocula- 
tion, subjects in the high-fear conditions feel 
that shots are more important than do sub- 
jects in the low-fear conditions (F = 8.45, 
df = 1/140, p < .01). 

For the intention question, there were two 
important trends. First, subjects who had 
had a shot within the last 2 years scored 
lower than those who had not had a shot 
(F = 16.8, df = 1/136, p < .005). In addi- 
tion, there was a trend for subjects in the 


only subjects felt upset by the suggestion to get 
shots, The detailed suggestion to act seemed unusual 
enough to convince a few that a threat must be 
imminent, 
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TABLE 2 
MEAN ĪMPORTANCE OF TETANUS SHOTS AND INTENTIONS TO BE INOCULATED 
No prior shots Prior shots 
Control 
; High fear Low fear High fear Low fear 
UE of shots (13-point scale) are tot ae ne 
: Be | tho) eee) me 
i 11.29 10.. ‘ 5 
ken 17 13 2 22 Wi 
Recommendations only “u 
Base line ey 
N 
i -poi l 
Totei to get shots (13-point scale) TE O ae A 
z 12 o% | is2 135 
i 11.29 7. 
Agee 17 13 21 20 < 
Recommendations only 10 
Base line ie 
N 
Strong intention (13) 18« 9 
Moderate intention (1-12) 12 20 


sx? = 6.22, df = 1, p <.02. 


high-fear treatment, regardless of inocula- 
tion status, to express stronger intentions to 
get shots than did subjects in the low-fear 
condition (F = 3.55, df = 1/136, p < .08). 
Since the distributions were skewed down- 
wards and the means in the high-fear cells 
were over 11, using a 13-point scale, it ap- 
peared that a ceiling effect was operating to 
minimize the differences. Therefore, a test of 
the effect of fear on intentions was performed 
by treating all scores of 13 as very strong 
intentions and scores of 12 and below as weak 
intentions. Using only those subjects who kad 
not had a shot within the last 2 years (rele- 
vant subjects), a significantly greater number 
of subjects expressed a strong intention to 
be inoculated in the high-fear treatment than 
in the low (x? = 6.22, df = 1, p < .02). 
Action. During the 4-6 week period be- 
tween the experimental sessions and the end 
of classes 9 of the 59 eligible subjects went 
for shots. Of the 9, 4 were in the high fear 
specific, 4 in the low fear specific, 1 in the 
low fear nonspecific, and none in the high 
fear nonspecific. A comparison between the 
27.6% of specific takers and the 3.3% of 
nonspecific takers is significant (CR = 2.65, 
É <.01). Subjects in the specific condition 


were more likely to get shots.® Thus, attitudes 
and actions appear to be affected by different 
factors. While a low fear nonspecific com- 
munication has little influence on either atti- 


tudes or actions, fear-arousing messages af- ' 


fect attitudes regardless of specificity of plan, 
and recommendations using specific plans 
affect actions regardless of the level of fear. 

Recommendations-only control. Because the 
specificity factor did not interact with arousal 
as predicted, it was unclear whether the 
arousal of fear was a necessary condition for 
action. The main effect suggests that specific 
information may be a sufficient condition for 


5 Seven additional eligible subjects took shots fol- 
lowing the close of classes. These were distributed as 
follows: four specific (two high and two low) and 
three nonspecific (two high and one low). Adding in 
these cases gives 41% specific, 13% nonspecific tak- 
ing shots (CR =2.44, p < .02). Among the ineligible 
subjects two in the high fear nonspecific condition 
took shots. It appears that most of these subjects 
were receiving shots as part of a series in preparation 
for travel. 

€ One could argue that the specificity effect was 
obtained because subjects in the nonspecific condi- 
tion missed the statement that shots could be ob- 
tained at University Health. However, when subjects 
were asked where they would get shots, 70% of 
those in the nonspecific treatment and 72% of those 
in the specific mentioned University Health, 


. 


‘She occurrence of action. To test this pos- 
=, sibility a control group was run of subjects 
“exposed only to specific information. This 
group was run the following year and a time 
4 f difference is involved which was absent in 
| othe other comparisons. 
<The procedures for contacting and dealing 
with subjects were identical to those used in 
the four experimental groups. Of the 30 eli- 
.giblė subjects in the group not one availed 
himself- of the opportunity to obtain an 
inoculation. Thus, specific information alone 
does not seem to be sufficient to influence 
\ actions or attitudes (see Table 2). 
', Action base line. The date of tetanus inocu- 
lation for a sample of 60 students was also 
“ obtained to record the base rate of inoculation 
seeking during the experimental period. None 
of the students (eligibles or ineligibles) were 
l inoculated during that period. Therefore, 
- while the rate of shot taking was not high 
in the specific experimental treatments 
(27.6%), it is obviously greater than the 
base rate.” 


* Mediating Factors 


~. Variables associated with attitude change. 

. In addition to the fear measures, several other 
. measures of reported feelings varied in the 

same manner as did attitudes toward tetanus 

~ 5. inoculations. Subjects in the high-fear condi- 
¿= tion felt that tetanus was more serious than 
‘did subjects in the low-fear treatments (“How 
serious do you think it would be if you con- 
tracted tetanus?” F = 22.94, df= 1/139, 
< .005). Subjects in the high-fear condi- 
tions also reported more concern about getting 
tetanus (“When you think about the possibil- 
ity of getting tetanus, how concerned or 
worried do you feel about it?” Table 3, 
F = 3.92, df = 1/140, p< .05), more worry 
about the way they had treated cuts (“While 
you were reading the pamphlet, did you feel 
worried about the way you have treated 
-abrasions, cuts, or bruises?” F= 3.75, df 
= 1/136, p < .10), and reported more irrita- 


Only one of the subjects included in the experi- 
ental groups had had-a shot just prior to partici- 
pating. He had received it as treatment for an in- 
jury. Thus, there is little reason to believe that stu- 
“dents spontaneously take shots during the year to 
protect themselves against tetanus. 
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tion directed at the photographs than did 
subjects in the low-fear condition (“Did the 
illustrations irritate you or make you angry?” 
F= 6.95, df= 1/136, p< .01). High-fear 
subjects were also more certain than low- 
fear subjects that the pictures used enhanced 
the pamphlet (“Did the illustrations in the 
pamphlet enhance the message of the pam- 
phlet?” F= 11.62, df=1/134, p< .001). 
These effects were significant regardless of the 
subjects prior vaccination history. As with 
the prior measures of emotional arousal, being 
vaccinated is no protection against the dis- 
tressing emotions which appear to be elicited 
by the pamphlet per se. It is also interesting 
to note that the arousal of aggression (anger 
and irritation) occurs in the same conditions 
as opinion change. Therefore, either the an- 
noyance and irritation prompted by the com- 
munication and the illustrations does not 
minimize their effectiveness or was not of 
sufficient strength to arouse resistance to 
persuasion. 

Variables associated with action. In ex- 
amining data relating to action, we shall 
compare the means only for those subjects 
eligible for vaccination. Subjects receiving 
specific recommendations tended to report 
stronger feelings of susceptibility to tetanus. 
Though the difference is at the .10 level (“The 
chances are in 100 that I will contract 
tetanus” Table 4, F=3.14, df= 1/136, 
p < .10), the scores are highly skewed and 
do not approach significance using an ap- 
propriate test. 

Subjects receiving the specific recommenda- 
tion reported feeling less nauseated than 
those getting the general recommendation 
(F = 6.14, df = 1/140, p < .01). Thus, while 
nausea was increased by the high-fear book- 
lets, it was depressed by the specific recom- 
mendation and for the low fear specific was 
below the control mean, In addition, the 
specific subjects reported more interest 
in the communication (“Did you find the 
pamphlet interesting?” F = 4.26, df = 1/137, 
p< .105). It seems, therefore, that the cor- 
relates of action are a greater interest in 
the outer environment and a lessening of 
what may be potentially inhibiting visceral 
reactions, though fear itself is high. 
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MEAN OTHER DIFFERENCES BETWEEN HIGH AND Low FEAR 
CORRELATED WITH ATTITUDE CHANGE 


High fear Low fear 
: F 
(high-low) o 
No pior | Prior shot | Noprior | Prior shot 
1, Feanieences of tetanus“ gee 1068 eae E02 22.94** 1/139 
Control (V=11) 7.02 
Recommendations only (N =30) 8.90 
ti . 
A he cep 10.48 10.59 9.13 7.67 3.92* 1/140 
N 30 45 29 44 
Control We 11) i pee 
Recommendations only 5 
3. ory over cuts and bruises E > oa EA 3.75 1/136 
Control es 11) i ne 
Ri tions on! 2 
4. EAE MEATA pra S 23i zee 6,95** 1/136 
N 0 
5. fen enhance pamphlet ie Be PA oy 11.62*** 1/ es 


* Item 1 used a 13-point scale, 


Items 2 and 3 used 21-point Scales, 


and Items 4 and 5 used 9-point scales, 


» Means reported are the average of the means for specific and nonspecific cells, 
*p = 05, . 


mp = 01. 
wD = 005. 


Takers versus nontakers. The analyses to 
this point appear to indicate that fear arousal 
is sufficient to influence attitudes while both 
arousal stimuli and specific recommendations 
are needed for action. Since an increase in 
the level of fear does not increase the rate of 
action taking, it may appear that actions and 
attitudes are no longer related to one an- 
other. To obtain further evidence on this 
question a post hoc comparison was made 
of takers and nontakers on the questionnaire 
measures. In making the comparisons, a 


constant equal to the difference between the 
high- and low-fear means was added to all 
Scores in the low-fear condition. This elimi- 


nated the main effects of arousal and’ 


allowed a comparison of shot takers and non 


takers ignoring the effect of the fear treat- 


ment. The only values that approached 
significance were for anxiety, fright, impor- 
tance of shots, and feelings that the illustra- 
tions enhanced the pamphlet. Thus, shot 
takers, who are mainly in the specific recorn- 
mendation condition, not only differ from 


TABLE 4 
MEAN DIFFERENCES BETWEEN SPECIFIC AND NONSPECIFIC TREATMENTS 


(No Prior Snor SUBJECTS Onty) 


Specific 


Nonspecific Control 
High fear | Low fear | High fear | Low fear eee ese 

Susceptibility (100-point scale) 7.089 13.5690 4.736 4.615 5,58 11.614 

N 5 13 16 17 13 11 28 
Nausea (21-point scale) 5.61 1.75 9.06 3.46 3.87 1.59 

N 13 16 17 13 - 
Interested in pamphlet i" 

(9-point scale) 8.15 7.73 717 7.3 — 
13 15 17 13 


* A few subjects had extremely high scores for these questions which caused an unduly inflated mean, 


TABLE 5 


MEAN Comparison OF SHOT TAKERS 
WITH NONTAKERS 
Takers | Nontakers t 
e=motion 
Anxiety 11.90 9.07 | 1.55 
Ru 9 50 
Fright 11.39 8.88 1.47 
N 9 50 
Attitude 
. Importance 11.96 10.49 1.89* 
9 50 
Illustrations enhance 8.89 6.36 1.59 
9 48 
Susceptibility* 19.16 10.32 1,28 
N 8 21 


. 
® Specific treatment subjects only. 
*p = .05, one-tailed. 


nontakers in the general recommendation 
condition in the ways discussed before, but 
they also show higher scores on the above 
measures. Attitude and arousal are related, 
therefore, to action. 


Discussion 


The data lend mixed support to the 
hypotheses. As in the earlier experiments 
(Leventhal & Niles, 1964; Niles, 1964) fear- 
arousing communications increased attitudinal 
acceptance of the recommendations, in this 
case, favoring tetanus inoculations. Support- 
ing evidence for the facilitating effect of fear 
on attitude change can also be found in Weiss, 
Rawson, and Pasamanick (1963) where high 
scores on dispositional anxiety facilitated 
opinion change. However, these results are 
contradictory to the data reported in two 
studies of acceptance of recommendations for 
dental hygiene (Goldstein, 1959; Janis & 
Feshbach, 1953) where increases in fear 
level appeared to be associated with resist- 
ance to the recommendations.’ As suggested 
earlier (Leventhal & Niles, 1964; Niles, 
1964), the discrepancy between the experi- 
ments may relate to differences in the per- 


. ®&The Janis and Terwilliger (1962) experiment also 
tends to support the thesis that high fear increases 
resistance to persuasion. However, the“ experiment 
was not specifically designed to test this hypothesis 
and the results on this particular issue were border- 
line; the trend for high fear (6-2/11) versus low fear 
(11-0/16) on acceptance of a nonsmoking recom- 
mendation being only suggestive (CR = 1.35, p < .18, 
two-tailed). 
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ceived effectiveness of the recommended ac- 
tions. Thus, tetanus inoculations are far more 
effective as a preventive measure for tetanus 
than toothbrushing is for dental disease. No 
matter how one cares for his teeth, he is still 
likely to have some caries. On the other hand, 
the incidence of tetanus is practically zero for 
protected people, and for lung cancer, the 
incidence is extremely low for nonsmokers, 
Therefore, when fear is aroused it may be 
critical to present an extremely effective (or 
effective-appearing) recommendation to mini- 
mize the possibility that subjects will leave 
the communication setting still in need of 
reassurance and thus open to counter- 
persuasion. 

It has also been suggested (Weiss et al., 
1963) that fear will have opposite effects 
upon attitude change depending upon the 
subjects initial position. When subjects hold 
competing opinions, the increased drive level 
could be predicted to strengthen the incorrect 
responses more than the correct ones (e.g., 
Farber & Spence, 1953), In the present ex- 
periment, it is clear from the control-group 
data that subjects are initially favorable to- 
ward shots. It is possible, however, that sub- 
jects in the dental hygiene study (Janis & 
Feshbach, 1953, 1954) were negative to- 
ward some of the recommended practices and 
that fear strengthened the “incorrect” re- 
sponses. However, this argument loses some 
strength as smokers in the lung cancer studies 
also showed more acceptance of recommenda- 
tions with high levels of fear (Leventhal & 
Niles, 1964; Niles, 1964). Still, the actions 
recommended in the current setting are 
preventive, simple to take, and relatively 
painless. 

Of greater interest, however, is that specific 
plans for action influence behavior while level 
of fear does not. But specific information 
alone is insufficient as action is influenced 
only when specific information is combined 
with one of the fear-arousing communications. 
The group exposed only to specific informa- 
tion is generally quite similar to an unexposed 
control for reported emotions and very similar 
to the unexposed control’s attitudes concern- 
ing the importance of shots. Therefore, while 
emotional arousal is necessary for attitudinal 
and behavioral change, it seems to be suf- 
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ficient for the former and only necessary for 
the latter. Does this mean that behavior and 
attitudes are entirely independent of one 
another? In our first study on lung cancer 
(Leventhal & Niles, 1964) a very high cor- 
respondence was found between intentions to 
get X rays and actually having one taken. 
In addition, X-ray takers reported more fear 
than nontakers. In the present study, the 
comparisons of takers and nontakers revealed 
a similar effect, that is, the takers regarded 
shots as more important (though intentions 
were not stronger) than the nontakers, and 
the takers had higher scores on some of the 
fear indices. Neither of these experiments 
shows significant differences between fear 
treatments for the action measures. If one re- 
examines the setting, for the lung cancer 
study, it soon becomes apparent that subjects 
in all conditions were given a highly specific 
plan for taking X rays; that is, while de- 
livering the recommendation for X rays, “the 
experimenter pointed directly at the X-ray 
unit which was down the corridor from the 
‘theatre.’ The unit was clearly visible to all 
Ss... [Leventhal,& Niles, 1964, p. 462].” 
Therefore, the effects on action are extremely 
similar in both studies and both studies 
produced a relationship between attitude and 
actions, though the relationship is weaker in 
the present experiment. 

Although there is a positive relationship 
between attitudes and behavior, the present 
data show that the independent variables 
have different effects upon attitudes and 
actions. Specific information for taking action 
does not in itself produce favorable attitudes 
but does establish a link between attitude and 
action. What is the nature of this link that 
permits the attitude to be translated into 
action? In certain situations, for example, 
those where the action is immediately pos- 
sible, specificity may entail the elimination of 
barriers to action (Leventhal & Niles, 1964). 
However, in situations such as the present, 
where the actions were carried out several days 
subsequent to the communication, other as- 
pects of the manipulations, for example, re- 
hearsing the action, making a decision to act, 
as well as simple information on how to make 
the response, could be responsible for the 
link. An- examination of questionnaire effects 


> 


associated with the specificity manipulatica 


tentatively suggests that specificity altered 
the subject’s emotional state. Thus, subjects 


receiving the specific plan for action were ` 


somewhat more interested in the materials 


and reported significantly less nausea which; 


can be interpreted to mean that the specific 
information eliminated various inward-turning 
inhibitory features of the fear state. Several 
authors have distinguished between inhibitory 
or depressive fear states and excitatory fear 
states (Bull, 1962; Kollar, 1961; Shands, 
1955) and have associated striving and pro- 
tective activity with the latter. However, 
while it is clear that these affective states 
can be distinguished in communication studies 
(Leventhal et al., 1963 unpublished *) the 
study of their relationship to persuasion and 
action has just begun. The present data do 
suggest, however, that providing a clear pos- 
sibility or plan for action can reduce the 
inhibitory properties of certain fear states. 

Regardless of the exact process by which 
specific information links the evaluative and 
action components, it is still puzzling why 
more action did not occur in that condition 
where the attitude change was greatest. There 
is a very simple hypothesis that can be sug- 
gested to account for this. If the effects of 
fear dissipate rapidly with time, then it may 
be that the failure to find more action in the 
high-fear treatments reflects the fact that 
attitudes were measured at the time of 
exposure while action took place after the 
fear-induced attitude effects had been dis- 
sipated (Leventhal & Niles, 1965). If this is 
the case, no relationship between attitude 
change and behavior could possibly be ex- 
pected. 
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CIGARETTE SMOKING AND ALTERNATIVE METHODS 6 


OF REDUCING DISSONANCE 


LAWRENCE A. PERVIN anp RAYMOND J. YATKO 


Princeton University 


An experiment designed to investigate the various methods used by smokers 
to reduce cognitive dissonance and the relationships among these methods. 
100 undergraduates, 50 smokers and 50 nonsmokers, responded to a question- 
naire designed to investigate 6 possible methods of dissonance reduction. 
Smokers were found to reduce dissonance by minimizing the validity of some 
smoking-cancer findings and by minimizing the personal danger of smoking. 
Smokers did not appear to reduce dissonance by knowing less about the 
smoking-cancer relationship, by underestimating the general extent of the 
danger, or by minimizing the danger of smoking relative to other dangers in 
the world. Neither group showed behavioral interest in acquiring further 
relevant information. A low degree of association was generally found among 
the methods of dissonance reduction suggesting that smokers used alternative 


methods for reducing dissonance. 


Since the publication of Festinger’s (1957) 
theory of cognitive dissonance, a significant 
number of studies have demonstrated the 
relevance of the theory to a wide range of 
phenomena. The circumstances under which 
dissonance reduction will not take place and 
the factors determining the method used to 
reduce dissonance have been less well defined 
and inadequately investigated. 

Cigarette smokers have provided an inter- 
esting group of subjects for investigation of 
the effects of effort toward dissonance reduc- 
tion. When a cigarette smoking individual 
is confronted with the information that 
smoking is most probably harmful, dissonance 
is created between this cognition and the 
cognition that he continues to smoke. Steps 
to reduce dissonance could include termina- 
tion of smoking, avoidance of information 
supporting a smoking-cancer relationship, 
criticism of such information, or minimization 
of the importance of the issue. In an early 
study, Festinger (1957) found that the more 
people smoked the more they refused to ac- 
cept information which would have been dis- 
sonant with smoking. Feather (1962) repli- 
cated this finding but also found cigarette 
smokers to be more interested than non- 
smokers in articles concerning the smoking- 
cancer relationship and regular smokers to be 
more interested in information contrary to 
their held opinion than in information sup- 
porting their opinion. The subjects in this 


study were young university students. Feather 
argued that they were not committed to 
smoking and were interested in any relevant 
information, as is characteristic of pre- 
decision behavior. Thus, commitment was held 
to be an important factor in influencing where 
and how dissonance reduction took place. In 
a later study, Feather (1963) replicated these 
findings with older subjects who were* com- 


mitted to the smoking habit. He concluded 


that the seeking of relevant information also 


occurs after commitment and that dissonant — 
states seem to have more obvious effects on 


evaluation than on sensitivity. 

While smokers and nonsmokers have been 
found to differ in their evaluation of the 
smoking-cancer information, many people .ac- 
cept smoking as a major cause of lung cancer 
but continue to smoke. Horn (1963) found 
that 48% of all cigarette smokers accepted 
cigarette smoking as a cause of lung cancer 
and only 29% rejected it as a likely cause. It 
is possible that these people use other methods 
to reduce the dissonance. If this were true, 
it would suggest that people use alternative 
methods of reducing cognitive dissonance. 
Recently, Steiner and Rogers (1963) pre- 
sented evidence in support of the view that 
individuals do use alternative means of 
reducing dissonance. __ 

The present study attempts to investigate 
the influence of dissonant states upon the 
evaluation of various types of relevant in- 


‘formation, to see whether smokers reduce dis- 
sonance by means other than minimization 
‘of the smoking-cancer relationship and, 
finally, to see whether smokers use alterna- 
tive or supplementary methods of reducing 
. `e dissonance. 


APES METHOD 
Subjects 
Subjects. were 100 Princeton upperclassmen 
`- (juniors and seniors), 50 of whom were cigarette 
smokers and 50 of whom were nonsmokers. The 
latter group did not contain any pipe, cigar, or ex- 
cigarette smokers. Both groups were drawn from a 
larger population of 184 respondents to a question- 
naire, The smokers had been smoking cigarettes for 
fn average of 3.6 years and currently smoked an 
average of 18.0 cigarettes per day. 


Materials and Procedure 


A questionnaire was administered to assembled 
groups’ of students in the dormitories and clubs. 
The questionnaire. consisted of six sections. Each 
section was designed to investigate an alternative 
method of réducing dissonance. Section 1 attempted 
to get at differential exposure to and awareness of 
relevant information between smokers and non- 
smokers. The task was to correctly respond to 12 
trué-false statements concerning the effects of 
smoking, There were three items in each of four 
Statement categories: prosmoking, true (“Investi- 
. gators have not been able to produce any cancer 
‘in rodents by exposing them to tobacco smoke.”) ; 
i prosmoking, false (“While nicotine may have some 
harmful Seffects, it is also known to have some 
A rative. jowers.”) ; con-smoking, true (“Virtually all 
with lung cancer have admitted to 
}con-smoking, false (“The damage done 
fohalation of. cigarette, smoke is 


Section 2. measured ‘the subject’s evaluation’ of the 
/eridenge : ‘concerning the smoking-cancer relationship. 
“Te aa of eight statements which have been 
Te about the evidence concerning a smoking- 
„cancer relationship. Half of these statements empha- 
sized. the’ validity of the evidence (“It is hard to 
escape the conclusion that the statistical relationships 
: found reflect a direct ‘causal relation.”) and half 
minimized the validity of the evidence. Subjects 
indicated whether they strongly agreed, mildly agreed, 
or disagreed with each statement. 
ss While ‘cigarette smokers could be aware of the 
„information and generally believe in the evidence, 
t. they: could reduce dissonance by minimizing the 
ext of a relationship and thereby the perceived 
danger. Section 3 measured the extent to which 
relative to nonsmokers, would underesti- 
ize of the smoking-cancer danger. Eight 
from research studies were presented with 
tatistical findings left blank (“The lung 
times 
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higher among cigarette smokers than among non- 
smokers.”). Subjects estimated the exact findings. 

Another means of reducing dissonance is to believe 
that the facts are not personally relevant. Section 4 
required subjects to give estimates of: the probabil- 
ity of their being among those predisposed towards 
lung cancer, the number of cigarettes smoked per 
day that is truly dangerous, the minimum number of 
years within which a smoking individual predisposed 
towards cancer will get the disease, the time within 
which a cure for all cancer should come, the proba- 
bility that filters successfully screen out poisonous 
agents. 

Section 5 investigated the extent to which smokers 
emphasize the pleasures of smoking and minimize 
the negative side effects of smoking (kills the taste of 
food, “bad breath”). Because of the obvious dif- 
ference between the two groups in experience of the 
pleasures and negative effects of smoking, nonsmokers 
were asked to respond according to what they 
thought the pleasures and negative effects were for 
smokers, 

A final way to reduce dissonance is to minimize 
the danger of smoking relative to other dangers. For 
example, smokers can minimize their concern with 
death or emphasize other serious dangers in the 
world. Section 6 asked the subjects to rate their fear 
of death and the danger of air pollution, World War 
III, radiation, and death through accident. 

An additional item was included in the question- 
naire. This item indicated that a mimeo listing the 
facts reported on the smoking-cancer relationship was 
available at the Medical Office. This item was in- 
cluded to see whether smokers and nonsmokers 
would show different degrees of interest in acquir- 
ing such information, The last page of the question- 
naire inquired into the subject’s present and past 
smoking habits. This information was used in se- 
lecting the smoker and nonsmoker samples. 


RESULTS 


The results will be presented in relation to 
each section of the questionnaire. Factual ma- 
terial and correct responses were based on 
Hammond’s (1962) report on the effects of 
smoking. 


Section 1 


If smokers have avoided contact with rele- 
vant information or selectively recall such in- 
formation, they should have fewer correct 
responses on information items. Since 60% 
of the smoker responses were correct and 
59% of the nonsmoker responses were correct, 
this clearly was not the case. Nor were there 
any significant differences between the two 
groups in terms of accuracy within the four 
statement categories. If responses to this sec- 
tion were not determined by knowledge but 
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rather by opinion and attitude, then disso- 
nance theory would predict that smokers 
would tend to answer the prosmoking items as 
true and the con-smoking items as false, re- 
gardless of the nature of the accurate response. 
If such a response can be called a dissonance- 
reducing response, then with 12 items each 
group (n = 50) could give a maximum of 600 
dissonance-reducing responses. Smokers gave 
333 such responses while nonsmokers gave 329 
such responses, a clearly insignificant differ- 
ence. There was only one significant difference 
in individual items. Compared to smokers, 
nonsmokers more often felt that the damage 
done through the inhalation of cigarette smoke 
and tars is irreversible (x? = 3.24, p < .05). 
Considering direction of differences, 5 were in 
the predicted direction and 7 were in the op- 
posite direction. In sum, there were no signifi- 
cant differences between smokers and non- 
smokers in terms of overall accuracy or in 
terms of accuracy in relation to varying kinds 
of information and little evidence of a smoker 
tendency toward making dissonance-reducing 
responses. 


Section 2 


Dissonance theory would predict that com- 
pared to nonsmokers, smokers should more 
often agree with opinions questioning the 
validity of the smoking-cancer studies and 
disagree with opinions supporting the validity 
of such studies. Since the two groups did not 
differ in the strength with which they agreed 
or disagreed, the responses were scored as 
agree or disagree. The responses of smokers 
and nonsmokers to the group of items in each 
category are given in Table 1. Agreement with 
„views questioning the validity of the studies 
and disagreement with views supporting the 


TABLE 1 


NUMBER OF SMOKERS AND NONSMOKERS EXPRESSING 
AGREEMENT OR DISAGREEMENT WITH OPINIONS 
CONCERNING THE SMOKING-CANCER STUDIES 


Question validity 
of the studies 
(Items 3, 4, 6, 8) 


Support validity 
of the studies 
(Items 1, 2, 5, 7) 


Agree Disagree Agree Disagree 
Smokers 98 102 125 13 
Nonsmokers 92 108 150 50 


validity of the studies may be considered as 
possible dissonance-reducing responses. Of 400 
possible dissonance-reducing responses, smok- 
ers gave 173 and nonsmokers gave 142 (x? = 
4.5, p < .05). Table 1 indicates that the ma- 
jor source of variance was in relation to opin- 
ions supporting the validity of the relevant 
studies (x? = 6.05, p < .01). Concerning indi- 
vidual items, smokers significantly more often 
disagreed with the view that “There is no 
longer any doubt that smokers have a higher 
overall death rate than non-smokers” (x° = 
8.56, p < .01) and significantly more often 
agreed with the view that “The increase in 
lung cancer can be attributed to improve- 
ment in diagnosis” (x? = 3.16, p< .05). 
While such results support dissonance theory, 
it is important to note that 92% of the smok- 
ers agreed with the view that “It is hard to 
escape the conclusion that the statistical rela- 
tionships found reflect a direct causal rela- 
tion.” On three items the differences, though 
not significant, ran contrary to predictions 
from dissonance theory. In sum, smokers show 
a significantly greater tendency to question 
the validity of the relevant studies than do 
nonsmokers but are quite selective in the opin- 
ions they support and reflect. 


Section 3 


It was predicted that smokers would show 
dissonance reduction by minimizing differ- 
ences in death rates and frequency of lung 
cancer among heavy smokers, light smokers, 
and nonsmokers. There were no significant 
differences between the two groups in this 
respect. Both groups tended to minimize the 
differences considered. For example, one item 
read: “The lung cancer death rate has been 
found to be times higher among all 
cigarette smokers than among non-smokers.” 
The reported means were 3.4 for smokers and 


3.5 for nonsmokers while the correct answer 
is 10. 


Section 4 


It was predicted that smokers would reduce 
dissonance by believing that the facts are not 
personally relevant. The first item stated: “If 
there is a predisposition towards cancer, I feel 
that the probability of my being among the 
predisposed is: low. high___.,” Forty 
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TABLE 2 
SMOKER AND NoNsMOKER ESTIMATES OF THE DANGER DUE TO SMOKING 


Smokers Nonsmokers 
pse Mannov akae 
M Median M Median 
“ih Estimate of number of cigarettes smoked per day 
‘that is truly dangerous. 26.5 30 20.1 20 1.67* 
2. Estimate of minimum number of years an in- 
dividual predisposed towards cancer must smoke 24.5 15 22.8 10 -17 
to get cancer at some time. 
3, Estimate of number of years within which a cure 
for all cancer should come. 22.6 20 33.1 20 .67 


*p <.05. 


nonsmokers and 27 smokers reported a low 
probability of predisposition (x? = 6.48, p < 
.01). This finding runs contrary to dissonance 
theory. This is perhaps due to the likelihood 
that smokers interpreted “predisposition 
towards cancer” as following from smoking 


, whereas it was intended to imply a constitu- 


tional predisposition regardless of smoking 
behavior. Concerning the probability that 


` filters successfully screen out poisonous agents, 


42 smokers and 41 nonsmokers said that the 


“probability was low. Within the smoker cate- 


gory there was no difference between smokers 
of filter cigarettes and smokers of nonfiltered 
cigarettes. The data for the remaining three 
items in this section are presented in Table 2. 
Smokers gave significantly lower estimates of 
the number of cigarettes smoked that is 
dangerous. The mean and median for smokers 
are well above the mean of 18.0 cigarettes ac- 
tually smoked per day. Smokers were divided 
intq those who smoked less than half a pack a 


day (medium), and those who smoked more 
than a pack a day (heavy). The median re- 
sponse to this item was 30 for all groups. 
However, the means were as follows: 
heavy = 28.7, medium = 30.0, light = 22.0. 
Compared to light smokers, heavy and medium 
smokers tended to report a higher estimate. 
While there was not a significant difference 
between the two groups on the other items, 
the differences were in the predicted direction. 
Compared to nonsmokers, smokers tend to 
feel that one has to smoke for a longer time 
to be in danger and that a cure for all cancer 
will come sooner. Furthermore, the smokers 
mean for a cure for cancer (Item 2) is lower 
than that for the dangerous number of years 
(Item 3), while the reverse is true for non- 
smokers, Of the 50 smokers, 25 gave an esti- 
mate for Item 3 that was lower than that for 
Item 2, 6 gave an estimate equal to that for 
Item 2, and 18 gave an estimate above that 
for Item 2. The corresponding numbers for 


day (light), those who smoked up toa pack a nonsmokers are 16, 11, and 23 (x? = 4.46, 
TABLE 3 
SMOKER AND NoNsMOKER RATINGS OF POSSIBLE Discomrorts ACCOMPANYING SMOKING 
Serious effect Mild effect No effect 
Item bad 
Smoker Nonsmoker Smoker Nonsmoker Smoker Nonsmoker 

» Smoking kills the tast: 

of food. = a“ 5 13 27 29 18 6 11.45** 
. Smoking causes bad 

breath j 15 30 30 18 5 1 10.62** 
. Cigarettes are bulky to 

carry. 4 0 3 5 10 45 36 4.64 
. Cigarette smoking is exe 

pensive. s 19 31 24 16 i 2 6.75* 
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p= .10, one-tailed test). In sum, there is 
evidence to support the view that many of 
these smokers reduce dissonance by believing 
that their tobacco consumption is below a 
dangerous level and that a cure for cancer 
will come before they are in danger for having 
smoked too many years. 


Section 5 


Rating the pleasure of smoking (for smok- 
ers) as either great or little, 33 smokers and 
27 nonsmokers responded great. This differ- 
ence was not significant. Data for ratings of 
some of the side effects of smoking are given 
in Table 3. Compared to nonsmokers, smokers 
consistently minimize possible discomforts 
accompanying smoking. 


Section 6 


It was predicted that smokers would mini- 
mize the importance of the smoking-cancer 
relationship by emphasizing other dangers in 
the world. This hypothesis was not confirmed. 
There were no significant differences between 
the two groups or evidence of a trend in the 
predicted direction. 

Only one person (a smoker) came to the 
Medical Office requesting the mimeo on the 
smoking-cancer relationship. Thus, this item 
could not be used as a behavioral test of dif- 
ferential interest in relevant information. 

To investigate the relationship among dis- 
sonance-reducing methods, seven items on 
which smokers and nonsmokers differed were 
selected. These items were the one significant 
difference in Section 1 (Variable 1), the two 
significant differences plus a third item from 
Section 2 (Variables 2, 3, 4), Items 1 and 


TABLE 4 


CHI-SQUARE VALUES FOR RELATIONSHIPS AMONG 
RESPONSES ON DISSONANCE-REDUCING ITEMS 


(SMOKERS) 
oee cc T NE 

1 01 03 | .O1 | .02 | 1.76 59 
2 57 | .01 | 9.50*| .02 | 10.73* 
3 52 | 1.13 48 | 1.99 
4 3.24 42 54 
5 .42 .12 
6 4.52 


*p <.01. 
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2 from Table 2 (Variables 5 and 6), and Item 
1 in Table 3 (Variable 7). The smokers were 
then divided into dissonance reducers and 
nondissonance reducers on each item. A me- 
dian split was used for the items taken from 
Table 2. The question asked was whether 
smokers who gave dissonant-reducing re- 
sponses on one item also gave such responses 
on other items. Table 4 reports the results of 
chi-square tests of the interrelationships 
among the seven response variables. Of the 21 
relationships tested only 2 were significant, 
suggesting the use of alternative methods for 
reducing dissonance. 


DISCUSSION 


The data indicate that smokers and non- 
smokers differed on a number of ways of re- 
ducing dissonance. Moreover, some character- 
istics of the samples which may have been 
responsible for the limited number of differ- 
ences were: the nonsmokers may have in- 


cluded a number of potential smokers, the . 


smokers included some potential nonsmokers, 
the smokers have not been smoking for very 
long and may not be very committed. Feather 
(1962) faced similar possible limitations in his 
first study, but replicated his results in a 


second study (Feather, 1963) and concluded `+ 


that young smokers and smokers committed to 
the habit for some time behave in the same 
way. There is reason to believe, therefore, that 
the interpretations of these data are applica- 
ble beyond the population investigated. 

While this discussion is based upon the as- 
sumption that the cigarette smokers were re- 
ducing dissonance, it is also possible that the 
nonsmokers were reducing dissonance. For 
the nonsmoker, the cognition that smoking 
can be pleasurable and a sign of sophistication 
may be dissonant with the cognition that he 
does not smoke. Therefore, he may tend to 
emphasize the harmful effects of smoking. 
This, however, should have led to greater dif- 
ferences between the two groups. 

A number of comparisons between these 
results and those reported by Feather (1963) 
are possible. Since smokers and nonsmokers 
did not differ on awareness of information or 
interest in obtaining information but did differ 
on the evaluation of relevant information, 
these results may be taken to support Feath- 


ser’s view that dissonant states have more ob- 
vious effects on evaluation than on sensitivity. 
While the subjects in Feather’s study indi- 


z. cated interest in the smoking-cancer articles, 


the subjects in this study indicated little be- 
havioral interest in obtaining such informa- 
tion. This was in spite of the fact that the 
mifneo described was available within 10 min- 


‘utes walking distance of all campus class- 


rooms and dormitories. The data in this study 
suggested that beyond questioning the valid- 
ity of smoking-cancer studies, smokers reduce 
dissonance by minimizing the personal rele- 
vance of the danger..This is done by under- 
estimating the dangerous level of cigarette 


“consumption and by believing that a cure for 


cancer will come before the danger to them 
from smoking is greatest. These data support 
Feather’s (1963) suggestion that “some regu- 
lar smokers may reduce cognitive dissonance 
by identifying themselves as light smokers 
only [p. 161].” These data also support the 


-view of Taylor (1962) that in spite of the 


fact that the American public has in general 
accepted the evidence, the individual smoker 


. “does not relate the hazard to himself, either 


because the extent of the risk is not compre- 
„hended or the day of reckoning seems too re- 
mote [p. 777].” 

The significant differences were few in num- 
ber and the responses of smokers on items 
showing evidence of dissonance reduction 
tended to be uncorrelated. The subjects ap- 
peared to be selective in the methods they 
used to reduce dissonance. Even within the 
More specific area of evaluating the relevant 
information, smokers were selective in their 
responses. Thus, smokers showed little tend- 
ency to question a direct causal relationship 
between smoking and cancer but did question 
the meaning of such relationships for indi- 
vidual and group death rates. This raises the 
question of the factors which determine how 


_ dissonance will be reduced. One factor, the 
ease with which a cognition can be fitted into 
; the existing structure, has been suggested in 


the literature (Feather, 1963; Katz & Stot- 


land, 1959;. Krech & Crutchfield, 1948). Re- 
| lated to this is the ease with which one may 
} hold a cognition which differs from the cogni- 
_tion held by others. For example, it is easier 


to hold a cognition based on opinion which is 
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different from the cognitions held by others 
than it is to hold a cognition related to fact 
which is different from the cognitions held by 
others. Dissonance reduction is easier in an 
area filled with uncertainty than in one’ with 
clear cognitive alternatives which increase 
rather than reduce dissonance. In this study, 
smokers and nonsmokers did not disagree so 
much on the evidence as they did on their 
interpretations of the evidence. In the area of 
fact, the push is toward agreement; in the 
area of opinion, it becomes more acceptable 
to hold cognitions which are not shared by 
others. In some cases the former kind of situ- 
ational constraint may force the individual to 
tolerate dissonance, Thus, were refutation of 
fact the only way for cigarette smokers to 
reduce dissonance, many would be forced to 
give up the habit or tolerate the dissonance. 
As Zajonc (1960) has noted, an individual 
will tolerate dissonance where the methods for 
reducing dissonance or their consequences are 
unsatisfactory. On the other hand, where the 
need for reducing dissonance is great the indi- 
vidual may not only hold cognitions which are 
contrary to the opinions of others but also 
cognitions which are contrary to what is gen- 
erally accepted as fact, as in psychopathology. 

Another factor influencing how dissonance 
will be reduced is the valence of alternative 
cognitions. Certain cognitions may carry more 
weight in reducing dissonance than do others. 
The more fundamental a particular cognition 
is to the reduction of dissonance, the more 
likely it is to be incorporated into the system. 
For example, it is more important to believe 
that smoking does not increase the overall 
death rate or one’s chance of early death than 
it is to believe that smoking is unrelated to 
cancer. Therefore, the former cognition has a 
greater valence and is a more likely choice for 
reducing dissonance. 

The above factors can be viewed as situa- 
tional and as exercising some constraint upon 
a personality preference for one or another 
method of dissonance reduction. Where situ- 
ational factors do not place constraints upon 
the choice of method, personality factors will 
determine the method used for reducing dis. 
sonance. Since situational and personal fac- 
tors likely interact to determine the use of 
alternative methods of reducing dissonance, 
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two different studies may yield contradictory 
results because they permit different responses 
or because one permits the use of several re- 
sponses other than the ones which are meas- 
ured, while the other study does not. 

The data indicating the use of alternative 
methods of dissonance reduction and the in- 
terpretation of these data in terms of the 
interaction between personality and situational 
factors correspond to the findings and inter- 
pretations of Steiner and Rogers (1963). They 
suggest that individual responses to disso- 
nance are governed by “habit family hier- 
archies” which are influenced by situational 
factors. The question of personality-related 
preferences for dissonance reduction can be 
studied by comparing individuals across situ- 
ations or by seeing whether subjects with simi- 
lar personality characteristics emphasize the 
same preferences in different situations. For 
example, one might find a relationship be- 
tween the use of a particular method of dis- 
sonance reduction and use of a particular 
method of defense. While such research is 
necessary, there is the task of finding a variety 
of experimental situations in which there are 
comparable alternatives for dissonance reduc- 
tion and nonselective situational constraints. 
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. COMPARISON OF INTERVIEWS WITH QUESTIONNAIRES FOR 
MEASURING MOTHERS’ ATTITUDES TOWARD 
SEX AND AGGRESSION* 


ROBERT R. SEARS 
Stanford University 


40 mothers of preschool children were measured by 2 methods on 5 attitudinal 
variables relating to permissiveness and punitiveness about sex and aggression. 
One was a semistructured interview coded independently by 2 coders; the 
other was a set of objective mother attitude scales. Coder reliability of the 
former and odd-even reliability of the latter were equivalent and satisfactory. 
Correlations between the 2 methods, for individual variables, ranged from .22 
to .67 for 21 boys’ mothers and from .53 to .63 for 19 girls’ mothers. The 


interview measures generated more statistically significant correlations with 
M independently obtained measures of children’s behavior, but the attitude scales 


correlated better with the mothers’ observed behavior, and proved more 
effective for replicating group differences discovered in previous studies, 


No method of measuring mothers’ child- 
rearing behavior and attitudes has escaped 
criticism on one ground or another, though 
each seems to have some special advantages, 
too, which makes it suitable for use with 
certain kinds of problems. The present report 
provides a comparative evaluation of two 
methods—coded interviews and paper-and- 
pencil attitude scales—which appear to be 
useful for different purposes. 

The retrospective mother interview, so ap- 
pealing for its relative ease of administration 


`. and its economy (compared with longitudinal 


methods) in covering many topics in a brief 
time, has been demonstrated to have very 
poor validity with respect to factual historical 
matters such as age of weaning or extent of 
use of scheduled feeding when a concurrent 


‘interview is used as the criterion (Robbins, 


1963). As a measure of current attitudes, 
however, it has whatever face validity may 
be attributed to the expression of feeling 
under the particular conditions of a given 
interview. While students of psychodynamics 
who focus primarily on unconscious processes 
may consider a 1-3 hour interview superficial 
with respect to the uncovering of deeper at- 
titudes of ambivalence, behaviorally oriented 
researchers assume that what the mother says 


1 Acknowledgement is made for financial support 
by a Ford Foundation grant to Stanford University 
for research on mental health, and to Grant M-5398 
from the United States Public Health Service. 
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reflects what she is feeling, though not neces- 
sarily what she has done. In a statistical 
sense, doubtless unconscious ambivalence and 
openly expressed feeling both contribute to 
the variance of measures of such child- 
influencing variables as permissiveness and 
punitiveness. What proportion of the total 
variance each contributes is a matter for 
empirical determination, not argument. 

Even for measurement of current attitudes, 
however, the semistructured mother interview, 
as often used, has certain limitations for any 
research that is designed to utilize dozens or 
hundreds of mothers as subjects. It requires 
an inconvenient amount of time for both 
mother and interviewer, it is expensive to 
transcribe and code, and it provides adequate 
reliability of coding only after arduous train- 
ing of the coders. Furthermore, the final 
quantitative ratings on attitudinal variables, 
from interviews, are almost impossible to 
compare from one study to another (e.g., rela- 
tive permissiveness for aggression toward par- 
ents in two samples of mothers studied by 
different investigators with different coders). 

By contrast, paper-and-pencil attitude scales 
are economical in both administration and 
scoring, a factor of particular importance if 
a research design calls for the selection of 
extreme attitudinal groups from a large gen- 
eral population. Also, they do permit compari- 
son between groups, regardless of experi- 
menter, and thus have an obvious value for 
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cross-national or other group comparisons (cf. 
Lynn & Gordon, 1962). Recent research on 
the effects of response set in various types of 
paper-and-pencil scales, however, raises the 
question as to whether a given attitudinal 
variable can be measured with the same reli- 
ability and validity by an objectively scorable 
attitude scale, as by ratings based on the 
freer, though more cumbersome, interview. 
The present study compares the two methods 
when both were used for measuring the same 
five variables in a group of 40 mothers of 
preschool children. 


METHOD 


In connection with a study ° of the child-rearing 
correlates of identification in 4-year-old children 
(Sears et al., in press), 40 mothers were interviewed 
(by Lucy Rau) with a semistructured interview 
similar to that used for the study reported in Pat- 
terns of Child Rearing (Sears, Maccoby, & Levin, 
1957). From 2 to 7 weeks later, each mother filled 
out a set of five attitude scales designed to meas- 
ure five of the same attitudinal variables meas- 
ured from the interview. In addition, each mother 
and her child were observed together in two half- 
hour mother-child interaction situations, and a 
measure of one of the five variables was obtained 
from that procedure. 

The study, performed during the summer of 1958, 
was focused on the behavior of that child of the 
mother’s who was then registered in the Stanford 
Village Nursery School. The 40 children ranged from 
4-2 to 5-7; 19 were girls and 21 were boys. While 
all but 5 of the children had one or more siblings, 
the interview was directed quite specifically toward 
the mother’s feelings, attitudes, and practices as these 
related to the one child under study. None of the 
children had any significant behavior problems, and 
none of the mothers appeared to have labeled her 
child as having such. We are inclined to believe that 
the conditions for obtaining reliable and valid in- 
formation by the mother-interview method were 
about as satisfactory as one could wish. The attitude 
scales, coming at the end of the summer’s research, 
in which the mothers were very much involved, were 
probably approached by the mothers as having 
special relevance to the research-group child, but the 
items in the scales did not require such a focus to 
the same degree as did the interview. 


Mother Interviews 


The interview required two sessions of 1-1.5 hours 
each, but the material on sex and aggression was 


?The present data are taken from a study in- 
volving the collaboration of a dozen members of our 
Laboratory; theit contributions are noted with 
appreciation in Sears, Rau, and Alpert (in press). At 
an earlier stage, Ralph Haber assisted in preparing 
the mother attitude scales. 
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all covered during the first session. The interviews 
were recorded, transcribed, and then were coded by 
two well-trained coders, working independently. The 
sections of the interview dealing with attitudes 
toward the control of sex and aggression were suf- ` 
ficiently like those of the Sears et al. (1957) study 
that the same scales for measurement of the follow- 
ing five variables were used in coding; the numbers 
in parentheses refer to the variables listed in Sears 
et al. (pp. 505-507): 


1. Permissiveness for indoor nudity (I-53) 
2. Permissiveness for masturbation (II-56) 
3. Permissiveness for social sex play (II-58) 
4: Permissiveness for aggression toward parents 
(III-30) g 

5. Punishment for 
(III-31) 

These five variables were chosen for this compara 
tive study for the following reasons. First, they are 
predominantly attitudinal rather than behaviorally” 
factual; they are of the “I feel” rather than “what 
I do” type. Hence they are probably less susceptible 
to direct and intentional distortion or retroactive 
falsification. Second, the expressions of feelings in the 
interview are commonly couched in verbal terms that 
are very similar to those of “agree-disagree” items 
that are appropriate to objective attitude scales. 
Third, because they require more interpretation and 
more subtle evaluation by the coder, these variables 
are both difficult and expensive to measure by inter- 
view, and if an easy and inexpensive method can 
be found that replicates the results of an interview, 
there is considerable to be gained for future research. 


Mother Attitude Scales 


The five paper-and-pencil attitude scales for meas- 
uring the above variables were constructed in the 
form of sets of declaratory sentences, each sentence 
being prefaced by five boxes labeled, from left to 
right, strongly agree, agree, not sure, disagree, and 
strongly disagree. The instructions on a cover sheet 
indicated that these statements had all been made | 
by mothers of 5-year-old children in the couise of 
interviews, that the statements referred mainly to 
feelings about sex and aggression, and that the 
present effort was to secure information about how 4 
mothers felt concerning these matters. 

The scales were constructed from actual statements 
culled from the 379 original interviews (Sears et al, 
1957). These interview typescripts yielded 167 non- 
duplicative single or double sentence quotations con- 
cerning feelings relevant to the five variables. The 
sentences, edited somewhat for grammar and sim- 
plification, were published in a lithographed booklet. 7 
Items on a given topic (e.g., nudity permissiveness) 


aggression toward parents” 


8 Keyed copies of the five mother attitude scales 
have been deposited with the American Documenta- 
tion Institute. Order Document No. 8381 from 
ADI Auxiliary Publications Project, Photoduplication 
Service, Library of Congress, Washington, D. C. 
20540. Remit in advance $1.25 for microfilm or $1.25 
for photocopies and make checks payable to: Chief, 
Photoduplication Service, Library of Congress. 
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were grouped together under the following five head- 
ings: mothers’ statements about modesty, mothers’ 
statements about sex play with self, mothers’ state- 
ments about sex play among children, mothers’ state- 
ments about angry behavior (punitiveness items), 
and mothers’ statements about emotional self-con- 
trol (permissiveness items). 

For purposes of item validation, a priori scoring 
‘was established for all items. Depending on the 
content, either the agree or disagree end of the five- 
box scale in front of a given sentence was specified 
as the “high” end for that: item. For example, for 


' the statement “I would never undress either my 


son or my daughter if the other were in the same 
room,” the strongly disagree box was scored “high,” 
because disagreement represents positive permissive- 
ness about nudity, to which scale this item obviously 
-belongs. The high box, if checked, was given a score 


“tof “5,” and the other boxes were scored in order 


from “4” down to “1.” The total score for a given 
scale was simply the sum of the scores obtained on 
the individual items belonging to that scale. 

Completed booklets were obtained from a highly 
heterogeneous sample of 135 San Francisco women. 
The age range was 18-65; education was from 
grade-school-incomplete to PhD; religion included 
Protestants, Catholics, and Jews; marital status in- 
cluded unmarried, married, widowed, and divorced. 
Each of the five scales was scored and treated sepa- 
rately, of course. Individual item distributions were 
examined for the top and bottom 27% of the sample. 
Items were discarded if the frequency of the com- 
bined 4 and 5 scores did not differentiate from the 
remainder of the sample at the .05 level, or if there 
were a nondiscriminating imbalance in the distribu- 
tion (e.g, 95% of the whole group marked 4 or 5), 
or if marginal comments indicated frequent incom- 
prehension or irritation. There were 79 items remain- 
ing after this weed-out; the exact number remain- 
ing for each of the five scales is shown in Table 1. 
The numbers of items having agree and disagree as 
“high” are also shown. 

These items formed the final set of scales used 
for the present comparative study. Examples of two 
items from each of the five scales are given here to 
indicate their flavor; the high direction of scoring 


is given in parentheses, 


Permissiveness for nudity: 
(Disagree) I just tell our daughter that neither 
her father nor I take our clothes off so she mustn’t 
take hers off either. 
(Agree) I feel it’s all to the good for my girl 
to see her brother undressed now and then, for 
„when she’s older and learns about sex. 


Permissiveness for masturbation: 


(Disagree) I think if you allow a child to play 
with himself, it may lead to worse habits. 

(Agree) I think if you just leave them alone they 
will outgrow it. 


Permissiveness for social sex play: 


(Disagree) If I ever hear any of that giggling and 
whispering when the kids are playing in the garage, 
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I step right in and put an end to it. 

(Agree) I can’t see the harm of a little sex play, 
if it doesn’t happen too often. After all, they don’t 
understand what they’re doing other than just 
playing. j 


Permissiveness for aggression toward parents: 


(Disagree) There is enough meanness in the world 
already, and I want my children to grow up to be 
always kind and respectful to others. 

(Agree) If Janey never sassed back, I’d think there 
was something wrong with her. 


Punishment for aggression toward parents: 


(Disagree) Spanking kids just makes them madder 
than ever and doesn’t accomplish a thing. 

(Agree) Most of all I want the children to have 
good manners, and they’ll never learn them unless 
they’re punished for naughty behavior. 


Mother-Child Interaction 


Immediately preceding each of the two interview 
sessions, the mother and child spent a half hour 
together in a room equipped with inconspicuous 
recording equipment and a one-way mirror. These 
interaction situations were somewhat structured by 
the mother on the basis of instructions given her 
in advance. On the first occasion she spent the initial 
15 minutes filling out a questionnaire while her child 
played with some rather uninteresting toys. During 
the second 15 minutes she showed the child how 
to use a telephone in a booth, and instructed him 
to make several telephone calls that required role 
playing. On the second occasion, the first 15 minutes 
were used for the child to solve some difficult per- 
formance tests, and the second 15 minutes were 
spent on a fishing game that involved messy play 
with a large and very full tub of water, 

The hour of interaction was recorded on tape, and 
an observer in the mirrored booth simultaneously 
dictated a running account of the behavior. This 
protocol material was then used for making ratings 
on a number of scales characterizing the behavior 
of both mother and child, One scale was mother’s 
punishment of aggression. This was a 4-point scale 
ranging from “no disapproval or restriction of ag- 
gression” to “strong disapproval (irritation, ridicule, 
strong restriction, for example, directs child to stop 
his behavior) when child is aggressive.” A single 
rating was made on the protocols for all four situa- 
tions together. Two independent raters obtained 94% 
agreement on their ratings of 10 interviews. A full 
description of the mother-child interaction procedure, 
and the scoring, has been prepared by Sears et al. 
(in press). 

The interaction situation did not lend itself to 
measurement of the three sex permissiveness vari- 
ables, and initial efforts to construct different scales 
for permissiveness and punishment for aggression 
were not successful. While the retained scale is called 
punishment, it contains some elements of permissive- 
ness as well, and can be used as a validation measure 
against which both the permissiveness and punish- 
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ment measures obtained from the interviews and the 
paper-and-pencil attitude scales can be evaluated. 


RESULTS 
Reliability 

There are obvious differences between the 
operations that can be defined as “reliability 
measurements” for two such different methods 
as coded interviews and summed scores of 
attitude scale items. No really exact compari- 
son can be made. Each method, however, pro- 
vides one point in the measurement process 
at which parallel but procedurally independent 
values are obtained. With the recorded inter- 
views, there is a typescript that provides a 
common source of information for two coders 
who might well be considered as two “meas- 
uring instruments.” With the attitude scales, 
there are two sets of items—the “odd” and 
“even” ones in sequence—which give two 
separate measures. In both instances, the 
summing of the pair of procedurally inde- 
pendent measures may be expected to increase 
the reliability of the total score to the extent 
provided by the Spearman-Brown formula. 

In Table 1, these two sets of “reliability co- 
efficients” are shown for the present group 
of 40 mothers, together with comparable cor- 
relational measures of rater reliability on the 
same scales in the earlier study (Sears et al., 
1957). 

So far as the interviews are concerned, the 
reliability of coding is partly dependent on 
the completeness of data provided by the 
mother, of course. The columns labeled N in 
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Table 1 indicate the number of mothers’ 
reports that could be evaluated. In the Sears 
et al. (1957) study there were 379 mothers 
who contributed interviews, but despite the 
probing of the interviewers, there were 42 
mothers (11%) who contributed too little 
information about their punishment of aggres- 
sion to be rated on this variable. Even on the 
subject of masturbation, on which the fullest 
information was given, 12 of the 379 mothers 
gave reports too meager to be rated. A similar 
incompleteness appears in the interview data 
from the Stanford study. The Ns show that 
the typescripts of from one to five mothers 
could not be coded on one or another of the. 
five scales. In contrast, all 40 mothers com- 
pleted the attitude scales, and perhaps it 
should be noted that these scales were pre- 
sented at the very end of the rather arduous 
summer, when several of the mothers were 
packing for vacation and all were looking to 
the relief of no further obligation to the 
research project. 

The coefficients in Table 1 show that the 
interview coding was somewhat more reliable 
in the present study than in the Sears et al. 
(1957) study, particularly with respect to the 
two aggression scales. The reason for this 
probably lies in the fact that a single pair of 
coders did all the 40 Stanford cases, while 
eight different coders, working in pairs, did 
the same task for the Sears et al. study. In 
any case, by the comparison being made here 
with the interview coding, the attitude scales 
had about equal or better reliability for the 


TABLE 1 


RELIABILITY COEFFICIENTS (r) FOR CODING OF MOTHER INTERVIEWS AND FOR Opp VERSUS 
Even Item Scores ON MOTHER ÅTTITUDE SCALES 


Interview coding 
Mothe aende Sanba S as in 
les mother attitude scales 
AR Sears et al. (1957) Present study 
Spear- Spear- Spear- 
N» | Raw Hee Ns | Raw ae N: | Raw oss Total | Agree | Disagree 

Nudity permissiveness 356 | 84 | 91 | 35 | 92 | 96 | 40 | 85 | 92 
Masturbation permissiveness | 367 | 72 | 84 | 38 | 73 | 84 | 40 | 85 | 92 i5 3 3 
Sex play permissiveness 353 | 68 | 81 | 35 | 71 | 83 | 40 | 88 | 94 17 6 11 
Aggression permissiveness 359 | 71 | 83 | 39 | 85 | 92 | 40 | 76 86°] 20 7 13 
Punishment for aggression 337 | 62 | 77 | 37 | 92 | 96 | 40 | 76 86 15 10 5 


Note.—Decimal points for z's are omitted. 
a See text for explanation of Ns. 
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TABLE 2 
INTERCORRELATIONS AMONG MOTHER INTERVIEW SCALES, SEPARATELY By SEX OF CHILD 


Code number and maternal variable 


Interview scale 


23 30 32 58 60 
23. Nudity permissiveness 63 54 59 31 
30; Masturbation permissiveness 51 27 50 —41 
32. Sex play permissiveness 57 47 40 —41 
58. Aggression permissiveness 12 17 54 —66 
60. Punishment for aggression —22 =32 —56 —66 


Note.—Boys above the diagonal; girls below. 


three sex permissiveness scales, but not quite 
as good reliability for the two aggression 
Scales. 

For neither method were any data of a test- 
retest character obtained that would provide 
information on the consistency, over time, of 
the mothers themselves. 


Interscale Relations within Methods 


In Tables 2 and 3 are shown the intercor- 
relations among the five dimensions as meas- 
ured by each of the two methods, separately 
for mothers of boys and girls. In general, the 
intercorrelations among the attitude scales are 

„larger (medians: boys’ mothers = .65, girls’ 
mothers = .76) than those among the mother 
interview measures (medians: boys’ mothers 
= 48, girls’ mothers = .49). 


Relations between Methods 


In Table 4 are shown, separately by sex 
of child, the correlations between all five di- 
mensions measured by both interview and 
attitude scales. The five pairs of figures in 
the diagonal, from upper left to lower right, 
are the self-correlations for each dimension, 


the basic data for evaluating the equivalence 
of the two methods. 

Examination of the comparable coefficients 
for the mothers of boys and of girls reveals 
a marked discrepancy on two of the variables, 
sex play permissiveness and punishment for 
aggression toward parents. The mothers of 
boys have r’s of .27 and .22, respectively, 
while the girls’ mothers have 7’s of .63 and 
.59, similar in size to the r’s for the other 
three scales, The only other difference between 
the sex-of-child groups can be summarized by 
the statement that for nudity permissiveness, 
the attitude scale measure gave substantially 
higher correlations with the other four scales 
for mothers of boys while the interview meas- 
ure gave higher ones for girls’ mothers. The 
implications of this difference are not obvious. 


Comparison with Behavior in Mother-Child 
Interaction 


In Table 5 are shown the correlations be- 
tween the two aggression measures obtained 
by each of the two methods and the rating 
of punishment for aggression obtained from 
the mother-child interaction situation. The 


TABLE 3 
INTERCORRELATIONS AMONG MOTHER ATTITUDE SCALES, SEPARATELY BY SEX OF CHILD 


Attitude scale 
Code number and maternal variable 
165 166 167 169 168 
165. Nudity permissiveness 69 65 66 —47 
166. Masturbation permissiveness 76 87 65 —38 
167. Sex play permissiveness 82 76 68 —65 
169. Aggression permissiveness 61 73 73 =i 
168. Punishment for aggression —61 —76 —76 176 


Note.—Boys above the diagonal; girls below. 
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TABLE 4 
CORRELATIONS BETWEEN INTERVIEW AND ATTITUDE SCALE MEASURES, SEPARATELY BY SEX OF CHILD 
Mother attitude scales 
165. 166. 167. a 169. a i 
i m ‘unishmen: 
pacar eae Sane epee pamela Boa Pressly for aguression 
Gua late SESTE G BMI G B 
23. Nudity permissiveness 63 56 32 72 68 80 23 56 | —29 | —50 
30. Mastirbation permissiveness 43 53 56 66 52 58 18 37 | —41 | —15 
32. Sex play permissiveness 61 12 30 22 63 27 23 21 | —32 | —20 
58. Aggression permissiveness 64 39 44 56 44 61 53 67 | —53 | —47 
60. Punishment for aggression —53 | —17 | —34 | —26 | —39 | —29 | —40 | —55 59 22 
Note.—G = girls, B = boys. 
behavior from which the latter measure was importance to the stability of the mother 


secured occurred during the half hours im- 
mediately preceding the two interview ses- 
sions; hence half the contribution to the 
mother-child interaction measure was from 
the period adjacent in time to the interview 
session in which aggression was discussed. 
Despite this greater temporal closeness for 
the interview measures, the two attitude scales 
correlated more highly with the observed be- 
havior than did the interview measures. Three 
of the four interview measures showed no re- 
lationship whatever to the observed behavior 
and the fourth is moderate (in the correct 
direction) but nonsignificant. The attitude 
scales for girls’ mothers are significantly cor- 
related, while those for boys’ mothers are 
nonsignificant but in the right direction. 
From these data one may conclude that the 
attitude scales provided a better estimate of 
the mothers’ overt behavior than did the 
interview scales, but there is one cautionary 
point to be made. Two half-hour sessions pro- 
vide a relatively small sample of behavior 
for both mother and child. Possibly the 
meagerness of the child sample is of even more 


measure than is the meagerness of her own 
behavior sample, because her punitiveness 
could be estimated only when the child was 
aggressive; if he provided an unaccustomedly 
small stimulus frequency, his mother’s respon- 
siveness could be improperly assigned a low 
rating.* To what extent such a source of error 
entered into this set of measures is impossible 
to tell with the data available. 


Other Indirect Indications of Validity 


Since neither of the two methods being 
compared can be considered to provide a cri- 
terion measure of any of the five attitudinal 
variables, the concept of validity is perhaps 
inappropriate. One can ask, however, how 
useful each method is for revealing relativn- 
ships either that have been discovered else- 
where, or that imply antecedent-consequent 
connections between maternal attitudes and 
child behavior. 


*The intersession consistency could not be calcu- 
lated because several children did not express ag- 
gression in one or the other half-hour session, and 
hence the sessions could not be rated separately. 


TABLE 5 
CORRELATIONS OF INTERVIEW AND ATTITUDE SCALE MEASURES OF AGGRESSION CONTROL WITH 


Moruer’s PUNISHMENT OF AGGRESSION IN MOTHER-CHILD INTE: 


'RACTION 
Interviews Attitude scales 
Scale 
Code number Girls Boys Code number Girls Boys 
Permissiveness 58 —05 27 169 —51 —29 
Punitiveness 60 —04 —03 168 44 18 
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TABLE 6 


MEANS AND STANDARD DEVIATIONS OF BRITISH MIDDLE-CLASS AND WORKING-CLASS MOTHERS AND OF STANFORD 
Nursery SCHOOL MOTHERS ON THE MOTHER ATTITUDE SCALES 


Sample 
British Stanford 
Scale 
Middle class Working class Total Total 
(N = 43) (N = 51) (N =94) (N = 40) 
M SD M SD M SD M SD 
165. Nudity 48.56 | ~7.46 | 40.88 7.99 44.55 8.66 54.65 1.07 
166. Masturbation 47.28 8.22 
167. Sex play 60.10 9.47 
169. Aggression permissiveness 53.13 6.59 49.61 8.87 51.30 8.00 60.13 10.57 
168. Aggression punitiveness 42.44 8.63 46.67 6.98 44.65 8.07 41.90 7.31 


Note.—British data from Lynn and Gordon (1962). 


Sex-of-child differences. In the Sears et al. 
(1957) study, a significant difference was 
found in the frequency of high permissiveness 
for expression of aggression toward parents 
reported by the mothers of the two sexes of 
children, boys being given greater latitude. 
There was also a nonsignificantly greater 
permissiveness for masturbation by boys. The 
other three variables showed no sex-of-child 


differences. 


In the present data, the interview measures 
reveal no mean difference between the mothers 
of boys and those of girls on any of the five 
scales. The attitude scale measures, however, 
show a slightly greater permissiveness for 
aggression (p = .08) by the mothers of boys; 
the other scales show entirely nonsignificant 
‘differences in the same direction. Thus, the 
attitude scale measures replicate the interview 
findings of the Sears et al. study somewhat 
better than do the interview measures. 

Class and national differences. Using three 
of these attitude scales (permissiveness for 
indoor nudity, permissiveness for aggression, 
and punitiveness for aggression) with a sam- 
ple of British mothers from a nonmetropolitan 
area, Lynn and Gordon (1962) found the cus- 
tomary difference in permissiveness-punitive- 
ness between middle-class and working-class 
mothers. The former were the more permissive 
or nonpunitive on-all three scales at the 
p=.05 level. These findings, shown in 
Table 6, replicate several American studies 
(Bronfenbrenner, 1958). Lynn and Gordon 


also reported less permissiveness for the Brit- 
ish than for the Stanford mothers on the two 
permissiveness scales (p = .05) and somewhat 
greater punitiveness (p = .10), but this find- 
ing must be treated with considerable caution 
because of the nonrepresentative character of 
the Stanford sample. 

Child behavior correlates. Another way of 
comparing the two sets of measures is with 
respect to their effectiveness in generating 
possibly meaningful relations between inde- 
pendent (child rearing) and dependent (child 
behavior) variables. In the present study we 
had 63 measures of child behavior that were 
obtained either from time-sample observa- 
tions in the nursery school or from stand- 
ardized assessment situations. The kinds of 
behavior measured included dependency, 
aggression, adult role, masculinity-femininity, 
resistance to temptation, and guilt. Fifteen of 
the 63 measures were summary scores, and 
hence were not independent of the 48 direct 
measures from which they were derived. If we 
consider only these 48 basic variables, the 
three interview measures of sex permissive- 
ness, for boys, produced 18 correlations reach- 
ing the .05 level of significance, while the 
parallel attitude scale measures produced 15; 
for girls, the comparable figures are 10 and 1. 
The attitude scales made an even poorer 
record with respect to aggression. There were 
7 significant correlations produced by the two 
interview scales for boys’ mothers, and only 
one produced by the attitude scales; for the 
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girls’ mothers, the comparable figures are 
10 and 0. ; 

While the boys’ mothers’ attitude scales on 
sex permissiveness generated almost as many 
significant correlations as the interview meas- 
ures did, the girls’ mothers’ did not; and for 
both sexes of children the aggression attitude 
scales were largely useless by this criterion. 
A similar comparison of the correlations gen- 
erated by the two types of measures with the 
summary scales of the child behavior leads 
to the same conclusion. 


CONCLUSIONS 


The reliability estimates of the two types 
of parental attitude measures leave little to 
choose between the methods. Both are reason- 
ably satisfactory from this standpoint. The 
correlations between the two measures are not 
high, but they are significant. The intercor- 
relations among the five scales are substan- 
tially higher for the attitude scales than for 
the interview measures (medians: .70 and .49, 
respectively). Possibly the element responsible 
for this greater communality among the atti- 
tude scales is also responsible for attenuating 
the correlations between the independent and 
dependent variables, for the evidence seems 
persuasive that the interview measures gen- 
erated a greater number of significant correla- 
tions with individual measurements of child 
behavior. On the other hand, the attitude scale 
measures relating to aggression correlated 
more highly than did the interview measures 
with the mothers’ observed behavior in the 


mother-child interaction situation, and the 
comparison of both sex differences and social 
class differences discovered by the two kinds 
of measures indicates that the attitude scales 
provided a better replication of earlier find- 
ings than did the interview measures. 

Until there is opportunity to determine the 
replicability of these various findings, the 
safest position to take with respect to utiliza- 
tion of the two methods of measurement is 
that the attitude scales are the more eco- 
nomical and the more effective for group 
comparisons, or for selection of extreme 
groups, while the interview method is the 
better for generating antecedent-consequent 
relationships within groups. 
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í HOPE, EXPECTANCY, AND REACTIONS TO 
THE UNEXPECTED’ 


O. J. HARVEY ann WILLIAM F. CLAPP 
University of Colorado 


Using a scale of 15 characteristics, 79 Ss rated themselves, rated another person, 
indicated how they hoped and expected to be rated by the other, and com- 
pleted a scale on self-esteem. Following exposure to fictitious ratings from the 
other person that deviated from their own expectancies by 1 of 2 amounts 
toward either the confirmation or refutation of hope, Ss repeated the above 
ratings and completed a 3-item scale concerned with feelings of positivity- 
° negativity toward self and other and the degree of pleasure-disappointment 
produced by the feedback. Significant differences were found between the 
feedback that deviated from expectancy toward hope and that which deviated 
from expectancy away from hope, indicating that directionality as well as 
magnitude of a discrepancy is an important determinant of its effects, It was 
also found that Ss who expected higher ratings from others reacted more 
positively than Ss of lower expectancies to unexpectedly favorable feedback 
but that in response to unexpectedly unfavorable ratings the reverse was true. 
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That the behavioral and affective conse- 
quences of a stimulus input is a function of 
its perceived discrepancy from or congruity 
with some internal standard or subjective 
base line of the recipient is an assumption 
common to psychologists of diverse back- 
grounds and theoretical interests. Much less 
agreement exists, however, on what the factors 


- are that determine the construal of a base- 


line-event incongruity and, hence, that deter- 
mine the psychological effects produced by it. 
Probably the most frequently investigated 
variable, under one label or another, has been 
the magnitude of the discrepancy. But typi- 
cally this factor has been studied in isolation, 
with little or no concern with its probable 
interaction with such closely related variables 
as the direction of the discrepancy from the 
base line and the mature of the base line from 
which an event is gauged and evaluated 
(Harvey, 1963). 

Of the wide variety of internal factors that 
doubtlessly coalesce into, or in some way 


1 Financial support for this study was provided 
by, the Group Psychology Branch, Office of Naval 
Research, under Contract Nonr 1147 (07) with the 
University of Colorado. In addition to ONR, thanks 
are extended to Diane Davies and to Larry Olsen 
for their help in conducting the experiment and 
preparing the data for analysis. 

Reproduction of this study in whole or part is 
permitted for any purposes of the United States 
Government. 
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become aspects of, the intraorganismic norms 
or yardsticks from which an input is sensed 
as deviant or congruent, expectancy, that 
which the organism has come to anticipate as 
the most probable occurrence from a class of 
possible events, seems to have received the 
most widespread attention. McClelland, At- 
kinson, Clark, and Lowell (1953), for ex- 
ample, specifically treated expectancy as the 
adaptation level, hypothesizing, in their notion 
of the butterfly curve, that small deviations 
from expectancy in either direction would 
produce positive affect along with its motiva- 
tional consequences, that complete confirma- 
tion of expectancy would result in boredom 
and some negative affect, and that large dis- 
crepancies from expectancy in either direction 
would generate more negative affect and 
stronger efforts toward reducing it. 

Concern of other psychologists with be- 
havior that did not fit the traditional models 
of tension reduction led them to posit the 
related notion of optimal stimulation, the 
optimal being represented by what the organ- 
ism was accustomed to or had come to expect 
(Berlyne, 1960; Glanzer, 1958; Hebb, 1955; 
Leuba, 1955; Montgomery, 1954). Deviation 
from the organism’s base line of habituation, 
in the form of either too much or too little 
stimulation, too much or not enough departure 
from expectancy, is assumed to lead to unrest 
and arousal of effort toward bringing the level 
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of stimulation within certain ranges of the 
base line. Glanzer (1958) expressed this 
position clearly: 

The increase or decrease of activity with respect to 
parts df the environment is a function of the differ- 
ence between the average amount of information [or 
stimulation] the individual is accustomed to and 
the current rate of flow of information from the 
environment [p. 312]. 


Hebb (1955) suggests that the maintenance 
of a particular level of stimulation or of a par- 
ticular relation between input and expectancy 


is . . . a phenomenon of the greatest importance for 
understanding motivation in higher animals. This is 
the positive attraction of risk taking, or mild fear, 
and of problem solving, or mild frustration . . . 
[p. 250]. 


Too varied a pattern of stimulation, too much 
deviation from expectancy, too high a level 
of arousal, on the other hand, is assumed to 
lead to efforts at reducing the stimulation by 
withdrawing from the novel and more complex 
and moving toward the simpler and more 
familiar. 

It very well may be that in those instances 
in which the organism is not evaluative, that 
is, has no preference for one outcome over 
another, expectancy may operate as the pre- 
dominant, conceivably even the sole, internal 
standard against which inputs are coded and 
assigned their affective and motivational 
weighting. And it also may be that such 
instances are restricted to the simple sensory 
level. But in many of its discriminations and 
responses to the environing world, the organ- 
ism is not a neutral and passive recipient 
of the impingements. Man, especially, senses, 
perceives, judges, and experiences affect in 
terms of his preferences, hopes, and values, 
internal states which give a directionality to 
motivational predilections and render the 
individual selective and evaluative in varied 
aspects of his psychological functioning. To 
become evaluative means that the individual’s 
interpretive schemata have “good” and “bad” 
poles, and that events which are perceived 
as confirming these schemata by facilitating 
attainment of a valued end are experienced 
as positive and something to approach while 
events that are perceived as refuting these 
evaluative standards and goals are experienced 
as negative and something to avoid. 


The activation of hope or preference would 
seem to dictate, therefore, that the direction 
of a discrepancy from expectancy, as well as 
its magnitude, must be considered, because 
deviations of equal magnitude could produce 
markedly different effects. Deviation from 
expectancy toward achievement of hope or a 
valued end should produce positive affect and 
positive construal of the causal event while 
deviation from expectancy of the same magni- 
tude toward refutation of a value or sought 
end should result in the opposite outcomes 
(Harvey, 1963; Harvey, Hunt, & Schroder, 
1961). 

It was with a facet of this latter question 
that this study was concerned. Subjects were 
exposed to controlled feedback which devi- 
ated from their expectancies by equal magni- 
tudes toward either the confirmation or 
refutation of hope. More specifically, subjects 
indicated how they expected and hoped to be 
rated by a relative stranger on 15 character- 
istics and, in addition to rating themselves 
and the stranger on the same scale, they com- 
pleted a scale aimed at measuring their self- 
esteem. Following exposure to ratings from 
the stranger that deviated from the subject’s 
expectancy by comparable amounts toward 
either confirming or refuting hope (which was 
higher than expectancy), each subject redid 
the above ratings as well as completing a 
3-item scale that asked his degree of pleasure- 
disappointment and his feelings of positivity- 
negativity toward himself and the other per- 
son as a consequence of the evaluations made 
of him. This provided eight dependent vari- 
ables, five relating to change in ratings of 
expectancy, self, the other person, and self- 
esteem and three relating to effective conse- 
quences of the feedback. 

From results of earlier related studies 
(Harvey, 1962; Harvey, Kelley, & Shapiro, 
1957; Van Ostrand, 1960) it was considered 
almost certain that variation in the magnitude 
of the discrepancy from expectancy would 
produce significant effects. Our main concern 
in this study, therefore, was less with magni- 
tude and more with the effects of variation in 
the directionality of the feedback discrep- 
ancies from expectancy (i.e., toward the con- 
firmation or refutation of hope) and the ef- 
fects of differences in pretreatment expectan- 
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= REACTIONS TO THE UNEXPECTED 


cies on reaction to evaluation from another 
person. 

Notions such as those embodied in the 
butterfly curve hypothesis (Haber, 1958; Mc- 
Clelland et al., 1953) would lead to the pre- 
diction that the discrepancies of equal magni- 
tude should have comparable effects, with the 
smaller deviations, both toward and away 
from hope, producing positive affect and with 
the larger discrepancies producing negative 
affect and related behavioral consequences. 

On the effects of pretreatment expectancies, 
it would be predicted from such theoretical 
positions as that of Festinger (1957; Aronson 
& Carlsmith, 1962; Brehm & Cohen, 1962; 
Carlsmith & Aronson, 1963) that persons who 
expected more favorable evaluations from 
another, because of greater consonance of the 
inputs with those cognitive elements embodied 
in expectancy, should react more positively 
to unexpectedly favorable feedback than 
would individuals of lower expectancies 
while persons of lower expectancies should 
react more positively, or less negatively, to 
unexpectedly unfavorable evaluations from 
another. ? 

From the anticipation of directional effects 
from the operation of hope, we hypothesized, 
in contradistinction to what would be- pre- 
dicted from the butterfly curve and related 
notions, that: 

1. The effects of the deviations from 
expectancy toward the confirmation of hope 
would be significantly different from the ef- 
fects of equivalent deviations toward the 
refutation of hope on all dependent variables 
except changes in hope and self-esteem. 

Hope and self-esteem were assumed to be 
more central than the other variables and, as 
a consequence, to not be significantly affected 
by the experimental treatments. 

In contradistinction to the dissonance 
theory of Festinger, we predicted that: 

2. The subjects who expected to be evalu- 
ated more poorly by others would be more 
adversely affected by unexpectedly unfavor- 
able ratings and more positively affected by 
unexpectedly favorable ratings than would 
the individuals who expected higher ratings 
from another person. 

Hypothesis 2 is based on the assumption 
that persons of lower expectancy are lower 
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in self-esteem and as a consequence feel a 
greater need for positive regard from others 
despite how they anticipate being perceived 
by others. In a sense, they hope to have their 
low expectancies refuted by being more 
favorably appraised by others. Such "indi: 
viduals, it was reasoned, should feel better 
than persons of higher expectancy from un- 
expectedly favorable evaluations and feel 
worse than persons of higher expectancy from 
an unexpectedly negative appraisal from an- 
other. Thus we are saying that dissonance 
from expectancy, when it involves confirma- 
tion of hope, should produce positive and not 
negative consequences. 


METHOD 
Instruments 


Four scales were used in the part of the study 
to be reported here. 

The scale on personal characteristics, used to ob- 
tain the subjects’ pre- and posttreatment ratings 
of expectancy, hope, self, and source of feedback, 
consisted of 15 characteristics, such as friendliness, 
dependability, competence, and maturity, presented 
on a single sheet in graphic form, with only the 
opposite poles of each characteristic (viz, very 
mature and very immature) indicated at the ends 
of the 7-inch lines. The subject was instructed to 
respond to the scale by checking the point on each 
line that best represented his view or position on 
the particular factor being rated. The algebraic sum- 
mation of the difference (in Yo inch units) between 
a subject’s pre- and posttreatment ratings of hope, 
expectancy, self, and source across all 15 character- 
istics provided the measure of change on these 
variables. 

The scale of self-esteem, administered both before 
and after the experimental treatment, consisted of 
24 short evaluative statements about the self, most 
of which were taken from a self-concept measure 
constructed by Dymond (1953). Included were such 
statements as “I feel good toward myself,” “I feel 
worthless,” and “I feel personally significant.” The 
subject responded to each statement by circling one 
of six numbers from +3, which indicated that the 
subject perceived the statement as very characteristic 
of him, to —3, which indicated that the subject 
interpreted the statement as very uncharacteristic of 
him. A subject's self-esteem score was represented 
by the algebraic summation of his ratings on the 
6-point scale, and the change in his self-esteem by 
the difference between the sums of the pre- and 
posttreatment ratings. 

In line with the intent of the design to restrict 
the source of feedback to a person with whom the 
subject was not acquainted and of whom he had 
not formed an impression, the index of acquaintance 
scale requested the subject to indicate in the provided 


48 


spaces the names of persons in the class with whom 
he was acquainted or about whom he had formed 
some impression. The subject then raged each name 
he had written in terms of how well he knew its 
owner, from very slightly to extremely well. The 
persons whose names the subject had indicated were 
eliminated as possible sources of feedback for him. 

The fourth instrument, aimed at measuring di- 
rectly the subject’s affective response to the negative 
and positive feedback, consisted of three, 7-point 
scales, that asked the subject to indicate his feelings 
of positivity-negativity toward himself and the other 
person as well as his feelings of pleasure-disappoint- 
ment produced by the feedback ratings. 


Procedure 


The experiment was carried out in two sessions. 

At the beginning of Session 1 packets containing 
three copies of the Personal Characteristics scale and 
one copy each of the Acquaintance and Self-Esteem 
scales were passed out. Subjects were instructed that 
the study was one in social perception, that 


we are interested in how accurately you perceive 
yourself and others. To measure this, you will be 
asked to make certain ratings of yourself on the 
forms you have been provided. 


Using the scale of personal characteristics, subjects 
then rated self (how they perceived themselves), 
indicated how they hoped to be rated by a classmate 
whom they did not know very well, and checked 
where they expected to be rated by the same person. 
They then provided their list of acquaintances and 
completed the scale on self-esteem. 

In Session 2 subjects: rated another person on 
the 15 personal characteristics (the relative stranger 
who subsequently served as the source of the feed- 
back ratings they received) ; received feedback about 
themselvse that deviated from expectancy by either 
«5 or 1.5 inches toward either confirmation or refuta- 
tion of hope; rerated expectancy, hope, self, and 
the source of the feedback ratings; completed the 
Self-Esteem scale a second time; and indicated on 
the 3-item scale their affective response to the feed- 
back, To minimize the chance that the completion 
of one of these tasks might affect response on an- 
other, these five phases of the second session were 
carried out in randomized order. 

The experimental manipulation consisted of expos- 
ing the subject to ratings which he was led to believe 
erroneously had been made of him by a relatively 
unknown classmate. These treatment ratings were 
prepared by the experimenters during the week be- 
tween Sessions 1 and 2. Since this study was con- 
cerned with the effects of controlled deviation of 
feedback from expectancy, the fictitious ratings were 
made discrepant from the subject’s initial ratings of 
expectancy by the average amount of either .5 or 1.5 
inches toward either confirming or refuting hope. To 
prevent the subject from suspecting the validity of 
the feedback, the ratings on an approximately equal 
number of characteristics were made to be slightly 
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more negative than, slightly more positive than, and 
equal to the desired average discrepancy. This meant 
that, although hope was higher than expectancy, the 
larger discrepancy toward hope sometimes resulted 
in hope being exceeded by the feedback rating. 

Subjects were exposed to one of the four feedback 
conditions in the following way. At the beginning 
of Session 2 subjects were seated alphabetically, in 
order that the fictitious ratings could more cor- 
rectly and more expeditiously be given to them, and 
were passed out a copy of the’ Personal Character- 
istics scale on which they were to rate the other 
person. They were then advised: 


Now we want you to rate one other member of this 
group, a person whom you hardly know, if at all. 
The name of the person you are to rate [always a 
person of the same sex as the subject] appears at 
the top of the Personal Characteristics scale after 
“Ratings of.” Make sure that you have the copy 
that has your name on it after “Ratings by.” 


The experimenter acknowledged the difficulty of rat- 
ing a stranger and, to ease this difficulty and to 
make the task less objectionable to the subjects, he 
asked that each person announce aloud his (her) 
name, major, and college or university usually at- 
tended, 

The ratings made of the other person were taken 
up as soon as they were completed in order that 
they could be surreptitiously exchanged for the 
earlier prepared ratings. Subjects were then in- 
structed: 


Now we want your help in determining the ac- 
curacy of your expectancy ratings, that is, how 
closely the ratings made of you by the other 
person agree with what you expected from a class- 
mate you didn’t know very well. To do this, you 
will need the expectancy ratings you made during 
the first session as well as the ratings the other 
person has just made of you. 


The experimenter apologized to the subjects for not 
telling them in advance that the ratings they made 
of the other person would be seen by that person, 


explaining that 


we were afraid if we told you the other person 
would see the ratings you made of him or her 
you would not be as candid and sincere in your 
ratings as you would be without this information. 


Subjects’ expectancy ratings, made during Session 1, 
and the fictitious ratings of them, which they as- 
sumed to have just been made by the person whom 
they rated, were passed back to them, Then, to make 
sure that the subjects noticed the departure of the 
feedback ratings from their expectancy ratings, they 
were instructed: 


Now we want you to calculate the difference be- 
tween your expectancy on each of the 15 char- 
acteristics and the rating made of you on that 
characteristic by the other person, 


° 
Subjects were requested to determine the discrepancy 
by placing the sheet with their expectancy ratings 
(which, for this purpose, was fairly transparent) 
over the sheet containing the ratings they erroneously 
assumed to have been made by the other person, to 
count the number of typewriter dashes between their 
initial expectancy and the received rating on a 
particular characteristic and to write this figure in 
the margin. Upon the subjects’ completing the cal- 
culation of the discrepancies, and hence exposing 
themselves to them, the forms involved in this were 
taken up and the subjects were passed out four 
copies of the Personal Characteristics form for re- 


. ratings of hope, expectancy, self, and the other per- 


son, a copy of the Self-Esteem scale for rerating, and 
the scale to measure the affective consequences of 
the feedback. The scales were completed in random- 
ized order, 

*.At the end of the experiment subjects were dis- 
abused and urged not to reveal the nature of the 
study to other students, That they did as requested 
is indicated by the fact that questioning of each 
group of subjects by the experimenter failed to 
reveal a single instance of premature disclosure about 
the study, 


Subjects 


Seventy-nine students, 39 males and 40 females, 
attending the University of Colorado during the 
summer of 1963 served as subjects. This number 
was distributed among the four discrepancy condi- 
tions as follows: .5 inch toward hope, 20; .5 inch 
away from hope, 16; 1.5 inches toward hope, 20; 
and 1.5 inches away from hope, 21. The number of 


“males and females within each feedback condition 


was, with the exception of the 1.5 deviation away 
from hope, identical. While most of the subjects were 
second- and third-year undergraduates, some were 
seniors and a few were graduate students. Subjects 
were either paid $1.00 an hour for their participation 
or, in the case of some who were taking introductory 
psychology, given lab credit for serving as subject. 
The experiment was carried out in a series of small 
grotips, from 4 to 10 students each. 
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RESULTS 


The first step in the analysis consisted of 
determining whether or not pretreatment dif- 
ferences existed among subjects of the four 
feedback conditions on any of the five vari- 
ables from which change scores were to be 
obtained: self-esteem, hope, expectancy, rat- 
ings of self, and ratings of other. The fact, that 
no significant difference was found means that 
any obtained effects can be attributed to the 
treatments and not to pretreatment differences 
between groups. 

The second step in the analysis consisted 
of intercorrelating the eight dependent vari- 
ables in order to establish their degree of inde- 
pendence and to determine whether they 
should be treated separately or pooled with 
one or more other measures. The highest ob- 
tained r was .48, between degree of disap- 
pointment and feeling toward other. With the 
exception of the .43 correlation between de- 
gree of disappointment and change in ratings 
of other, none of the other 7’s exceeded .35, 
most falling below .20. Thus it is appropriate 
to analyze each of the eight dependent vari- 
ables separately. 

Our main theoretical concern was with the 
effects of the direction of deviation from ex- 
pectancy, toward or away from hope (subse- 
quently termed positive and negative discrep- 
ancies, respectively), and with the effects of 
pretreatment expectancies of ratings from 
others. Accordingly, a two-way analysis of 
variance (for unequal Ns) involving the four 
feedback variations and dichotimization of 


TABLE 1 
MEAN EFFECTS OF THE Four FEEDBACK CONDITIONS 
Deviation (in inches) from expectancy 
Dependent variables F 
1.5 5 -1.5 -5 
Change in self-ratings AT 35 14 02 3.42* 
Change in other ratings 70 62 —.07 31 10:580, 
Change in expectancy 45 .29 —.13 03 3.03 
Change in hope 18 05 —.14 —.14 ae 
Change in self-esteem* 6.00 3.57 5.67 2.50 Rasy 
meei | a | i | GR | uie 
eel toward other* 2 x $ . n 
Disappointment 4.88 4.81 2.94 3.79 13.81** 


$ These four variables were measured in larger units than the other four. 
0S. 


< 05. 
** p < .01, or better. 
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TABLE 2 


COMPARATIVE EFFECTS OF DISCREPANCIES OF EQUAL 
MAGNITUDE BUT OF DIFFERENT DIRECTIONS 


Deviation (in inches) : 
from expectancy from expectancy may be inferred more clearly 
Se Anei from Table 2 which presents the comparisons 
ependent variables | 1,5 — (—1.5) | .S-(-5) of the discrepancies of equal magnitude but 
a aa ae of different directions for each dependent vari- 
able. In confirmation of our hypothesis, the 
pron neti p ae eH oct six variables that were affected significantly 
‘hangein other ratings | - ; “ 5 P ay 
Change in E 7 |S] ‘26 Loo» DY magnitude of deviation from expectancy 
Change in hope .32 | 1.58 D .97 were also influenced significantly by the direc- 
Change in self-esteem* .33 | 08 1.07 | .40 ; . * i 
Feel toward self 3 latte iia lasos tonality ortae discrepancy. Changes in self 
Feel toward other* 1145 |oo] 93 |336 ratings, other ratings, and expectancy, as 
Disappointment* 1.94 |5.16**| 1.02 |3.49** well as pleasure-disappointment and feelings 


© These four variables were measured in larger units than 
the other four. 
* > <.05, one-tailed test. 
++p < .01, one-tailed test. 


pretreatment expectancy scores (into groups 
of higher and lower expectancy) was run for 
each of the eight dependent variables. Certain 
specific comparisons were then made by ¢ 
tests. 


Directionality of Deviation from Expectancy 


The mean effects of the four deviations from 
expectancy on each of the dependent variables, 
along with corresponding F values, are pre- 
sented in Table 1. These results indicate that 
variation in feedback produced significant 
effects on all but two of the dependent vari- 
ables. Hope and self-esteem, presumably be- 


TABLE 3 


MEAN EFFECTS OF THE Four FEEDBACK CONDITIONS FOR SUBJECTS OF HIGHER AND LOWER 
PRETREATMENT EXPECTANCIES 


c 


cause of their greater centrality, were, as ex- 
pected, unaffected by the experimental manip- ; 
ulations. 

The effects of the directionality of deviation — 


toward self and others, were significantly more 
positive under the positive than under the 
negative deviations from expectancy. Differ- 
ences between comparable discrepancies, al- 
though not significant, were in the same 
direction for changes in hope and self-esteem. 

It would seem that these results permit the 
clear inference that not only the magnitude of 
the deviation of an input from expectancy but 
that direction of the discrepancy, toward the 
confirmation or refutation of hope, also con- 
tributes significantly to the affective and be- 
havioral consequences of an event. The fact. 
that equivalent deviations from expectancy 
produced different effects depending on their 
relationship to hope indicates that such dis- 
crepancy hypotheses as those embodied in the 
notion of the butterfly curve (Haber, 1958; 


1.5” deviation .5" deviation —1.5” deviation —.5" deviation 
Dependent variablen; ii tio. High | Low High Low | High Low High 4 
expect- expect- | expect- expect- expect- | expect- expect- expect- 
ancies ancies ancies ancies ancies ancies ancies ancies 
Change in self-ratings 33 71 at 51 23 .07 —.21 “15 3.86* 
Change in other 
ratings 59 83 10 1.11 —.32 55) 03 .78 | 25.40°* 
Change in expectancy 45 44 36 .23 —.12 —.13 17 22 1.39 
Change in hope —.07 59 05 .04 —.12 16 —.12 —.18 25 
Change in self-esteem* 7.92 2.50 3.20 3.91 8.00 3.92 1.90 3.50 .03 
Feeling toward self* 4.55 4.35 4.33 5.23 4.07 3.75 3.36 4.17 2.73* 
Feeling toward other* 4.62 4.50 4.06 5.36 2.82 3.36 3.44 3.97 3.12* 
Disappointment* 458 | 5.38 | 430 | 527 | 302 | 288 | 3.60 | 410 | 4.626 


i These four variables were measured in larger units than the other four. 
p <.05. 


*p <.01. 


McClelland et al., 1953) do not hold when 
‘aspiration and preferences are called into play. 
‘Unlike what would be predicted from the 
butterfly curve hypothesis, the smaller dis- 
crepancies in this study did not consistently 
produce more positive reactions than the 
larger deviations. Rather, as can be seen from 
Table 1, the larger positive discrepancy re- 
sulted in more positive responses than the 
smaller positive deviation on all dependent 
variables but one. At the same time, the 
smaller negative deviation produced more 
negative reactions than the larger negative 
discrepancy on three of the dependent vari- 
„ables. 


Pretreatment Expectancies of Ratings from 
Others 


The mean effects of the four feedback con- 
ditions on each of the eight dependent vari- 
ables for subjects of lower and higher expec- 
tancies, along with F values for variation in 
expectancy, are presented in Table 3. Changes 
in ratings of self and the source of the feed- 
back ratings, feelings toward the source and 
toward the self, and degree of pleasure-disap- 
pointment were significantly affected by the 
lower-higher variation in expectancy; changes 

` in hope, expectancy, and self-esteem were not. 

The results in Table 4 indicate that the 
significant Fs from the lower-higher difference 
in pretreatment expectancies were generated 
mainly by the effects of the ~wo smaller feed- 
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e 


51 


back discrepancies. Under both the 1.5 and 
— 1.5 conditions no significant difference was 
found between subjects of lower and higher 
expectancies for any of the dependent vari- 
ables. However, under the .5 and — .5, dis- 
crepancies the lower-higher difference in pre- 
treatment expectancy significantly affected 
five of the eight dependent variables. In con- 
tradiction of our prediction, under the .5 feed- 
back condition subjects of higher. expectancies 
were affected significantly more positively 
than subjects of lower expectancies on changes 
in ratings of self and others, feelings of posi- 
tivity-negativity toward self and others, and 
in the degree of pleasure-disappointment. 
Under the — .5 discrepancy just the reverse 
pattern occurred: in confirmation of our pre- 
diction, subjects of lower expectancies were 
affected significantly more negatively than 
those of higher expectancies on the same five 
measures. 

The fact that only under the less extreme 
feedback conditions did differences in pre- 
treatment expectancies have any significant 
influence probably means that the more ex- 
treme deviations represented stimuli of such 
intensity that they dominated the resulting 
response, and in so doing limited the effects 
that might have resulted in differences in ex- 
pectancy. 

The fact that only one part of our hypothe- 
sis concerning the effects of pretreatment ex- 
pectancies was verified seems to be explain- 


TABLE 4 


Comparative EFFECTS OF THE FOUR FEEDBACK CONDITIONS FOR SUBJECTS OF HIGHER AND LOWER 
PRETREATMENT EXPECTANCIES 


1.5” deviation 


.5" deviation 


—1.5” deviation —.5” deviation 


Dependent variables Low E — High E 


Low E — High E 


Low E — High E Low E — High E 


M diff. t M diff. t M diff, t M diff. t 
Change in self-ratings —.38 1.58 —.34 1.78* 16 85 —.36 1,79* 
Change in other ratings | —.24 89 —1.01 5.15** — 43 2.16 —.75 2.69*+ 
"Change in expectancy 01 .00 3 64 01 00 —.05 .02 
Change in hope —.66 1.51 01 .00 04 08 06 .09 
Change in self-esteem* 5.42 -13 —.71 12 2.08 58 —1.60 AL 
Feeling toward self* .20 A3 —.90 | 2.51** 32 59 —.81 1.87* 
Feeling toward other* 12 -26 —1.30 3.10** —.54 1.39 —.53 1.81* 
Disappointment* —.80 1.44 —.97 2,4944 14 19 —.50 1.76* 


a These four variables were measured in units smaller than the other four. 


*p < .05, one-tailed test. 
** > < .01, one-tailed test. 
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able in the following way. In line with our 
assumption, subjects of lower expectancy were 
lower in self-esteem than were individuals 
who expected higher ratings from others (r 
between initial expectancy and self-esteem = 
26, p < .05). While apparently being more 
hurt than persons of higher expectancy by 
unexpectedly unfavorable feedback, the indi- 
viduals of lower expectancy seemed less able 
than persons of higher expectancy to accept 
the’ more favorable evaluations from others. 
These results and this interpretation are in 
agreement with the findings of Van Ostrand 
(1960) that subjects of lower self-esteem were 
more adversely affected by negative feedback 
but, owing to their tendency to discount 
favorable evaluations from others, were at the 
same time less positively influenced than in- 
dividuals of higher self-esteem by evaluations 
that were more positive than their ratings of 
themselves. 

While not negating the possibility that in 
the absence of preferred end-states expectancy 
may be the primary determinant of the effects 
of a deviant input, these results do suggest 
clearly that when hope and concomitant eval- 
uation are involved the direction of the dis- 
crepancy, as well as its magnitude, is an im- 
portant determinant of the behavioral and 
affective outcomes. 
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White and Negro Es administered digit-symbol tasks of 3 levels of difficulty to 
Southern male Negro college students. The task was described as a test of 
eye-hand coordination, On the most difficult task Ss worked more efficiently 
when tested by a white than when tested by a Negro. On the 2 easier codes the 
skin color of the administrator had no effect. Next, 2 additional groups of 
Negro Ss were tested by the same Negro and white Es on the most difficult 
task only, but now the task was described as a test of intelligence, rather than 
as a test of motor coordination. The effect of the IQ instructions was to ele- 
vate slightly performance with a Negro tester and to lower scorces markedly 
in the white-tester group, with the net result that the white-tester and Negro- 
tester groups performed at the same level in the hard task-IQ instructions 


situation. 


Recently, Katz and Greenbaum (1963) 
reported an interaction effect of racial en- 
vironment and stress upon the digit-symbol 
‘performance of Negro male college students. 
Individual Negro subjects worked better in 
the presence of whites than of other Negroes 
when there was no threat of strong electric 
shock, but in a shock-threat condition this 

_ relationship reversed, so that performance was 
then better in the Negro environment. The 
present experiment was designed to explore 
further the factors that influence Negro ef- 
ficiency in different racial settings. Using non- 
intellectual instructions, white and Negro 
testers individually administered digit-symbol 
tasks of three levels of difficulty to Negro 
male students; in addition, one of the tasks 
was also presented as an intelligence test. 

In order to create a strong variation of the 
racial environment, Katz and Greenbaum had 
individual Negro subjects work in the pres- 
ence of two persons who were both either 
Negro or white, a task administrator and a 
confederate who posed as another subject. 

1This research was carried out under Contract 


Nonr 285 (24) between the Office of Naval Research 
and New York University. 
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Their experiment did not ascertain whether 
the race of the task administrator alone (i.e; 
in the absence of the confederate) had an 
impact upon Negro behavior. The present 
investigation dealt with this question. If the 
color of the tester is found to influence Negro 
efficiency, one can then attempt in future 
research to identify the differential character- 
istics of white and Negro testers that underlie 
the effect. In addition, this finding would have 
obvious implications for the interpretation of 
Negro scores on tests of ability and capacity. 

Katz and Greenbaum used only one digit- 
symbol task. This fact limits the generality 
of their findings, since the effect of motiva- 
tion on performance is known to depend upon 
the nature of the task. For example, Sarason 
and Palola (1960) have shown that such 
motivational factors as test anxiety and test 
instructions have different effects on white 
subjects’ digit-symbol performance, depending 
on whether the code is relatively hard or easy. 
If one regards anxiety as drive or activa- 
tion (following, among others, Duffy, 1957; 
Easterbrook, 1959; Malmo, 1957; Schlosberg, 
1954; Spence, 1958) then Sarason and 
Palola’s results appear to be consistent with 
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the hypothesis, first formulated by Yerkes 
and Dodson (1908), that the optimal in- 
tensity of motivation is inversely related to 
the difficulty of a learning task. Spence and 
other Hullians have developed a theoretical 
basis for this hypothesis in terms of response 
interference, and have obtained supporting 
evidence from studies on the learning of 
competitional and noncompetitional paired- 
word lists by subjects varying in drive level 
(Spence, 1958). Broadhurst (1959) has re- 
viewed other experimental findings that are 
consistent with the Yerkes-Dodson hypothesis. 

Returning to the present problem, Katz and 
Greenbaum’s data suggest that a white tester 
arouses more drive in Negro male subjects 
than does a Negro tester. If this is so, it 
would follow from the Yerkes-Dodson hy- 
pothesis that on a suitably easy task Negroes 
ought to perform better with a white tester 
than with a Negro tester, while on tasks of 
increasing difficulty this relationship ought to 
tend toward reversal. Therefore, it was de- 
cided to employ three digit-symbol codes of 
varying degrees of difficulty, the easiest of 
which had been used successfully by Katz and 
Greenbaum to show that when personal threat 
was low, Negroes worked better with whites 
than with other Negroes. 

In all experimental conditions, Katz and 
Greenbaum used instructions which empha- 
sized that the task had no significance as 
an index of intellectual ability. How would 
intelligence-test instructions affect Negro ef- 
ficiency? According to the Yerkes-Dodson 
hypothesis, for most tasks the optimal 
strength of motivation lies somewhere in a 
broad middle range of intensities—that is, the 
relationship between drive as the independent 
variable and efficiency tends to have the form 
of an inverted U. Recently, Burgess and 
Hokanson (1964) obtained data that approxi- 
mated an inverted U function for the effect 
of drive, measured by heart rate, on the digit- 
symbol performance of white college students. 
Let us assume that in the Katz and Green- 
baum experiment, under the low-threat condi- 
tion, the high digit-symbol scores that Negro 
subjects obtained in the white environment 
reflected a level of drive that was relatively 
close to the optimum, while the lower scores 
of Negro subjects in the Negro environ- 


ment resulted from insufficient drive. Since 
intelligence-test instructions are likely to be 
more motivating than neutral instructions, 
when the former are used the total activation 
of Negroes with a white administrator might 
become too high for efficient performance, 
while that of subjects with a Negro adminis- 
trator might rise closer to the optimum. — 
Hence, when a task is presented as an intel- 
ligence test, there would be a tendency toward 
a reversal of Katz and Greenbaum’s finding 
for the low-threat condition. This prediction 
was to be tested in the present experiment by 
selecting that level of difficulty of the digit- 
symbol task which, under neutral instructions, | 
showed maximum advantage for subjects with | 
a white tester as compared to subjects with | 
a Negro tester, and then running two addi- 
tional groups on that task under intelligence- 
test instructions. 

Restating the predictions: (a) when a digit- 
symbol task is described to Negro subjects as 
nonintellectual, the presence of a white tester 
will facilitate performance more than will the 
presence of a Negro tester on a relatively 
easy version of the task, but this relationship 
will tend to reverse on medium and hard 
versions; (b) on the version that shows the 
most advantageous effect of a white tester 
in comparison with a Negro tester, the in- 
troduction of intelligence-test instructions 
will have the effect of lowering scores with 
a white tester and of raising scores with a 
Negro tester. 


METHOD 


Subjects 


One hundred and eighty-four male Negro under- 
graduates at Fisk University volunteered as subjects 
and were paid $1.00. Forty-five were from the 
the North and the rest were from the South. None 
of the men was acquainted with the test adminis- 
trators or had previously been used in research of 
this kind. 


Procedure 


Two student assistants, both male Negroes, re- 
cruited 138 boys in classrooms and scheduled them 
in small groups for a pretest session. The assistants 
administered a hard, medium, or easy version of 
the digit-symbol task. Oral instructions were as 
follows: 


We want you to work on a task as part of a 
Psychology Department research project. The task 
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has nothing to do with your course grades or your 
aptitude as a student. Still, it will help us if you 
do the best that you can... . 


: Subjects were then given task booklets and were 

further instructed that they contained several digit- 

i symbol pages, all with the same code at the top. 

A procedure was described for starting and stopping 

. work on each page when the administrator signaled. 

The booklets actually contained, in addition to a 

face sheet, four digit-symbol pages with the same 

code at the top of every page. All pages had four 

fr „rows of empty boxes, 19 to a row, with digits above 

the empty boxes that were arranged in a different 

random order on each page. Three 1-minute trials 

were administered. The purpose of the fourth digit- 

symbol page was to conceal the fact that there were 

sA only three trials, and thus prevent an end-spurt on 

i + Trial 3. 

In the first phase of the experiment, which dealt 

with the effects of variations in task difficulty and 

i race of tester, 46 subjects were given the easy task, 

Š in which the symbols consisted of letters of the 

alphabet. Forty-six men had the code of medium 

difficulty, which was identical with the Wechsler 

subtest. Finally, 46 received the hard code, in which 

the symbols resembled sticks with triangular flags, 

arranged to create strong response interference effects. 

Approximately the same proportion of Northern and 

Southern students were assigned to each of the three 

code tasks. On the basis of their total scores on 

5 the preliminary trials, subjects in each of the three 

subsamples were split into matched groups that were 

assigned to the white-tester and Negro-tester condi- 

tions. The matching was done by equating group 

means. The ratios of Northern to Southern students 

turned out to be very similar in all treatment groups. 

A few days after the first session each subject 

individually worked on the same task again, this 

time in the presence of either a white or a Negro 

male adult experimenter whom he had never seen 

. before, The experimenter introduced himself as Dr. 
, and then said: 


` e Iam a psychologist and I am doing research on 

eye-hand coordination. I have the code task that 

you did the other day, and now you are going 
a, to do a code task for me, to give me a more 
complete picture of how you work on this type 
of task. This is not an intelligence test, and it has 
nothing to do with your course grades or your 
aptitude as a student. Nonetheless, it is important 
that you do the best that you can.... 


The subject was informed that the present task 
had more trials than the earlier one. Then he was 
‘given a booklet which contained six digit-symbol 
pages, all having the same code and format as 
the previous task. Five 1-minute trials were ad- 
ministered. At the back of the booklet was a brief 
questionnaire which „dealt with task motivation, 
perception of the task, and expectations about being 
compared with other students. 

When the data were scored it was found that the 
relative advantage of subjects in the white- 
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administrator condition, as compared with those in 
the Negro-administrator condition, was greatest on 
the hard task. Hence, this task was used for the 
intelligence-test condition. Several days after the first 
phase of the experiment had been completed, 54 
additional subjects were pretested on the hard task, 
and then two groups of 23 were selected in a way 
that seemed to provide the closest equation of pre- 
test means with one another, and with the pretest 
means of the previous two hard-task groups. This 
was done by setting up 27 matched pairs, discarding 
the 2 most heterogeneous pairs, and then discarding 
2 additional pairs in order to close a small gap 
between the means of the 2 new groups and the 
means of the 2 old groups. The new groups were 
assigned to the same Negro and white testers and 
subjected to the same second-session procedure as 
were the earlier groups, except that instead of de- 
scribing the task as a measure of eye-hand coordina- 
tion the following was said: 


I am a psychologist and I am doing research on 
the measurement of intelligence. I have the code 
task that you did the other day, and now you 
are going to do a code task for me to give me 
a more complete picture of your ability on this 
type of mental test. This test measures the speed 
and accuracy of your intellectual processes, as well 
as how quickly you can learn abstract material. 
It also reveals your capacity for concentration. 
Eventually this test will be part of a new intel- 
ligence-test battery that will be used in colleges. 
Your score on this test will not be shown to your 
teachers at Fisk. It will be used only for the 
purpose of developing the new intelligence-test 
battery. ... 


Criterion of Efficiency 


The index of digit-symbol efficiency was the dif- 
ference between total scores in the first and second 
sessions when the second-session scores were ad- 
justed for the greater number of trials by multiplying 
them by #. 

RESULTS 
Performance 


The first prediction stated that when the 
task was defined nonintellectually a white 
tester would facilitate performance more than 
would a Negro tester on the easy version, 
but that this relationship would tend to 
reverse on the medium and hard versions. 
This prediction was evaluated by means of a 
2x3 analysis of variance of subjects’ ef- 
ficiency scores when the task was defined as 
a test of eye-hand coordination, Results of 
the analysis of variance and direct compari- 
sons of cell means are presented in Table 1. 
They reveal a significant Difficulty x Race of 
Tester interaction (F= 4.38, p< .05) in 
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TABLE 1 


ANALYSIS OF VARIANCE AND CELL MEANS OF DIGIT- 
SYMBOL EFFICIENCY SCORES FOR THREE LEVELS 
or Task DIFFICULTY AND NEGRO-WHITE 


TESTER 
Source af F 
Racè of tester (R) 1 12 
Difficulty (D) 2 2.23 
RxD 2 4.38* 
Error (w) 132 
Difficulty 
Tester 
Easy Medium Hard 
White 20.26 20.83 28.968 
Negro 21.83 24.61 21.39 


Note.—In all cells, N =23. Individual cell comparisons fol- 
lowed Winer's (1962, PP. 207-209) method. 

The hard task/white tester mean is significantly larger 
(p <.02) than each of the following: medium-white, easy-white, 
hard-Negro, easy-Negro. No other between-cell comparisons are 
significant. 


* p <05. 


which the only significant difference between 
scores under white and Negro administrators 
occurred on the hardest task. Here the white 
person occasioned higher scores than did the 
Negro (b < 05). Thus, the nature of the 
interaction of Difficulty and Race of Tester 
was not as predicted. 

After the first prediction had been tested, 
an intelligence-test condition was added to the 
experiment, using only the hard task. The 
second prediction was that intelligence-test 
instructions would lower the efficiency of sub- 
jects with a white administrator but improve 
the performance of subjects with a Negro 
administrator. It can be seen in Table 2 that 
the prediction was upheld by a significant 
Instructions X Race of Tester interaction 
(F = 5.17, p<.05). Comparison of cell 
means shows that subjects tested by a white 
person scored significantly lower under intel- 
ligence-test instructions than under motor-test 
instructions (p < .05). IQ instructions tended 
—though not significantly—to facilitate the 
performance of subjects tested by a Negro. 
Thus, the net effect of defining the task 
intellectually was to eliminate the perform- 
ance differences due to Race of Tester that 
occurred under the motor-test instructions. It 
is also noteworthy that the motor test/white 
administrator treatment was significantly 
more favorable for performance than any 


other treatment ( < .05 for all individual 
comparisons). 

Performance curves of the four Race of 
Tester X Instructions groups on the five sepa- 
rate trials of the second session are shown in 
Figure 1. These curves are based upon actual 
scores in the second session, instead of the 
more refined efficiency scores (second session 
— first session) which were used for the 
analysis of variance. Figure 1 reveals that the 
differences in performance among the four 
groups were consistent on every trial. In- 
cluded are the mean scores of groups on the 
third trial of the pretest. These show that 
the groups were closely equated on initial, 
ability. 


Questionnaire Responses 


The terminal questionnaire began with an 
item which required self-ratings on 8-point 
scales of three dimensions of task experience: 
nervous-calm, tense-relaxed, and cared about 
doing well-did not care. Subjects’ scores on 
the first two scales were combined to form a. 
“stress” index, and subjected to separate 
analyses of variance for Race of Tester 
X Difficulty (3 X 2), and Instructions X Race 
of Tester (2 X 2). There were no significant | 
effects. The most suggestive finding was ob- 
tained in the 2 X 2 analysis, where subjects 
tested by a white person tended to report 


TABLE 2 


ANALYSIS OF VARIANCE AND CELL Means or DIGIT- 
SYMBOL EFFICIENCY SCORES FOR Two TYPES OF 
TEST INSTRUCTIONS AND NEGRO-WHITE 
TESTER ON HARD TASK 


Source df F 
Race of tester (R) 1 3.83* 
Instructions (I) 1 1.23 
RXI 1 E k a 
Error (w) 88 

Test instructions 

Tester 

Intelligence Motor 
White 22.91 28,96" 
Negro 23.48 21.39 


Note.—In all cells, N =23. Individual cell comparisons fol- 
lowed Winer’s (1962, pp. 207-209) method. 
* The motor test/white tester mean is significantly higher 
(@ <.05) than every other cell mean. No other between-cell 
comparisons are significant. 
* p <.10, 
* p <.05, 


WHITE TESTER 


TRIALS 


a. Last trial of pretest 
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Intelligence 


TRIALS 


Fic. 1. Mean digit-symbol scores on five 1-minute trials of the hard task for Negro subjects 
who had a white or a Negro experimenter and motor-test or intelligence-test instructions. 


(N = 23 for every group.) 


greater stress than did those tested by a Negro 
(F = 2.61, p < .15, for the Race of Tester 
main effect). There were no significant 
differences on the “cared” scale. 

_ Another set of questionnaire items required 
ratings on 8-point scales of the extent to 
which each of the following types of ability 
was tapped by the digit-symbol task: ‘intel- 
ligence,” “speed of reaction,” “memory,” 
“muscular coordination,” and “learning abil- 
ity.” Two analyses of variance were per- 
formed. One employed a 3 (Levels of Dif- 
ficulty) X 2 (Race of Tester) X 5 (Types of 
Ability) design, and the other a 2 (Types 
of Instructions) X 2 (Race of Tester) X 5 
(Types of Ability) design. Both analyses 
showed a significant main effect for Abilities 


(for both,  < .01). When mean ratings of 
the various abilities were compared with one 
another, the same rank order was obtained 
in both sets of data. Speed of reaction re- 
ceived the highest ratings, muscular coordina- 
tion and memory were rated very similarly 
and were next highest, learning ability ranked 
fourth, and intelligence received the lowest 
ratings. (All of these differences were signifi- 
cant at p < .05.) 

In addition, the 2 X 3 X 5 analysis of vari- 
ance-showed a significant interaction effect 
of Abilities and Task Difficulty (p < .05). 
When ratings of each type of ability at dif- 
ferent levels of difficulty were compared, it 
was found that both memory and learning 
ability were seen by subjects as less relevant 


58 


when the task became more difficult ($ < .05 
for differences between ratings of each of 
these abilities in the easy and hard condi- 
tions). Ratings of other abilities were not 
affected by level of difficulty. The second 
analysis of variance revealed a significant In- 
structions X Abilities interaction (p < .005). 
Comparison of mean ratings of each type of 
ability under IQ-test and motor-test instruc- 
tions showed that three abilities were rated 
significantly more relevant in the intelligence- 
test condition: intelligence (p < 01), mem- 
ory ($ < .05), and learning ability (p < .01). 
Thus, the variation in instructions had the 
intended effect upon subjects’ perception of 
the task. 

A final questionnaire item asked the sub- 
ject whether he expected to be compared with 
students at his own college, or other colleges, 
or did not expect to be compared with any- 
one. In every treatment at least 75% of the 
subjects expected to be compared with stu- 
dents at both their own and other colleges. 
There were no significant differences in 
expectations among groups. 


Discussion 


Contrary to our first prediction from the 
Yerkes-Dodson hypothesis we found that 
when tasks were described nonintellectually a 
white tester elicited better performance than 
did a Negro tester only on the hardest of 
three digit-symbol codes, while on the two 
easier codes the color of the administrator 
did not influence efficiency. Possibly the 
three codes were too easy as a group for the 
predicted interaction to occur, Our reason for 
selecting the easiest code was that it had been 
used successfully by Katz and Greenbaum 
(1963) to show that under low-stress instruc- 
tions Negroes scored higher in the presence 
of whites than of other Negroes. Our failure 
to obtain similar results on the easiest task 
may have been due to a relatively weak 
manipulation of the racial environment: Katz 
and Greenbaum created a social setting that 
included both a tester and another “subject,” 
while ours consisted of a tester only. 

The second derivation from the Yerkes- 
Dodson hypothesis was that on a task which 
occasioned better performance with a white 
experimenter than with a Negro experimenter 
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under motor-test instructions, the substitu- 
tion of intelligence-test instructions would 
have the effect of lowering efficiency with the 
white person and of raising it with the Negro, 
This prediction was supported by a signifi- 
cant interaction effect of Race of Tester and 
Instructions on scores for the hardest task. 
In order to relate these results to the Yerkes- 
Dodson hypothesis it is necessary to make 
certain assumptions: first, that the white 
person aroused more drive than did the 
Negro; second, that intelligence-test instruc- 
tions were more motivating than motor-test 
instructions; and third, that the increments 
of drive were additive (Hull, 1943). One can 
then propose that when the task was de- 
scribed as an index of motor skill, activation 
with a Negro tester was too low for efficiency, 
while with a white tester it was closer to the 
optimal level; and when an increment of 
drive was added to both racial conditions 
by the introduction of intelligence-test in- 
structions, total activation with the white 
administrator became excessive, while with 
the Negro administrator it rose closer to the 
optimum. That an intermediate level of drive 
(measured by heart rate) leads to better 
digit-symbol performance than do very high 
or very low drive levels has been shown by 
Burgess and Hokanson (1964), 

There are alternative interpretations of our 
results which deal with the Significance to the 
subject of the various experimental condi- 
tions. Of particular relevance is Atkinson’s 
(1958) conception of motivation as a joint 
function of the subjective probability and 
incentive value of success, Assume that a 
white experimenter has higher status for the 
Negro subject than does a Negro experi- 
menter, so that the white person’s approval 
is more attractive. It follows that when the 
likelihood of winning approval by scoring 
well is equally high whether the tester is 
Negro or white, the subject will work harder 
for the white person. Thus, in the present 
experiment, Negro students performed better 
with a white tester than with a Negro tester 
when the task was Supposed to assess an 
ability which Negroes are not stereotyped as 
lacking (eye-hand coordination). Presenting 
the task as an intelligence test ought to have 
raised the incentive value of achievement in 
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_ both racial conditions, with perhaps a greater 
‘jicrement occurring when the experimenter 
was white (since intellectual approval by a 
white person might be uniquely graitfying to 
the Negro student’s self-esteem). 

But suppose that on an intellectual task 
the Negro subject sees very little likelihood 
of meeting a white person’s standard of ex- 
cellence. In that situation, Atkinson’s theory 
would account for the observed decrement 
in performance under a white tester by postu- 
lating a marked reduction in the perceived 
probability of success. However, when the 
tester is Negro there is no reason to assume 
that the subject’s expectation of success 
would be markedly lower on a task described 
as intellectual than on one that was presented 
as a motor test. In both instances the skin 
color of the tester would suggest to the sub- 
ject that he was to be evaluated by Negro 
standards. Accordingly, performance with the 
Negro tester ought to go up under IQ instruc- 
tions. The fact that it rose only slightly in 
our experiment may be ascribed to the sub- 
ject’s unclarity about the tester’s frame of 
reference for evaluating his score. That is, he 
was not actually informed whether he would 
be compared with norms for Negro students 
only, or with norms for all college students. 
It will be recalled that in aX treatments at 
least 75% of subjects expected to be com- 
pared with students at both their own and 
other colleges. 

A further point to be noted in trying to 
evaluate the experimental findings is that 
across all conditions subjects tended to per- 
ceive the digit-symbol task more as a motor 
test than as an intellectual one (even though 
it was seen as more intellectual under IQ in- 
structions than under motor instructions). 
Common sense suggests that the nonintellec- 
tual appearance of the task may have influ- 
enced subjects’ behavior, though one can only 
guess about the nature of the influence. 

Future research can determine whether the 
Yerkes-Dodson hypothesis or Atkinson’s 
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theory is more useful for understanding Ne- 
gro instrumental behavior in white environ- 
ments. For example, an interesting predic- 
tion from the latter formulation is that if the 
subjective probability of success were , held 
constant across racial conditions, Negro sub- 
jects would display greater efficiency on cer- 
tain types of intellectual test when the ad- 
ministrator was white than when he was 
Negro. 
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THE EFFECT OF VARIABLE PERSPECTIVES ON 


JUDGMENTS OF OPINION STATEMENTS 
FOR THURSTONE SCALES: 


EQUAL-APPEARING INTERVALS ' 
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Bryn Mawr College 


Data from an earlier study were reanalyzed to test methodological and sub- 
stantive hypotheses. The possibility was entertained that the ambiguity cri- 
terion eliminates items biased according to judges’ attitudes. The variable series 
model, however, specifies two types of differences among judges, both types 
pervading all items: differences in level (reference scale origin) and in dispersion 
(scale unit). Furthermore these differences are attributed to judgmental lan- 
guage, rather than to perception. The data were consistent with a reference 
scale interpretation, although the precise predictions of the theory were only 
partially confirmed. Any judgmental differences attributable to reference scale 
parameters, as was the case in this study, cannot be said to invalidate attitude 


scale construction. 


It was nearly 40 years ago that Thurstone 
first published his hypothesis that statements 
of opinion can be rated for the degree of im- 
plied affect without bias due to the rater’s 
own attitude (Thurstone, 1928). In the in- 
terim there have been many experimental 
tests of the hypothesis (Edwards, 1957, pro- 
vides a summary of these studies). Until the 
Hovland and Sherif (1952) study, the gen- 
eral conclusion of this line of research was in 
support of Thurstone’s hypothesis, Most, or 
all, of these studies employed the so-called 
“carelessness criterion,” as prescribed by 
Thurstone. By means of this criterion, judges 
who rate many items in the same category 
are removed from the analysis. Hovland and 
Sherif contended that the carelessness cri- 
terion served to purge the judging group of 
members holding extreme attitudes, thus pre- 
venting a fair test of Thurstone’s hypothesis. 
Without the disputed criterion, they found 
a negative relationship between equal-appear- 
ing intervals scale values and judges’ atti- 
tudes. This relationship was interpreted by 
Hovland and Sherif and by others (e.g., Ed- 
wards, 1957; Loevinger, 195 7; Selltiz, Jahoda, 
Deutsch, & Cook, 1959) as evidence of in- 


1 This study was supported by a grant from the 
National Science Foundation. Gratitude is expressed 
to Thomas M. Ostrom who assisted in every phase 
of the research, and to John Schopler who read 
critically an earlier draft of the manuscript. 


validity (in some sense) of attitude scale con- 
struction as proposed by Thurstone, using the 
method of equal-appearing intervals. 

One purpose of the present paper is to con- 
sider the challenge to attitude scale construc- 
tion by means of the Thurstone procedures 
(Thurstone & Chave, 1929) following dem- 
onstrations of a dependence of scale values on 
judges’ attitudes. To this end, data will be 
Presented in terms of a statistical design 
aimed at some of the critical methodological 
issues implicit in Thurstone’s hypothesis. A 
second purpose of this study is to provide 
information bearing on a theoretical account 
of the relationship between characteristics of 
the judge and his judgments. Accordingly 
the data will be examined for both ethos 
logical and substantive implications. 


Scaling Problem 


Thurstone conceived of attitude as the de- 
gree of affect one holds for some psychologi- 
cal object. With this conception, the key to 
the measurement of attitude lies in finding 
some procedure by which to locate the per- 
son whose attitude is to be measured on the 
affective (favorable-unfavorable) continuum. 
To accomplish this, Thurstone proposed that 
the values of opinion statements on this con- 
tinuum be determined by his law of compara- 
tive judgment, and that the Tespondent be 
assigned a location on the continuum corre- 
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sponding to the ‘average value of the state- 
Ments he endorses. The method of equal- 
appearing intervals, and consequently the 


. substitution of absolute for comparative 


judgments, was offered as a labor-saving de- 
vice to overcome the practical difficulties of 


scaling by the law of comparative judgment 


(Thurstone & Chave, 1929, pp. 16-21). It is, 
therefore, Thurstone’s labor-saving device, 
not his conception of attitude measurement, 
which is challenged by any demonstration of 
a relationship between judges’ attitudes and 
their absolute judgments of item positions on 
the affective continuum. 

“Thurstone scaling” proceeds by collecting 
judges’ ratings of a large pool of items relat- 
ing to a specified topic. Scale values for all 
items in the pool are determined, using either 
the method of equal-appearing intervals or 
its successor, the method of successive inter- 
vals, A sample of the item pool is then chosen 
to serve as the “final scale.” The aim of 
sampling for the final scale is to cover the en- 
tire range of scale values, in roughly equal 
steps, with items that are minimally ambig- 
uous (as measured by the dispersion of rat- 
ings assigned by the various judges to each 
item). 


> Since it is the final scale, not the original 


item pool, which is used to assign scores to 
respondents, it is the final scale which must 
be the focus of any discussion of the validity 
of Thurstone scaling. If it were shown that 


_ the criteria for constructing a final scale 


served to eliminate those items for which 
there is a dependence of scale values on 
judges’ attitudes, then the scaling procedures 
could not be claimed invalid by reason of the 
dependency. In this regard it may be noted 
that Hovland and Sherif (1952) assumed that 
the relationship between judges’ attitudes and 
item values is mediated through item am- 
biguity. Specifically, they hypothesized that 
the bias due to judges’ attitudes is propor- 
tional to the ambiguity of the item. Because 
itéms are selected for the final scale so as to 
minimize ambiguity it is at least possible, as- 
suming the Hovland and Sherif hypothesis, 
that final Thurstone scales are free of this 
source of bias. 

In order to obtain evidence bearing on this 
question it is necessary to test the relation- 
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ship between item values and judges’ atti- 
tudes for items that have been culled accord- 
ing to the rules for the construction of a final 
scale. Furthermore, since generalization re- 
quires replication in the experimental design, 
it is desirable to test this relationship in the 
context of more than one final scale. For 
these reasons, the present design utilizes two 
item-by-item parallel final scales drawn from 
a single item pool. 


Theoretical Problem 


Recently an experimental test of four al- 
ternative interpretations of the influence of 
judges’ attitudes on equal-appearing inter- 
vals scale values was reported (Upshaw, 
1962). Of the four, the data most strongly 
supported a “variable series” model. This 
model was based on Volkmann’s (1951) posi- 
tion that the rating assigned to a particular 
stimulus is a function of its place within the 
range of stimuli which the judge takes into 
account when performing the absolute judg- 
ment task, that is, the judge’s perspective at 
the time of judging, The variable series model 
was intended as a minimal representation of 
Volkmann’s position, related specifically to 
the attitude scaling task. According to Volk- 
mann the most potent determinants of per- 
spective are the extreme items of the series 
to which a judge responds. In psychophysical 
research it is a relatively simple matter to 
insure that all judges adopt the perspective 
implied by the experimental series. However, 
Hunt and Volkmann (1937) showed that per- 
spective may be influenced by stimuli not 
physically present in the series, when these 
stimuli are made salient in an experiment. In 
formulating the variable series model, such a 
role was assigned to the judge’s own attitude, 
under the assumption that the fact of being 
instructed to rate a series of statements relat- 
ing to a controversial issue makes salient to 
the judge his own position. If his own atti- 
tude, thus made salient, exceeds the range of 
the item series, then the perspective of this 
judge, presumably will be broader than that 
implied by the series. The perspective defined 
by the series is assumed to be adopted by 
judges whose own positions lie within the 
range of the series. By this model, the criti- 
cal factor to consider in the absolute judg- 
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ment scaling task is whether the judge’s own 
position is inside or outside the item range. 

The variable series model would completely 
specify the judgment of a particular item on 
the basis of knowledge of two perspective 
parameters (i.e., the stimulus values of the 
upper and lower end items), the stimulus 
value of the item, and the number of cate- 
gories available in the judgmental language. 
For attitude scale construction it may be as- 
sumed that the latter two determinants are 
constant for all judges. Any differences among 
judges in their ratings of a particular item 
are, by the model, attributable to differences 
in the perspectives which they bring to bear 
on the judging task. 

In the earlier study only two possible de- 
terminants of perspective were considered: 
the extreme items of the experimental series, 
and the judge’s attitude, whichever is more 
extreme at either end of the scale. Obviously 
it was expediency, rather than logic, that led 
to restricting consideration of the deter- 
minants of perspective to just these factors. If 
the attitude scaling task serves to make sali- 
ent a judge’s own attitude, it may similarly 
make salient the attitudes of significant ref- 
erence groups or persons. Thus, a judge’s 
Perspective may be determined by a wide 
variety of influences other than his own 
attitude and the items of the series. Research 
will be reported elsewhere dealing with some 
of these nonattitudinal variables relating to 
perspective. However, the present study, rep- 
resenting reanalyses of the 1962 data, con- 
centrates on the two previously discussed 
determinants of perspective. 

The experimental task of the 1962 study 
was the judgment of items in terms of 11 
ordered categories according to the degree of 
favorableness toward the Negro implied by 
each. The data consisted of judgments ren- 
dered under three conditions of item range by 
white college students, partitioned by quin- 
tiles according to scores on the Murphy- 
Likert Scale of Attitude toward the Negro 
(Likert, 1932). Only three quintiles, the first, 
third, and fifth, were retained in the experi- 
mental design (Ns varying from 29 to 45, 
because no two subjects with the same score 
were assigned to different quintiles). These 
groups were designated anti (A), neutral 
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(N), and pro (P). For a total series condition 
(t), all 114 items prepared by Hinckley 
(1932) were presented for judgment. In an- 
other condition, the 28 most pro-Negro items ; 
were removed, thus producing a series aborted |) 
at the P end (a+). In a third condition, | 
the series was aborted at the A end (a —) 
by removing the 28 most anti-Negro items. 

Analysis of the equal-appearing intervals v 
values resulting from the 3 X 3, factorial de- | 
sign yielded an interaction between judges’ 
attitudes and item range. On inspection it was 
confirmed that the pattern of the interaction 
was as predicted by variable series. The dif- 
ferences between the item values in the total © 
series and those in the two aborted series were 
greatest for subjects who presumably re- 
mained “in-range” with the removal of items 
at one end of the series. Compared to P 
judges in the t condition, P judges in the 
a— condition appeared to displace items in 
the anti direction, whereas a similar compari- 
son of A judges in the t and a+ conditions 
indicated that those judges displaced items in 
the pro direction. The variable series rationale 
for this interaction is that out-of-range judges, 
because of their own positions, retain rela- 
tively wider perspectives than do in-range 
judges when items at their preferred end of — 
the scale are removed from the series. This 
rationale is illustrated schematically in Fig- 
ure 1, which is reproduced here from the 
earlier paper (Upshaw, 1962, p. 86). 

The width of a judge’s perspective has im- 
plications beyond those treated in the earlier 
study. Variable series assumes that a judge 
subjectively divides his Perspective into seg- 
ments corresponding to the categories of his 
judgmental language. In general it may be 
assumed that the size of a judge’s typical 
category directly reflects his perspective. A 
judge who takes into account a wide range of 
stimulus values is expected to establish wide 
categories, as compared to a judge with a 
narrow perspective (see Figure 1). This 
means that each judgmental category corre- 
sponds to a segment of the stimulus con- 
tinuum, the size of which is a function of the 
range of stimuli which the judge adopts as 
his perspective. With the same number of 
categories, therefore, the judge with a rela- 
tively wide perspective is expected to dis- 
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Fic, 1, Hypothetical reference scales of in-range and out-of-range judges showing differences in 


tribute a set of items over a smaller range 
than does a judge with a narrower perspec- 
tive. What emerges is a model which essen- 
tially states that the “sophisticated” judge 
with a broad view makes less fine distinc- 
tions than does his “provincial” counterpart 
among elements which they both consider. 
The variable series model, therefore, refers 


* to two classes of judgmental data: that cor- 


responding to Zevel, which is reflected in the 
average rating of a set of items; and that 
corresponding to discrimination, which is re- 
flected in the dispersion of the ratings of a 
set of items. It is not unusual in discussions 
of the psychology of judgment to speak of 
the judge’s reference scale, which implies an 


‘ origin and a unit (cf. Torgerson, 1958, pp. 


78 ff.). The reference scale origin refers es- 
sentially to a point of perceived neutrality in 
terms of some attribute. The origin may be 


- viewed as the conceptual equivalent of adap- 


tation level (Michels & Helson, 1949). Thus, 
the higher the origin, the lower the average 
judgment of a common set of items. As a first 


< approximation the variable series model at- 


tributes the origin of a reference scale to the 
sum (or average) of the stimulus values de- 
fining a judge’s perspective (i.e. the ex- 
treme items at either end of his series). The 
reference scale unit, on the other hand, refers 
to the stimulus range corresponding to a 
judge’s typical response category. Thus, the 
wider the unit, the smaller the dispersion of 
item values assigned to a common set of items. 
The unit size is attributed by the variable 


judgments of an item ¿ and differences in the width of response categories. 


series model to width of the judge’s perspec- 
tive; that is, to the differences between those 
items which define his perspective. 

The reference scale unit, as here defined, 
bears only on one aspect of judgmental dis- 
crimination. In order to show another equally 
important facet to discrimination, and in order 
to make the present formulation more explicit, 
consider the scatterplot relating the work of 
two judges for a common set of items. Sup- 
pose, first, that the two regression lines of 
such a plot were nonlinear, This finding 
would constitute evidence that the two judges 
differ in their judgmental units in ways that 
are related to particular regions of the scale. 
(The assimilation-contrast model of Sherif & 
Hovland, 1961, specifies nonlinearity due to 
“disturbances” in discrimination attributable 
to latitudes of acceptance and rejection. See 
Upshaw, 1964.) If the judgmental unit is not 
constant throughout the scale, there is no 
economy in discussing the judgmental be- 
havior in reference scale terms. In fact, such 
a discussion is likely to be misleading. If, 
however, the regression lines are linear, it 
follows that differences between the judges 
can be totally described in terms of the two 
parameters that define a straight line, namely, 
the intercept and slope. The intercept param- 
eter reflects differences in level of judgments, 
or in the terminology of the reference scale, 
differences in origin. The slope of the regres- 
sion line comprises two meaningful indices of 
judgmental behavior. It will be recalled from 
statistics that the regression coefficient (slope) 
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of y on x is defined as ray (Sy/Sz), where Toy 
is the correlation coefficient and s,/s, is the 
ratio of the two standard deviations. The 
first of these components, the correlation 
between judges, indicates the degree to which 
the item discriminations made by one judge 
match those made by the other. The second 
component, the ratio of the standard devia- 
tions, indicates the extensiveness of the re- 
sponse language of one judge relative to that 
of another. Viewing reference scales in this 
context makes it clear that, contrary to the 
impression one might obtain in reading the 
literature, a judge who “bunches” his ratings 
in a subset of prescribed response categories 
is not necessarily a poorer judge than his 
more expansive counterpart. In comparing 
the reference scales of judges, three variables 
are of interest: differences in origin, in judg- 
mental unit (relative expansiveness of the 
judgmental language), and the degree to 
which there is a correspondence of discrimi- 
nation. Questions concerning goodness of dis- 
crimination would appear to refer to the 
third variable. The first two variables suggest 
differences of style. 

The variable series model, by virtue of its 
specification of the effects of perspective on 
reference scales, leads to the following predic- 
tions: 

1. Judges who differ in perspective differ 
on all items in a series, without regard to 
considerations such as item ambiguity. There- 
fore, the selection of items for a final Thur- 
stone scale is not expected to alter any rela- 
tionships between reference scale properties 
and item values which are observable in the 
item pool, as a whole. 

2. Judges whose own positions lie within 
the item range adopt narrower perspectives 
than do out-of-range judges. As a conse- 
quence, compared to out-of-range judges, in- 
range judges: (a) Adopt scale origins which 
are further from the end of the continuum 
preferred by the out-of-range judge, result- 
ing in an apparent displacement of the aver- 
age judgment in the direction of the “miss- 
ing” items. (b) Adopt small scale units, that 
is, distribute the common items with a greater 
dispersion corresponding to their narrower 
response categories. (c) Discriminate equally 
astutely among the set of items. 
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The implications of these hypotheses fer ~ 
the validity of Thurstone scaling will be dis- 
cussed following an examination of the data, 


METHOD 


For the present study, some of the data pre- 
viously reported (Upshaw, 1962) were reanalyzed. 
From the total item series (114 items) of- the 
earlier study, a sample of 40 attitude statements 
was drawn so as to cover the range of items in the 1 
total set with items of minimum ambiguity (Q 
value). As a basis of sampling, the total set was 
partitioned into 20 classes representing half-point 
intervals on a “control” equal-appearing intervals 
scale, This was the scale that was determined from 
the judgments of all subjects, without regard to © 
their attitudes, who participated in the total series 
condition (Upshaw, 1962, p. 89). From each of 
these 20 classes the two items with the smallest Q 
values were selected for the sample. In a few in- 
stances items were rejected because they appeared to 
state a fact, rather than an opinion. Except for the 
criterion of irrelevance (Thurstone & Chave, 1929, 
p. 59) the items selected were the best ones avail- 
able for a final scale. Two item-by-item parallel 
forms of a final scale were constructed by arbi- 
trarily assigning to one or the other form each of 
the two items drawn from the 20 classes.” 

The equal-appearing intervals scale values for the 
items of the two parallel forms were determined 
from the 1962 data, and were analyzed in a fac- 
torial design consisting of the following factors: 
judge’s attitude, item range, scale position (ie. 
rank order of the items in terms of the control scale 
values), and form. Only those items which were 
common to the three conditions of item range 
that is, the nine middle items of each form, were 
included in the analysis. Of the four factors, only 
form was considered to be random; the others, fixed. 
As a consequence of this decision, no effect involv- 
ing form was testable, but the results of the three 
fixed effects can be generalized to forms of equal- 
appearing intervals scales constructed from the 
Hinckley item pool (Green & Tukey, 1960). 

The statistical design permits an answer to the 
methodological question of whether final scales show 
the same bias as the original item pool. Specifically, 
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2 Accompanying the earlier report an 11-page docu- 
ment was filed with the American Documentation 
Institute listing all of the scaling data from the 
original study. Order Document No. 6942 from ADI 
Auxiliary Publications Project, Photoduplication 
Service, Library of Congress, Washington, D.. c 
20540. Remit in advance $1.75 for microfilm Ot 
$2.50 for photocopies and make checks payable to: 
Chief, Photoduplication Service, Library of Con- 
gress. The two final scales in the present work were 
drawn from that document. One form consisted 0 
items: 7, 9, 12, 26, 28, 31, 41, 42, 51, 52, 53, 5% 
73, 78, 79, 83, 88, 95, 104, 109, The second form i 
consisted of items: 6, 8, 14, 16, 18, 20, 30, 35, 38 
45, 46, 47, 54, 57, 58, 68, 70, 81, 98, 108. 
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a main effect or any interaction involving attitude 
would constitute evidence against the notion that 
Thurstone’s ambiguity criterion serves to “purify” 
final scales. (Effects due to range are deemed ir- 
relevant to this question, since the use of varying 
item ranges in attitude scaling is not a methodo- 
logical issue, and is part of this design for theoreti- 
cal purposes only.) 

The substantive issues in this study relate to two 
reference scale parameters, origin and unit. Because 
the former refers to the average placement of a set 
of items, differences in origin as a function of the 
independent variables would be manifested by ef- 
fects due to attitude, range, or attitude-range. „The 
judgmental unit, on the other hand, refers to the 
relative dispersion of a set of items. Thus, differences 
in this parameter would be reflected only in inter- 
actions involving the scale position variable with 
attitude, range, and attitude-range. Furthermore, 
these interaction terms may be interpreted as arising 
from differences in unit only if, as discussed above, 
the component sets of item values are related to 
each other with a high linear correlation. 


RESULTS AND DISCUSSION 


Table 1 presents the results of the analysis 
of variance. From these results it is clear that 
the ambiguity criterion did not eliminate the 
influences of variable perspectives of judges 
in the final scale. All testable effects involving 
attitude, except attitude-range-scale position, 
were adjudged significant. Discussion of the 
implications of this finding for attitude scale 
construction will be postponed until after con- 
sideration of the substantive issues of the 
study. 


TABLE 1 


ANALYSIS OF VARIANCE OF EquaL-APPEARING 
InTERVALS SCALE VALUES 


. 
Source MS df F 
Attitude (A) 10,08 2 28,00* 
Range (R) 11.43 2 285.75** 
Scale position (S) 37.87 8 54.10** 
XR 1.23 4 6.15 
AXS -96 16 2.40* 
RXS 38 16 7.60** 
AXRXS 36 32 84 
Form (F) 04 1 
AXF 36 2 oa 
RXF 04 2 =i 
SXF 10 8 = 
AXRXF -20 4 = 
AXSXE 40 16 = 
RXSXF 05 16 ai 
AXRXSXF 43 32 c= 


Note.—The error terms used for testing effects were in all 
Rec lie MS due to the interaction of Form with the source 
sted, 
* p <.05. 
** b <.01. 
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TABLE 2 


Mean EQUAL-APPEARING INTERVALS VALUES FOR 
A X R Interaction 


Item range = 
Attitude of 
judges 
a— t a+ 
A 5.09 5.01 5.77 
N 4.08 4,65 5.44 
R 4.03 4.56 4.12 


Note.—A =anti, N =neutral, P =pro, a — =series aborted at 
the anti end, t =total series, a+ =series aborted at the pro end. 


Attitude-Range and Attitude-Range-Scale 
Position Effects 


These issues concern predicted differences 
in origin and unit between in-range and out- 
of-range judges produced by shortening the 
item range. The critical test of the variable 
series predictions concerning origin is the 
attitude-range interaction. This effect is 
shown in Table 1 to be significant at about 
the .05 level. The item means (averaged over 
forms) for this interaction are presented in 
Table 2. Inspection of the table indicates that 
the pattern of means here is of the same na- 
ture as that reported earlier (Upshaw, 1962) 
for the entire item pool. The differences be- 
tween the mean item values in Table 2 for 
the t versus a+ conditions are |.76| for A 
judges compared to |.16| for P judges. Thus 
A judges were more responsive to the re- 
moval of pro items than were P judges. 
Similarly, the differences between the t ver- 
sus a — conditions are |.08| for A judges and 
|.53| for P judges, showing the symmetry of 
the relationship. According to variable series 
these results are due to the retention of rela- 
tively wide perspectives by judges who were 
rendered out-of-range by the experimental 
restriction of the item range. The relatively 
wider perspectives of out-of-range judges are 
expected to influence not only reference 
scale origin, but the unit of judgment, as well. 
Thus, the model implies an attitude-range— 
scale position interaction wherein the disper- 
sion of’ item values over all scale positions is 
smaller for out-of-range judges. The observed 
interaction effect is clearly insignificant, with 
F<. 

In search of insight into the failure to ob- 
serve an interaction between attitude, range, 
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TABLE 3 


MEAN EQUAL-APPEARING INTERVALS VALUES FOR 
A X S Interaction 


4 Attitude of judges 
Ordinal scale 
positions 
A N P 
1 3.87 2.53 2.16 
2 3.85 2.87 2,60 
3 4.42 3.65 3.34 
4 4.44 4.14 4.03 
5 5.47 4.56 3.88 
6 5.88 5.34 5.14 
7 6.72 6.23 5.34 
8 6.26 6.33 6.19 
9 6.68 6.88 7.26 


Note,—A =anti, N =neutral, P =pro. 

a This variable corresponds to the nine items, ordered from 
anti-Negro to pro-Negro according to ‘‘control scale values,” 
yah were included in all three conditions of item range (see 
text). 


and scale position, the other lines of the anal- 
ysis were examined, and will be discussed in 
turn. 


Main Effects 


The significant effect due to attitude corre- 
sponds to mean item judgments (where 1 
represents a maximally anti score, and 11 
represents a maximally pro score) of 5.29 for 
A judges, 4.72 for N judges, and 4.44 for P 
judges. The reference scale origins are, there- 
fore, directly related to the attitudes of the 
judges, since it was assumed that origin and 
average placement are inversely related. If 
the experimental inductions were valid, these 
differences in origin are due to the fact that 
A judges invoked special anti anchors when 
the series was aborted at the anti end, whereas 
the P judges performed similarly at the pro 
end when pro items were missing from the se- 
ries. In what follows, it is important to distin- 
guish between the assumption about the ex- 
perimental induction, on the one hand, and the 
basic hypothesis of the variable series model, 
on the other. This hypothesis is that origin 
is a function of the sum (or average) of the 
perspective parameters, and that unit is a 
function of their difference. The attitude ef- 
fect is consistent both with the assumed in- 
duction and the theoretical hypothesis. 

The range effect is, similarly, consistent 
with both the theoretical hypothesis and the 
assumption concerning the experimental in- 
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duction. The effect is significant, with mean 
values of 4.40 for the a — condition, 4.74 for 
the t condition, and 5.31 for the a + condi- 
tion. In the absence of anti items the midscale 
items were judged to be more anti; in the 
absence of pro items, they were judged to be 
more pro. 


The scale position main effect is of trivial | 
interest in this design, for items were selected ~ 


so as to differ in scale position. The interac- 
tion of this variable with other variables, 
however, under some circumstances relates to 
judgmental unit, as is discussed below. 


Attitude-Scale Position and Range-Scale 
Position Interactions 


Table 3 lists the mean scale values (aver- 
aged over forms) for each scale position ac- 
cording to judges’ attitudes. In order to de- 
termine whether the attitude-scale position 
effect represents differences in judgmental 
unit as a function of attitude, the scatter- 
plots corresponding to pairs of columns of 


Table 3 were made. By inspection it ap- | 


peared that the judgments of A, N, and P , 


judges were linearly related to each other, 4 


conclusion that is confirmed by the large | 
linear correlation coefficients relating the — 


groups (.98 for A versus N and for N versus 


P, .92 for A versus P). In accord with the © 


earlier discussion of discrimination phenom- 
ena, the finding of nearly perfect linear cor- 
relations among the groups indicates that the 
significant attitude-scale position interaction 
resulted from differences in dispersions of 
judgments, or, equivalently, judgmental units. 
The standard deviations of the item values in 
each column of Table 3 are: A, 1.10; N, 
1.48; P, 1.58, indicating an increasing ex 
pansiveness of the judgmental language from 
anti to pro positions. The weight of the evi- 


dence, therefore, concerning the attitude 


scale position interaction is that it resulted 


from differences in response language, where- 


by P judges utilized the smallest judgmental 
unit. There is no suggestion that the various 


judging groups differ in their astuteness in 


detecting stimulus differences since the inter- 
correlations were virtually perfect. There is 


no suggestion that judges’ attitudes are ing 


any way related to differential discrimination 
in various regions of the scale since the re- 
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gression lines relating the groups are linear. 

In order to be consistent with the variable 
series model, it would have to be shown that 
the smaller units of P judges were associ- 

ated with narrower perspectives. To the ex- 
tent that the experimental induction suc- 
+ ceeded, all judges had equal perspectives in 
the’ t conditions, A judges had the widest 
perspectives in the a— condition, and P 
judges in the a + condition. The finding that 
P judges had the smallest units suggests that 
ə the attitudes of A and P judges were asym- 
metrically located about the item series. Thus, 
in those range conditions for which some 
jtidges supplied their own perspective anchors, 
A judges supplied anti anchors that were 
more remote than the pro anchors of the P 
judges. This reasoning contradicts two sets 
of data. First, accompanying the earlier study 
(Upshaw, 1962, pp. 92 ff.) were data indicat- 
ing that many P judges probably held own 
positions that exceeded the item series, even 
in the t condition. This suggests that P judges 
had the broadest perspectives of the three 
groups of judges, which, in turn, implies that, 
* according to the model, they should have 
adopted the largest unit, not the smallest. 
Furthermore, the argument that A and P 
judges supplied asymmetrically remote an- 
chors in the aborted range conditions implies 
a significant attitude-range-scale position in- 
teraction, which was not observed. 

Given all of the evidence, the two most 

* plausible explanations of the attitude-scale 
position interaction seem to be these: 

1! The experimental induction failed, per- 
haps because the assumption that perspective 
anchors result only from items in the series 
or from own attitude is a gross oversimplifi- 
cation. 

2. The theoretical expectation of an in- 

> verse relationship between dispersion of item 
judgments ‘and perspective width is in error. 
~ The first possibility is intuitively appeal- 
ing. Correlated with own attitude there are 
likely to be any number of special “anchors” 
that may be invoked in addition to one’s own 
position, The second possibility requires a 
more direct test than ‘the present design per- 
mits. Here perspective is inferred. If it could 
be measured, the basic formulation of the 
variable series model could be evaluated less 


TABLE 4 


Mean EQUAL-APPEARING INTERVALS VALUES FOR 
R X S Interaction 


Item range 
Ordinal scale 
positions 

a- t a+ 
1 2.57 2,63 3.36 
2 2.55 3.41 3.37 
3 3.56 3.62 4.24 
4 3.54 4.36 4.71 
5 4.18 4.70 5.03 
6 4.93 5.22 6.20 
7 5.56 5.69 7.05 
8 6.05 6.18 6.55 
9 6.66 6.86 7.30 


Note.—a — =series aborted at anti end, t =total series, a+ 
=series aborted at pro end, 

a This variable corresponds to the nine items, ordered from 
anti-Negro to pro-Negro according to “control scale values," 
a were included in all three conditions of item range (see 
text). 


equivocally. Ostrom (1964) has produced 
such direct evidence in support of the model. 
In terms of the present design, the range- 
scale position interaction can be expected to 
provide a clearer test of whether item disper- 
sion is related to perspective. This is so be- 
cause range was a genuine experimental ma- 
nipulation. Attitude, on the other hand, was 
manipulated by the selection of subjects, with 
all their concomitant attributes (such as ex- 
traneous anchors). à 
Table 4 presents the item values (averaged 
over form) for the range-scale position inter- 
action. Again, the scatterplots relating pairs 
of columns of the table appeared by inspec- 
tion to be linear, and the correlation coeffi- 
cients were nearly perfect (.96 for t versus 
a+, .98 for both t versus a — and a + ver- 
sus a —). The standard deviations of the scale 
values were 1.30 for t, 1.40 for a —, and 1.43 
for a+. As with the attitude-scale position 
effect, the significance of this interaction ap- 
pears to be due to judgmental unit. Averag- 
ing over attitude conditions, the t condition 
is, in principle, associated with the broadest 
perspective. By the theory, this condition 
should have the smallest dispersion, and it 
does. The other two conditions should have 
roughly equal dispersions, and they do. 
The experimental design of the present 
study was intended to provide a manipulation 
of item range and judge’s attitude. Assuming 
the validity of the experimental induction of 
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perspective the effects ascribable to differences 
in reference scale origin should have been 
found in the interaction of attitude-range. 
Those ascribable to differences in unit should 
have been found in an interaction of attitude- 
range-scale position. The data supported the 
hypothesized effects due to origin. The data 
did not support precisely those effects pre- 
dicted for unit, although they did suggest 
clearly that unit differences occurred as pre- 
dicted for range, and in an unexpected fashion 
for attitude. Furthermore, all judges were 
deemed to have made equally astute discrim- 
inations among the stimuli, confirming the 
idea that only judgmental language differed 
among judges. 

Perhaps the most important conclusion of 
the present study is that the various judges 
included in the design may be viewed as dif- 
fering from each other only in reference 
scales. All of the data from the total series 
condition of the parent study have been 
analyzed according to judges’ attitudes, with 
the same result (Upshaw, 1964). Manis 
(1960) observed a U shaped curve relating 
the standard deviation of the judgments of a 
common set of items to judges’ attitudes. Be- 
cause the A judges in the present experi- 
ment held positions near the middle of the 
series used in the t condition (see Manis, 
1964), Manis (personal communication) has 
noted that the findings here that the standard 
deviation of item values increases with the 
pro-ness of attitude may be viewed as a rep- 
lication of part of the relationship which he 
observed. Whatever the true relationship be- 
tween attitude and judgmental unit, and 
whatever the reasons for the relationship, the 
reference scale unit is a long-neglected aspect 
of social judgment that warrants attention. 


Validity of Thurstone Scales 


Thurstone (1928) himself raised the ques- 
tion of whether, in practice, judges could rate 
opinion statements without bias due to their 
own sentiments, as the scaling technique de- 
mands. Hinckley (1932) apparently inter- 
preted the validity question as that of 
whether the ratings of judges with differing 
attitudes are highly correlated, whereas Hov- 
land and Sherif (1952) apparently interpreted 
the question as that of whether the item 


UPSHAW 


ratings differ at all according to attitude. In- 
terestingly, Hinckley (1963) has provided 
replication of his earlier study, with linear 
correlations that are almost perfect. 

The present design, unlike earlier studies, 
has focused attention on the final scale result- 
ing from the selection of items to minimize 
ambiguity while adequately covering all scale 
positions. With this restricted set of items 
the present data have confirmed Hinckley’s 
finding that judges of differing attitudes 
are almost perfectly correlated in their rat- 
ings. Furthermore, the data have also con- 
firmed two findings of Hovland and Sherif, 
A generally negative relationship was ob- 
served between judges’ attitudes and their 
item judgments, and the P judges evaluated 
the most pro items as more favorable and the 
most anti items as more anti than did other 
judges. However, because of the high inter- 
correlations among judges (as predicted by 
Hinckley), the generally negative relationship 
between attitude and scale value has been 
attributed to differences in origin, rather than 
to a process called “contrast.” For the same 
reason, the fact that P judges rated pro items” 
as more pro than did other judges is here 
viewed as a manifestation of differences ind 
judgmental unit and not as a result of an 
“assimilation” process. 

In the context of the variable series formu 
lation it appears that Hinckley emphasized 
the degree to which judges detect stimulus 
differences whereas Hovland and Sherif em- 
phasized judgmental language, both origin 
and unit. The present research indicates ‘that 
in final Thurstone scales, as well as with en- 
tire item pools, judges of varying attitudes are 
equally astute in detecting item differences, 
while, at the same time, they differ in judg- 
mental language. What, then, is the impact of 
this state of affairs on the validity of attitude 
scale construction? The main consideration 
appears to be that the final scale values até 
invariant over judges up to a linear transfor- 
mation, That is to say, the scale values ob- 
tained with any one set of judges may be €% 
pected to differ from each other only with 
respect to an additive constant (correspond 
ing to origin) and a multiplicative one (cot 
responding to unit). Nothing more is de- 
manded of an interval scale (Stevens, 1951, 


defines an interval scale as a set of values 
having this degree of invariance). Thus, in 
the dispute between Thurstone and Hinckley, 
on the one hand, and Hovland and Sherif, on 
the other, the evidence provided by the pres- 
ent data supports the validity of scale con- 
struction by equal-appearing intervals. 
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A RELATION BETWEEN HEART RATE AND 


PERFORMANCE 


HANS O. DOERR anD JACK E. HOKANSON 


Florida State 


42 Ss were selected from students in each of 3 age groups—7, 9, and 11—for 
a total of 126 boys and girls. In each age group, Ss were divided into a group 
receiving frustrating instructions (which raised heart rate approximately 8.2 
beats/min.) and a control group receiving neutral instructions (approximately 
0.2 beat/min. increase). Also, within each condition the students were di- 
vided into low, medium, and high heart-rate Ss on the basis of an initial re- 
cording under resting conditions. The central measure was change in perform- 
ance on a timed coding task following frustrating or neutral instructions. The 


results indicated that low heart-rate Ss 


ment in performance following frustration while high heart-rate Ss showed a 
relative decrement. The data were viewed as being generally consistent with 


the inverted U hypothesis. 


A number of recent studies have demon- 
strated that: adult, human subjects with 
characteristically low resting heart rates 
achieve scores on simple performance tasks 
which were lower than those obtained by high 
heart-rate subjects; and after an imposed 
stress, which typically increased heart rate 
approximately 16 beats per minute, the low 
heart-rate subjects significantly improved in 
performance relative to a nonstressed control 
group, while high heart-rate subjects mani- 
fest a relative decrement (Burgess & Hokan- 
son, 1964; Hokanson & Burgess, 1964). 
Viewing heart rate as an indicant of general 
physiological “arousal,” these results were 
considered to be consistent with Malmo’s 
(1958) inverted U hypothesis. 

The purpose of the present study was to 
investigate whether similar-type relationships 
are obtained with children. In addition, this 
project included subjects with resting heart 
rates of average levels, thus providing data 
pertaining to performance changes in this 
range of “arousal.” 

To these ends, children in three different 
age groups were utilized (ages 7, 9, and 11). 
Subjects in each age group were divided into 
low, medium, and high heart-rate categories 
on the basis of resting heart-rate measures. 
Each age group was subsequently arranged in 
a 3X2 factorial design, The variable or- 


1This project was supported by Public Health 
Service Grant MH 07341-01, Jack E. Hokanson, 
principal investigator. 
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IN CHILDREN* 


University 


generally showed significant improve- 


thogonal to the subject variable was arous 
by a frustrating experience versus nonarousa 
Number of items completed on a simple cod: 
ing task prior to the arousal manipulation: 


nate form after arousal served as measures 0 
performance. Heart-rate recordings made 4 
each phase of the investigation served as am 
indicant of physiological status. 


METHOD 
Subjects 


Twenty-four female and 18 male subjects were se- 
lected from students in each of three age groups (7, 
9, and 11) attending Florida State University School) 
for a total of 72 female and 54 male subjects. The 
subjects for each age group were arranged in a 3 X4 
factorial design with four females and three m 
in each of the three frustration and three nonfrus 
tion conditions. 

Initially, for each age group, 16 females and 1 
males were selected randomly from the pool of sub 
jects available. They were used as subjects for 
nonfrustration conditions of the experiment. The fou 
girls and three boys with the highest resting h 
rates (75-100 percentile) were designated high he: 
rate subjects; the ones with the lowest heart T 
(0-25 percentile), low heart-rate subjects. The 
dren with heart rates centering around the n 
of the distribution (37.5-62.5 percentile) were d 
nated medium heart-rate subjects. The data for tl 
seven subjects falling between the 62.5-75 percentil 
and the 25-37.5 percentile were discarded. i 

Next, the 21 subjects per age group for the frus 
tration condition were selected randomly from th 
rest of the students in the age group. As their 
ing heart rate became established at the beginning 
the experimental session, it was determined wh 
it fell within the range of heart rates of members 0 
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the same sex in one of the nonfrustration conditions. 
. If this was the case, the experiment was continued 
and the subject assigned to the corresponing frustra- 
tion condition. If not, the subject was dismissed at 
this point and another one randomly selected from 
the available pool until all cells were filled. This 


| “= mode of selection assured that there was little differ- 


ence between the resting heart rates of the groups in 
one experimental condition and the corresponding 
group in the control condition. It also controlled for 
sex differences in heart rate and differences in heart 
rates between the three age groups, as each sex and 
age group set its own norms. 


Procedure 


-The experimental session lasted about 20 minutes. 
All sessions took place between 8 a.m. and 12 noon. 
A's the subject entered the experimental room, he was 
told that he was going to see his own heart rate re- 
corded and that he was also to work on some prob- 
lems. In Phase 1 of the experiment the subject was 
seated in a chair, had EKG leads placed on his right 
arm, left arm, and right leg (where they remained 
throughout the experiment), and was told to relax 
completely. The adaptation period lasted 5 minutes. 
Heart-rate recordings for periods of 15 seconds each 
were made during the first, third, and fifth minute 
of this phase. 

Heart rate was recorded on a Beck-Lee electro- 
cardiograph equipped with flexible leads which en- 
abled the subject to work on the performance tasks 
unimpaired. Cardiac pulses were recorded as a promi- 
nent vertical pen displacement on EKG recording 
paper. The chart drive on this apparatus is excep- 
tionally reliable and fast (25 millimeters per second). 
In addition, the line divisions on the recording paper 
are in -second spaces; thus making the counting of 
heart rates relatively simple during each recording 
interval. Heart-rate recordings were made for 30- 
second intervals at each step in the procedure by 
simply turning a single, silent switch. 

Phase 2 consisted of presenting the first perform- 
ance,problem, the prefrustration task, to the subject. 
He was asked to perform on a modified version of 
the WISC Digit Symbol test. The child was in- 
structed for 30 seconds on how to do the task. One 
minute of actual testing followed. At the end of the 
Performance heart rate was again recorded. 

Phase 3 consisted of a 2-minute rest period, termi- 
nated by another heart-rate measure, the prefrustra- 
tion measure. 

Phase 4 differed for subjects in the frustration and 
no-frustration groups. The frustrating manipulation 
consisted of having the subject work for 1 minute on 
a Series of simple arithmetic problems so designed 
that all subjects were able to get at least a number 
of them correct. At the end of the minute the experi- 
menter checked the answers with a spurious key, 
crossed out some answers with a red pencil, marked 
the paper with a big red “F,” and told the child that 
his performance was very poor. The experimenter 
also entered the child’s name in the F column of a 
list labeled prominently, “distribution of scores.” Im- 


mediately after that, heart rate was again recorded. 
The subjects in the no-frustration group were given 
the same problems to work for 1 minute, after which 
the examiner simply said “OK” and recorded the 
heart rate. 

Phase 5 consisted of an immediate presentatidn of 
the postfrustration problem. It consisted of an al- 
ternate form of the Digit Symbol test used in Phase 
2, and the subject was again given 1 minute to work 
on it, 

Phase 6 applied only to members of the frustration 
group, It was concerned with removing the effects of 
frustration, The child was given an alternate form of 
the arithmetic problems used in the frustrating ma- 
nipulation. He was allowed to work on them for 1 
minute after which the examiner produced another 
key, checked the answers and found them “very 
good.” At this time the experimenter also remarked 
that he must have used the wrong key to grade the 
first paper and that these answers, also, were very 
good. He then erased the child’s name from the F 
column of the distribution of scores list and added 
it to the A column. 

The effect of communication of procedures to fu- 
ture subjects appeared to be negligible as the experi- 
ence of “having one’s heart rate taken” dominated 
the children’s conversations when returning to the 
classrooms. 


RESULTS 


For each age group, analyses of variance 
were performed on the following sets of data: 
Resting heart rate or drive (the two terms as 
used in this paper are synonymous); score on 
the prefrustration performance task; dif- 
ference between pre- and _postfrustration 
heart rate; and, difference between pre- and 
postfrustration performance score. The Dun- 
can range test was also applied to further 
evaluate the significant main and interaction 
effects. 


Age Group 7 


The analyses of variance of the resting 
heart rates revealed only one significant F 
ratio: that for the main effect of drive which 
was significant at the .01 level. The low-drive 
subjects had an average heart rate of 82 beats 
per minute; the medium-drive subjects, an 
average rate of 93.5 beats per minute; and 
the high-drive subjects, an average heart rate 
of 102.9 beats per minute. Since these ex- 
pected findings were the only ones which even 
approached significance, it appears that there 
was no bias, in terms of heart rate, in the 
assignment of the 7-year-old subjects to ex- 
perimental and control conditions. 
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TABLE 1 


DIFFERENCE SCORE BETWEEN PRE- AND POSTFRUS- 
TRATION Heart RATE 
(Ace Group 7) 


Source af MS F 
Frustration (A) 1 414.90 | 18.88** 
Drive (B) 2 21.75 
AXB 2 7.95 
Within 36 21.97 

Total 41 


Treatment means of the difference between pre- and 
postfrustration heart rate 


Medium 


Low drive drive High drive 
Frustration 1.5, 7.5p 75 
No frustration — 5a 2A, 6s 


a Cells containing the same subscript are not significantly 
different from each other at the .05 level, by Duncan multiple 
range test. 

** p <.01, 


The analysis of variance of the prefrus- 
tration performance scores revealed no sig- 
nificant effects, again suggesting that there 
was no apparent bias in assignment of sub- 
jects to the frustration and no-frustration cells 
on this variable. 

Table 1 presents an analysis of variance 
and range test results of changes in heart 
rate due to the frustration manipulations. The 
main effect for frustration was significant at 
the .01 level, with frustrated subjects show- 
ing an average increase of 5.3 beats per min- 
ute, and control subjects manifesting only a 
.7 beat per minute elevation. A comparison of 
the condition means, however, indicated that 
not all of the frustrated subjects had a uni- 
form increase in heart rate. While medium- 
and high-drive subjects had increases of 7.5 
beats per minute, the low-drive children 
manifested an average increase of 1.5 beats 
per minute. This latter change was not sig- 
nificantly greater than that in the nonfrus- 
trated conditions. 

Table 2 presents a repeated measures 
analysis of variance of the pre- and post- 
frustration performance scores for the 7-year- 
old group. The F ratio computed for the 
difference between pre- and postfrustration 
performance scores (phase) was significant at 
the .05 level, indicating that generally scores 
on the second administration of the Digit 
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Symbol test were higher. This finding, how- 
ever, is tempered by the significant second- 
order interaction and the range test results, 


which reveal that only low drive-frustrated | 


and medium drive-nonfrustrated subjects 
manifested appreciable increases on the sec- 
ond performance measure. Further inspection 


of the cell means indicates that high-drive | 


subjects who were frustrated displayed a 
suggestive decrement in performance. 


Age*Group 9 


Preliminary analysis of variance again re- 
vealed that there were no significant initial | 
heart-rate differences between frustrated and 


nonfrustrated groups, although the antici- — 


pated differences between subjects in drive 


i 


Eg A 


categories did emerge (low drive, 76 beats | 


per minute; medium drive, 86.8 beats per 
minute; high drive, 97.8 beats per minute). 
In addition, there were no differences among 
any of the comparisons made with respect to 
initial performance score on the Digit Symbol 


TABLE 2 


DIFFERENCE BETWEEN PRE- AND POST- 
TREATMENT FRUSTRATION SCORES 
(AcE Grove 7) 


Source df MS F 

Frustration (A) 1 6 
Drive (B) 2 6.5 
AXB 2 12 
SS/groups (C) 36 11.76 
Phase (D) 1 9.3 4,69* 
DXA 1 3.3 
DXB 2 6.25 
DXBXA 2 10.65 5.38" 
cxD 36 1.98 

Total 83 


Treatment means of pre- and postfrustration 
performance scores 


Low drive | Medium | pigh drive 

Frustration 

Pre 9.7are® | 9.57ave | 11.00ca 

Post 12.294 9.1405 9.8600 

Difference 2.58 —A3 1.14 
No frustration 

Ue 8.71, | 8.57, | 10.29at0 

Post 9.14. | 10.71p0q | 10.7 1o0a 

Difference 43 2.14 42 


_.* Cells containing the same subscript are not significantly 
different from each other at the .03 level by Duncan multiple 
range test. 
* p <.05. 
Dp <.01. 
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test. Hence there is again reasonable assur- 
ance that any subsequent performance 
changes cannot be attributed to unequal 
initial performance levels. 

Table 3 portrays the results of an analysis 
of variance on the heart-rate changes due to 
the frustration treatments. Here, the signifi- 
carit main effect for frustration indicates that 
the experimental manipulation had its in- 
tended effect in all three of the subject groups 
(mean increase of 8.6 beats per minute). 
The corresponding average heart change for 
nonfrustrated subjects was 0 beats per 
minute. 

*, The repeated measures analysis of variance 
for performance scores before and after the 
frustration manipulations is presented in 
Table 4, The significant main effect for phase 
again indicates that subjects generally show 
improved performance on the second admin- 
istration of the Digit Symbol test. Here again, 
however, the range test results presented at 
the bottom of the table suggest that signifi- 
cant improvement occurred only in three of 
the conditions: low-drive subjects who were 
either frustrated or not and high-drive-non- 
frustrated subjects. It is also of interest to 


. note that the high-drive-frustrated group dis- 


played significantly less improvement than 
their nonfrustrated counterparts. Finally, the 
fact that the low-drive-frustrated subjects 


TABLE 3 


DIFFERENCE SCORE BETWEEN PRE- AND 
POSTFRUSTRATION HEART RATE 


° (AcE Grove 9) 
Source df MS F 
Frustration (A) 1 771.43 | 16.69** 
. Drive (B) 2 44.57 
AXB 2 8 
Within 36 46.22 
Total AL 


Treatment means of the difference between pre- and 
postfrustration heart rate 


Medium 


Low drive | Medium | High drive 
Frustration. 6.952 11.45 TAs 
No frustration —1itp 1.1, 0.0, 


a Cells containing the same subscript are not significantly 
different from each other at the .05 level, by Duncan multiple 
Tange test. 

** p <01. 


TABLE 4 


DIFFERENCE BETWEEN PRE- AND POST- 
FRUSTRATION PERFORMANCE SCORES 


(AcE Group 9) 
Source df MS F 

Frustration (A) 1 5.77 
Drive (B) 2 17.18 

2 12.51 
SS/groups (C) 36 36.56 
Phase 1 114,34 | 32.95** 
DXA 1 18 
DXB 2 5.51 
DXAXB 2 8.23 
cxD 36 3.47 

Total 83 


Treatment means of pre- and postfrustration 
performance scores 


Low drive | Medium | High drive 
Frustration 
Pre 15.3.8 16.4ab0 17 Aabe 
Post 19.746 18.5ca 18.1ea 
Difference 4.4 2.1 Aff 
No frustration : 
Pre 17 Abe 15.7ab 17.8bea 
Post 19.746 Ildi 20.75 
Difference 2.3 1.6 29 


a Cells containing the same subscript are not significantly 
different from each other at the .05 level, by Duncan multiple 
Tange test. 

** p <.01. 


had the greatest numerical increase in number 
of symbols completed should be mentioned. 


Age Group 11 


Preliminary analyses again revealed that 
there were no significant differences among 
subjects in this age group on either initial 
performance scores or resting heart rate, other 
than the expected heart-rate differences due 
to the original assignment of subjects to 
drive level categories (low drive, 71.4 beats 
per minute; medium drive, 84.6 beats per 
minute; high drive, 95.6 beats per minute). 

The F ratio computed for the main effect 
of frustration on heart rate was significant 
at the .01 level (Table 5). The frustrated 
subjects showed an average increase of 10.5 
beats‘ per minute, while nonfrustrated sub- 
jects had only a .2 beat per minute increase. 
These changes were relatively uniform across 
the three drive-level conditions. 

Table 6 presents the performance scores 
before and after the frustration manipulations 
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for subjects in the 11-year group. Again, the 
significant phase term can be best viewed in 
reference to the range test results at the 
bottom of the table. In only two of the cells 
(low-drive—frustrated and medium-drive— 
frustrated) is there a significant improvement 
in Digit Symbol performance (4.6 and 2.9 
items, respectively). As in the two previous 
age groups, the improvement score for the high- 
drive—frustrated subjects is again numerically 
the lowest (0.0 item). 

Summarizing the results for all three age 
groups, it is apparent that the groups ob- 
tained by dividing subjects into those having 
low, medium, and high heart rates differed 
significantly with respect to mean resting 
heart rate. Also, in none of the age groups 
was there a significant difference between 
initial performance scores of subjects in the 
six treatment conditions. Subjects in all age 
groups experienced a significant increase in 
heart rate when frustrated, with only one 
exception (low-drive subjects in Age Group 
7). Subjects who underwent the no-frustra- 
tion treatment showed no significant change 
in heart rate. The effects of the frustration 
treatments on the performance scores were 
not perfectly uniform in the three age groups. 
However, one pattern clearly emerges: low- 
drive subjects who were frustrated showed a 
consistent significant increase in their per- 


TABLE 5 


DIFFERENCE SCORE BETWEEN PRE- AND 
POSTFRUSTRATION HEART RATE 
(AGE Group 11) 


Source df MS F 
Frustration (A) iE 952.40 | 17.4** 
Drive (B) 2 21.70 
AXB 2 57.55 
Within 36 55.43 

Total 41 


Treatment means of the difference between pre- 
and postfrustration heart rate 


Medium 


Low drive | Medium | High drive 
Frustration 12.6,° 9.70 9.14 
No frustration —.57py 1.1, —1.4, 


„^ Cells containing the same subscript are not significantly 
different from each other at the .05 level, by Duncan multiple 
range test. 

** p <.01. 
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TABLE 6 


DIFFERENCE BETWEEN PRE- AND Post- 
FRUSTRATION PERFORMANCE SCORES 
(AcE Grove 11) 


Source df MS F 
Frustration (A) 1 119.1 
Drive (B) 2 6.1 
AXB 2 56.05 
SS/groups (C) 36 37.54 
Phase (D) 1 88.1 20,07** 
DXA 1 3.8 
DXB 2 7.5 
DXAXB 2 11.65 
cxD 36 4.39 
Total 83 
Treatment means of pre- and postfrustration A 
performance scores 
Low drive | Medium | High drive 
Frustration 
Pre 21.Tab® | 24.04 21. 7abo 
Post 26.34e 26.9 21.7abo 
Difference 4.6 2.9 0.0 
No frustration 
Pre 20.5a» 19.7, 21.3nb0 
Post 22. lave 21.0a | 23.3b0 
Difference 1.6 1.3 2.0 


^ Cells containing the same subscript are not significantly 
different from each other at the .05 level, by Duncan multiple 
Tange test. 
** p <01. 


5 O——O__s FRUSTRATION. 
@——@ NO FRUSTRATION 
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Fic. 1. The relationship between changeʻin per- 
formance score and postfrustration heart rate in 
beats per minute, for all age groups combined. 
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formance scores. In each age group, their 
increase in performance was numerically the 
largest, exceeding the increase exhibited by 


< nonfrustrated subjects. High-drive subjects 


who received the frustrating instructions 
showed in no instance a significant increase 
. in, performance score. In all age groups, they 
showed the least increase or the greatest 
decrease in performance scores. The perform- 
ance data for the medium-drive group were 
less consistent, although when viewed across 
all age groups, the subjects in this category 
who were frustrated displayed approximately 
the same slight improvement score as did the 
nonfrustrated groups. Thus it seems likely 
that the effects of frustration for this group 
did not differ from the mere practice effects 
manifested by the control group. 

Figure 1 summarizes graphically the rela- 
tionship between change in performance score 
and postfrustration heart rate for the three 
age groups combined. 


Discussion 


The results of this study may be inter- 
preted from a stimulus-response (SR) point 
of view or within the framework of activation 
‘theory (Malmo, 1958). An S-R formulation 
would treat verbal instructions as the inde- 
pendent variable and heart rate and per- 
formance on the coding task as dependent 
variables. It was found that frustrating 
instructions quite consistently resulted in an 
- increment in the heart rate (with only one 
exception), while neutral instructions failed 
to bring about such an increase. The relation- 
ship of differential instructions and the sec- 
ond dependent variable, performance on the 
coding task, appeared to be more complex. 
Neutral instructions brought about a slight 
` but uniform increment in performance score 
for almost all subjects. This could be expected 
as a result of practice. The frustrating in- 
structions also led to an overall increment in 
performance; however, the distribution of 


* performance scores in these groups showed 


much greater variability. Part of the subjects 
failed to improve their performance score or 
actually showed a decrease, while others 
exhibited much larger improvements in per- 
formance scores relative to their nonfrustrated 
controls. 
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Within the S-R framework, this observed 
differential effect of frustrating instructions 
on performance has been typically explained 
on the basis of individual differences in 
learned reactions to imposed frustration. 
While this theorizing may be valid, it would 
be a post hoc explanation in the present 
study, which attempts to predict perform- 
ance on the basis of a subject’s heart rate. 
Reliance on the aspect of this approach which 
postulates the evocation of competing, task- 
irrelevant responses under stress appears to 
be tenuous since both the low- and high-drive 
groups were given the same frustrating in- 
structions, and both groups had essentially 
the same heart-rate response to those instruc- 
tions. Indeed, prior research using this type 
of performance task found no evidence of 
more wrong responses among high-drive— 
frustrated subjects, even as the task became 
more complex (Hokanson & Burgess, 1964). 

The results of the present study are, 
however, consistent with the inverted U hy- 
pothesis proposed by Malmo (1958). This 
hypothesis states that performance level 
should increase as general activation increases 
up to a moderate level, and thereafter de- 
crease as activation increases from moderate 
to high levels. If the assumption is made 
that heart rate is an indication of activation 
level, and that subjects with heart rates fall- 
ing into the lowest quartile of the distribu- 
tion of heart rates perform at a low activation 
level, then their performance level should 
increase as they are subjected to manipula- 
tions that increase heart rate. This prediction 
was upheld by the present study. Subjects 
with resting heart rates in the lowest quartile 
of the distribution showed a significantly 
greater increase in performance following the 
frustrating instructions than any other sub- 
jects. This held true for all age groups. In 
turn, if the assumption is made that subjects 
with heart rates falling in the highest quartile 
of the heart-rate distribution already are per- 
forming at or near the optimal level, then 
an operation increasing heart rate should 
bring about a decrease in level of perform- 
ance relative to the practice effect shown by 
the control subjects. Again, the data largely 
indicated that this was the case. The high 
heart-rate subjects showed the least increase 
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in performance after receiving the frus- 
trating instructions, relative to all subject 
groups. This was again the case in all age 
groups. Finally, applying the above model 
to the medium-drive subjects who were frus- 
trated, it was expected that change in neither 
direction should be found. Reference to 
Figure 1 indicates that this indeed was the 
case when viewed across all age groups. 

While the relationship between heart rate 
and performance investigated in the present 
study is consistent with the inverted U curve, 
several considerations tend to limit the ap- 
plicability of this hypothesis. One of them is 
the finding that the low, medium, and high 
heart-rate subjects did not differ in initial 
performance scores. It has been assumed that 
heart rate is a valid indicant of arousal and 
the point at which an individual’s heart rate 
is located on a distribution of heart rates in 
a population allows a reasonable estimate of 
that individual’s current activation level. If 
true, then the high heart-rate subjects should 
have had higher initial performance scores 
than the medium and low heart-rate subjects. 
This pattern of results was obtained in two 
earlier studies using college-age subjects 
(Burgess & Hokanson, 1964; Hokanson & 
Burgess, 1964), but not with the current 
samples of children. 

Another finding not easily explained by the 
inverted U hypothesis is that the low heart- 
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rate subjects in Age Group 7 showed the 
expected increase in performance level fol- 
lowing frustration; however, this increase 


was not accompanied by a significant increase ` 
in heart rate. Malmo (1962), however, has _ 
pointed out that activation is a phenomenon “ 
of slow changes, of drifts in level with a time © 
order of minutes or hours. It is possible that ` 


the increase in level of activation was not 
reflected in the subjects’ heart rates until 
after the measures were taken. 

While the current data do not allow any 
unequivocal conclusions, perhaps the critical 
finding of this study is that performance of 
subjects in different age groups, following the 
experimental treatment, could be reasonably 
predicted on the basis of their resting heart 
rate level. 


REFERENCES 


Burcess, M., & Hoxanson, J. E. Effects of increased 
heart rate on intellectual performance. Journal of 
Abnormal and Social Psychology, 1964, 68, 85-91. 

Hoxanson, J. E., & Burcrss, M, Effects of physio- 
logical arousal level, frustration, and task com- 
plexity on performance. Journal of Abnormal and 
Social Psychology, 1964, 68, 698-702. 


Marmo, R. B. Measurement of drive: An unsolved — 


problem in psychology. In M. R. Jones (Ed.), 
Nebraska symposium on motivation: 1958. Lin- 
coln: Univer. Nebraska Press, 1958. Pp. 229-265. 

Marmo, R. B. Activation. In A. J. Bachrach (Ed.), 
Experimental foundations of clinical psychology. 
New York: Basic Books, 1962. Pp. 386-422. 


(Received November 20, 1963) 


Journal of Personality and 


Social Psychology 
1965, Vol. 2, No. 1, 77-83 
° 


HYPNOTIC ALTERATION OF SOCIAL ATTITUDES? 


JAMES G. STACHOWIAK 


University of Kansas 


anp C. SCOTT MOSS 
National Institute of Mental Health, San Francisco, California 


This study measures the effectiveness of influencing S attitudes toward Negroes 
through the medium of a hypnotically administered communication. Precision 


was added by conceptualizing the 


interaction between 2 variables, the concept 


to be influenced (Negro) and the source of the influence (hypnotist), within 


the theoretical model provided by Osgood’s principle of congruity. Ss who were 


exposed to a positive communication about Negroes while they were hypno- 
tized showed significantly greater attitude change than did Ss who were ex- 


posed to the same 
by the congruity 


magnitude of the changes. 


Since hypersuggestibility is generally iden- 
tified as a major characteristic of hypnosis, 
one might expect that there would be con- 
siderable research directed toward measure- 
ment of the effectiveness of manipulating 
attitudes and behavior through the medium 
of hypnosis. This is not the case, however, 
at least not in any controlled, quantifiable 
sense. In a recent article Orne (1962) stated 
‘that it has long been held “that hypnosis 
increases the control of the hypnotist over the 
subject; however, to our knowledge this as- 
sumption has never been put to experimental 
test [p. 142].” By this he meant that the 
role contexts in which hypnosis occurs gen- 
erally possess a high degree of social control, 
and before it is possible to arrive at an 
evaluation of any increment in the degree of 
social control by hypnosis, it is essential to 
isolate this variable. Orne then goes on to 
formulate a basic consideration: “Does hyp- 
nosis increase the amount of social control 
over the subject above that which existed 
before the induction of hypnosis [p. 143]?” 

In the present experiment, an attempt was 
made to objectively and quantitatively deter- 
- mine the effectiveness of hypnosis in altering 
an established social attitude. The experiment 
was cast in the theoretical framework of the 
“principle of congruity” as formulated by 


1Portions of this paper were presented at the 
meeting of the American Psychological Association 
in Philadelphia, August 30, 1963. 


ell 


17 


communication in the “waking state.” Predictions generated 
principle held up w 
change, but insufficiencies were evident with regard 


with respect to direction of attitude 
to predictions of the 


Osgood and Tannenbaum (1955). This prin- 
ciple takes into account the interactions be- 
tween a source of influence, the concept to be 
influenced, and the nature of the assertion or 
communication made about the concept by 
the source. The experiment was specifically 
directed toward providing an answer to the 
following question: What is the influence of 
hypnosis in a situation where a subject is 
faced with a situation of “incongruity,” such 
that a favorable source (in this case the 
experimenter-hypnotist) delivers a positive 
communication about a less favorable concept 
(Negro)? 

As will be apparent, the experiment to be 
reported has methodological limitations; how- 
ever, its basic value is as a paradigm for 
objectifying the especial influence quality of 
the hypnotic relationship. 


METHOD 


Forty male undergraduate students volunteered for 
a study represented only as involving hypnosis and 
were randomly assigned to an experimental and a 
control group. None of the subjects had previously 
experienced hypnosis. 

Each subject in the experimental group was seen 
twice individually with a 15-18 day interval elapsing 
between appointments. At the beginning of the first 
session’ he rated six concepts on a form of the 
semantic differential: experimenter, Negro, com- 
munist, Japanese, Jew, and Republican, All concepts, 
with the exception of experimenter, were also rated 
on a modified form of the Bogardus Social Distance 
Scale. Subjects were then given the standard hyp- 
notic induction presented in the Stanford Hypnotic 
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Susceptibility Scale, Form A (Weitzenhoffer & Hil- 
gard, 1959). Just prior to arousal they were read an 
11-minute statement designed to positively influence 
their attitudes toward Negroes. After termination of 
hypnosis, subjects were again asked to complete the 
same» rating forms. At the second appointment, sub- 
jects were requested to fill out the materials once 
more. A code and drop-box arrangement allowed 
subjects to retain anonymity in all of their ratings. 
They were repeatedly cautioned not to discuss 
their experiences until a week following the second 
appointment. 

Subjects in the control group followed the same 
procedure with the exception that they were not 
hypnotized until the conclusion of the second session, 
after they had completed their third set of ratings, 
In an effort to provide the control subjects with a 
time period comparable to that spent by the experi- 
mental subjects in being hypnotized, the experi- 
menter engaged each subject in a friendly discussion 
of course work, vocational plans, hobbies, and so 
forth. 

The 11-minute influence presentation which was 
administered to both groups was obtained largely 
from The Nature of Prejudice (Allport, 1954), and 
was comprised of four sections: 

1. A general statement of the relationship between 
treatment of the Negro in the United States, and 
our threatened position of international leadership, 

2, An explanation of stereotyped beliefs and 
attitudes, 

3. A description of how unconscious psychological 
mechanisms relate to prejudiced beliefs. 


4, Suggestion of a solution through migration and 
intermarriage, 


Hypotheses 


Two hypotheses, derived from the congru- 
ity principle, concern the Prediction of the 
direction and magnitude of attitude change 
toward the source and concept within the 
experimental and the control groups. 

Hypothesis I. Direction of the attitude 
change: Attitude change toward a source and 
a concept related by an assertion will be in 
the direction of greater congruity. 

According to one of the corollaries derived 
from Osgood’s congruity principle, whenever 
two differently polarized objects of judgment 
are related by an associative assertion, the 
less polarized object of judgment becomes 
more polarized, and the more polarized object 
of judgment becomes less so (Osgood & Tan- 
nenbaum, 1955), It would thus be predicted 
that the Negro (slightly Positive concept) 
will become more positive, and the experi- 
menter (quite positive source) will become 
less positive, if the experimenter makes a 


positive assertion about the Negro. It should 

be noted that this prediction is expected to 
hold for the control group, but not for the 

experimental group, since the anticipated ef- 

fect of hypnotizing the subjects would be to | 
enhance the esteem of the experimenter, thus 

increasing the polarization of an already 
favorable source. 3 

Hypothesis II. Magnitude of the attitude 
change: (a) The amount of positive attitude 
change toward the concept will be propor- 
tional to the degree of favorableness of the 
original attitude toward the source. (b) The 
amount of negative attitude change toward 
the source will be proportional to the degree 
of favorableness of the original attitude 
toward the concept. 

Two additional hypotheses relate to the 
effects of hypnosis on the attitude changes 
of the subjects. 

Hypothesis III. The amount and per- 
sistence of positive attitude change will be 
significantly greater for subjects who are 
exposed to a persuasive communication while 
they are in hypnosis. 

Hypothesis IV. The amount and persist- | 
ence of positive attitude change induced will 
be proportional to a subject’s susceptibility. 
to hypnosis. 

On the basis of an assumed relationship 
between waking and hypnotic suggestibility, 
it was predicted that even among the control 
subjects there would be a greater positive 
impact of the assertion on the concept, both 
immediate and long term, and a lessened 
amount of attitude change in a negative 
direction toward the experimenter, for 
those subjects who evidenced greater hypnotic ` 
susceptibility, 


RESULTS 


The basic semantic differential data sub- 
mitted to statistical analyses consisted of 
two sets of “difference scores” for each sub- | 
ject: one set was obtained by subtracting < 
each subject’s first rating of each concept 
from his second rating, thereby yielding an 
index of “immediate” attitude change; the 
second was obtained by subtracting the first 
ratings from the third ratings, providing 4 
measure of the “persistence” of the induced 
attitude change. 
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Comparison between Change Scores for 
Experimental and Control Concepts 


To test whether the change scores on the 
*’ four control concepts differed significantly 
*. from one another, the Friedman two-way 
analysis of variance for related samples was 
` applied. Analyses were carried out on both 
* sets of change scores, for the experimental 
and control groups. Since the obtained x*r did 
.-not reach the .05 level on any of the analyses, 
it was assumed that the change scores for the 
control concepts did not significantly differ 
from one another. These scores were then 
+» pooled within each group in order to next 
determine whether the subjects’ change scores 
for experimenter and Negro were significantly 
different from the means of their change 
scores for the control concepts. The Wilcoxon 
matched-pairs signed-ranks test was employed 
and the results are presented in Table 1. 

For the experimental group the mean 
change score for the concept Negro was sig- 
nificantly more positive than the mean change 
score for the control concepts, for both sets 
of scores. For the control group, the change 
scores for Negro did not differ significantly, 
but change scores for experimenter were sig- 
‘nificantly more negative than the means of 
the change scores for the control concepts on 
both sets of ratings. 


Direction of Attitude Change 


From Hypothesis I it was predicted that, 
for the control group, immediate attitude 
change would be in a positive direction for 
Negro and in a negative direction for experi- 
menter. For the experimental group, it was 
predicted that change toward both Negro and 
. experimenter would be in a positive direction. 
‘>. The predicted direction of change was con- 
~" firmed in all four cases. It can be seen in 
Figure 1 that the original location of experi- 
menter on the 7-step semantic differential 
ze gtid was approximately the same for both 

groups, and the resulting attitude change was 
in opposite directions for the experimental 
and control groups. The original location of 
Negro was slightly more favorable for the 
i, control group than for the experimental 
group; however, the greater obtained change 


TABLE 1 


DIFFERENCES BETWEEN CHANGE SCORES FOR THE 
EXPERIMENTAL AND CONTROL CONCEPTS 


Index Change scores |Wileoxon Z 
Experimental group 
Negro versus mean of | Ratings 1-2 | 2.40* 
control concepts Ratings 1-3 | 2.33* 
Experimenter versus Ratings 1-2 | 1.29 
mean of control Ratings 1-3 295 


concepts 
Control group 
Negro versus mean of 
control concepts 
Experimenter versus 
mean of control 
concepts 


Ratings 1-2 04 
Ratings 1-3 52 
Ratings 1-2 
Ratings 1-3 


Note.—All probabilities are one-tailed and significant dif- 
ferences are in the direction: Negro > mean of control concepts 
for the experimental group; experimenter < mean of control 
concepts for the control group. 


in the experimental group resulted in Negro 
becoming more favorable for this group. 

The observation of greater congruity be- 
tween attitudes held toward experimenter and 
toward Negro can be demonstrated even more 
strikingly by comparing the amount of dis- 
tance between each subject’s ratings of these 
concepts on the semantic differential for 
Ratings 1, 2, and 3. It would be expected 
that the amount of distance would tend to 
be smaller on Ratings 2 and 3 (after expo- 


RATINGS 


l z II exTREMELY 


POSITIVE 


QUITE 
POSITIVE 


SLIGHTLY 
POSITIVE 


DEGREE OF POSITIVE POLARIZATION 


NEUTRAL 
@ EXPERIMENTAL GROUP 


{ H 
! H 
-| OCONTROL GROUP : l 
H 

i 


Fic. 1. Means of the three “ratings toward experi- 
menter and Negro on the semantic differential grid 
for the experimental and control groups. 
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TABLE 2 


ANALYSIS OF THE CHANGE IN AMOUNT OF DISTANCE 
BETWEEN Ratincs OF EXPERIMENTER AND 
NEGRO ON THE SEMANTIC DIFFERENTIAL 


Index Wilcoxon Z 
Experimental group 
Ratings 1-2 1.45 
Ratings 1-3 234" 
Control group 
Ratings 1-2 3.70*** 
Ratings 1-3 2.56** 
* 
"p So: 
wD < 0001. 


sure to the communication) than the amount 
of distance present before exposure to the 
communication. The Wilcoxon matched-pairs 
signed-ranks test was again employed, and the 
results are presented in Table 2. 

The highly significant results for the con- 
trol group lend considerable support for 
Hypothesis I. The analysis for Ratings 1-2 
within the experimental group also approach 
significance at the .07 level. 


Magnitude of Attitude Change 


Two specific predictions were made con- 
cerning the amounts of change to be expected 
toward Negro and toward experimenter. First, 
it was hypothesized that the amount of posi- 
tive change toward Negro should be propor- 
tional to the degree of favorableness of the 
original attitude toward experimenter. This 
prediction was tested by applying the Spear- 
man rank-correlation coefficient. The results 
were in the predicted direction for both the 
experimental and the control groups, but 
neither of the coefficients reaches significance. 

With respect to predicted amounts of 
change toward experimenter, the resulting co- 
efficients were again in a positive direction, 
but quite negligible in magnitude. Thus, there 
appeared to be no significant relationship 
between the subject’s original attitudes 
toward Negro and the amount of obtained 
change toward experimenter. 


Effects of Hypnosis 


An analysis of the scores obtained on the 
Stanford Hypnotic Susceptibility Scale for 
the subjects within both the experimental and 
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control groups revealed no significant differ- _ 


ence in the hypnotic susceptibility of the two — 


groups. Both groups averaged above the 


Stanford norms, the mean for the experi: 


mental group being 7.10 and for the control 
group, 7.05. In each group 50% of the sub- 


jects scored between 8 and 12 on the Stanford ` 


Scale, which is the interval assigned to highly 
susceptible individuals. 

It was expected that the experimental and 
control groups would differ significantly with 
respect to the amount and persistence of the 
obtained positive attitude changes toward 
experimenter and toward Negro. These pre- 
dictions were tested by means of the Mann 
Whitney test for two independent samples, 
and the results are presented in Table 3. 

It can be seen that the amounts of im- 
mediate positive attitude change toward 
experimenter and Negro are significantly 
greater for the experimental than for the 
control group. With respect to experimenter, 
this means that while the attitudes of the 
control group become /ess favorable, the atti- 
tudes of the experimental group become 


slightly more favorable. With respect to ` 


Negro, both groups change in a positive direc- 
tion, but the amount was significantly greater 
for the experimental group. 

Analyses with respect to persistence of 
attitude change again revealed that the two 
groups differed significantly. While attitude 
change toward the Negro remained substan- 
tially stable for the experimental group, it 
diminished for the control group, with the 


TABLE 3 


COMPARISON OF THE AMOUNT AND PERSISTENCE OF 
Positive AtrirupE CHANGE TOWARD EXPERI- 
MENTER AND NEGRO FOR THE EXPERIMENTAL 
AND CONTROL GROUPS 


Difference between groups 


Concept Mann-Whitney 
Ratings 1-2 
Experimenter 2.85** ; 
Negro 1.76* 
Ratings 1-3 
Experimenter 1.89* 
Negro 1.69* 


Note.—In all cases, the direction of difference is experimental 
> control. 
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- attitudes returning to their original location 
on the semantic differential grid. It should 
be noted that the experimental subjects’ atti- 

a” tudes towards experimenter became slightly 

* less favorable than they were originally. The 
“finding of a significant difference between the 
. experimental and control groups, however, 

results from the fact that the latter group 
tended to become even less favorable than 

Ie they were directly after being exposed to the 

‘persuasive communication. $ 

ə It was predicted that both the amount 

and persistence of positive attitude change 

. would be proportional to the degree of hyp- 

notic susceptibility. The Spearman rank- 
correlation coefficient (rho) was again uti- 
lized, in order to obtain a measure of the 
association between the various sets of vari- 
ables. In terms of immediate alterations in 

_ attitude there was only one significant 

- change: for the control group there was a 

negative correlation with respect to change 
toward experimenter and hypnotic suscepti- 
bility beyond the .01 level. Such a finding 
suggests that while the amount of change 

~ induced may be proportional to a subject’s 
hypnotic susceptibility, the direction of 
change is not readily predictable from the 
nature of the influence attempt. 

For the experimental group the persistence 
of change toward Negro correlated with hyp- 
notic susceptibility beyond the .05 level, and 

_, the correlation between persistence of change 

“toward experimenter and hypnotic suscepti- 
bility approaches significance ( = .06). The 

» coefficients for the control group did not 

reach significance in this instance. 


Social Distance Scale 


Tt had originally been planned to submit 
change scores on the Social Distance Scale to 
analyses similar to those undertaken for the 
a change scores on the semantic differential. It 
” was found, however, that the ordering of the 
‘concepts along the scale by the subjects was 
highly similar for both groups, with little 
change from one rating to another. Although 
the original rankings of the concepts remained 
quite stable and resistant to change, it should 
be noted that the change in ratings of the 
Negro for the experimental subjects was 
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greater than the change in ratings on any of 
the other concepts for either group. 


DISCUSSION 


A major corollary derived from Osgodd’s 
theory is that, if two positive but unequally 
polarized concepts are associated by a posi- 
tive assertion, the less polarized should be- 
come more polarized, and the more polarized 
concept less so. In the present investigation 
the effect of introducing the variable of 
hypnosis into this situation altered the ordi- 
narily obtained results in both the experi- 
mental and control groups; that is, in the 
former both the concepts Negro and experi- 
menter changed in a positive direction, while 
in the latter there was a marked decrease in 
the positive connotation of experimenter. 
All changes were in the predicted direction, 
however, in accordance with the expected 
effects of hypnosis upon the esteem of the 
experimenter. 

Hypothetically, the effect of hypnotizing 
the subjects was to increase the degree of 
positive polarization for the experimenter, 
and his positive communication about the 
Negro created a greater pressure toward 
congruity than was present in the control 
group, since the distance between attitude 
ratings toward experimenter and Negro was 
presumably greater at this point. The positive 
attitude toward Negro persisted over the 2- 
week interval, despite a decline in favorable- 
ness toward the experimenter. Further re- 
search would be necessary in order to clarify 
this effect. 

Some insufficiencies are evident with regard 
to predictions of the magnitude of the ob- 
tained attitude changes. While changes in 
direction were all as predicted, the magnitude 
of changes in both groups was small (on the 
average much less than one scale unit), and 
consequently it could not be demonstrated 
that the amount of change for either concept 
was proportional to the original attitude 
toward the other concept. According to 
Osgood’s theory, one would predict that the 
less polarized object, Negro, should change 
more than the more polarized, experimenter. 
For the control group, however, the mean 
attitude change in a negative direction toward 
experimenter was almost twice as large as 
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the change in a positive direction toward 
Negro.” 

It may be that the degree of polarization 
cannot be directly equated with resistance to 
change as postulated by Osgood: for example, 
an individual may hold an “intensely neutral” 
attitude. Another explanation is that waking 
subjects are less receptive to directive at- 
tempts to manipulate their attitudes, and 
displayed their resentment by discrediting 
the source but not the object of the assertion. 
A third, plausible explanation resides in the 
fact that subjects volunteered with an ex- 
pectation of being hypnotized. The negative 
reaction of the control group towards the 
experimenter could be attributable to an 
experience of disappointment over not being 
hypnotized. It is noteworthy that those who 
developed the strongest negative reaction 
were subsequently demonstrated to possess 
the highest tested hypnotic susceptibility. 
Needed to clarify this issue is a third group 
in which this factor of expectation is con- 
trolled by avoidance of any mention of 
hypnosis, or a group of subjects who are 
hypnotized but not given the influencing mes- 
sage. It would also have illumined the situa- 
tion if the control group could have ulti- 
mately had exposure to the assertion during 
their experience in hypnosis. 

One final, important factor, directly rele- 
vant to the present study, is that the congru- 
ity principle does not provide adequately for 
differences in the complexity and the degree 
of intensity of the assertion made by the 
source. An assertion is identified merely as 
being “associative” or “dissociative.” A de- 
liberate attempt was made in the present 
experiment, for example, to develop a strong, 
favorable, rational communication about the 
Negro. However, the experimenter’s stated 
position on intermarriage may have served to 


2 In a personal communication, Osgood has raised 
the question of whether the obtained original ratings 
of Negro were an adequate representation of the 
subjects’ “true” feelings. That is, if the “true” 
original locus of Negro on the semantic differential 
grid was —2, the obtained change would be in close 
agreement with the change predicted by the con- 
gruity principle. This raises the possibility of using 
congruity shifts as estimates of subjects’ “true” 
attitudes toward socially legitimized concepts. 


place the focus of the communication on the. 
experimenter in a negative fashion for some 
of the subjects. They may have experienced 
the experimenter as putting them in & 
“double-bind,” with the result being that 
they responded in a contradictory or aml 
valent fashion. It should be noted that this 
is not unlike the situation frequently encoun 
tered in psychotherapy, where the therapist's 
assertions (interpretations) are often complet 
and involved, and may be met with consider 
able ambivalence on the part of the patient, 
In any case, further research on the nature 
and intensity of the assertion is definitely! 
indicated. i 
The predictions concerning the increased, 
effectiveness of presenting the persuasivd 
communication to subjects in hypnosis weté 
generally confirmed, both with respect 
immediate and persisting changes in attitude: 
However, the expected correlation between 
the degree of measured susceptibility to hyp 
nosis and the amount and persistence of atti 
tude change was not substantiated. There iš 
apparently no simple and direct relationship 
between these two behaviors. These resul 
would appear to be consistent with the com 
clusions of experienced hypnotherapists, suci 
as Gill and Brenman (1959), who stalé 
“there is no correlation between the depti 
of hypnosis obtainable in a patient and the 
therapeutic results [p. 333].” The mere fat 
of being subjected to a formal hypnotic pr 
cedure seems sufficient to increase the peti 
suasibility of some subjects, independent 
the degree of hypnotic susceptibility ® 
measured on an instrument such as 
Weitzenhoffer-Hilgard scale. Hilgard? ba 
suggested the possibility that “the genera 
attitude of relaxation and detachment, giving 


matters, and an interesting control would 
to use people simulating hypnosis in ; 
same situation.” It could thus be determined 
whether persons simulating hypnosis wout 
also be susceptible to the social influe 
Procedure, even though they themselves Wet 
not (strictly speaking) hypnotized. 

The inability to obtain significant chang 
in ratings on the Bogardus scale suggest 


2 Personal communication, 1963. 


that this particular influence attempt might 
be expected to have little or no effect on 
the subject’s actual behavior with respect to 
. “minority groups such as Negroes. The 
*semantic differential measurement technique 
is apparently more sensitive to small or re- 
«stricted changes than the gross units of the 
. Bogatdus scale. At the same time, however, 
the Bogardus scale may be considered to be 
.. Closer to the “action level,” that is, it is 
“more directly related to how a person might 
¿actually behave in social situations. It is 
relatively easy for a person to report a 
. change in his attitudes without necessarily 
. changing his behavior. Then, too, it is impor- 
tant to emphasize that in the present study 
it was in no way suggested that the subjects 
" were to “change” their behavior. Being 
* “good hypnotic subjects’ they may be 
| considered to have performed quite literally 
+. in accordance with the hypnotist’s suggestions. 
Although ethical considerations are in- 
volved, it would be of interest to attempt to 
increase the magnitude of the presently ob- 
tained attitude changes through repeated 
‘inductions with the same subjects. While it is 
: possible that this might result in a greater 
degree of change in attitude ratings, it could 
- just as easily affect a subject’s receptivity to 
the experience of being hypnotized. The hyp- 
notic interpersonal relationship is highly in- 
volved, and the question of the hypnotist’s 
assertions (suggestions) and their reception 
“fs a matter of extreme importance for future 
research, 
» The effort to educate and indoctrinate, to 
alter and influence behavior is a paramount 
feature of human social intercourse. Most 
_ experiments in this area have involved simple 
measurement of the magnitude of change as 
» function of gross content. Osgood’s prin- 
“ciple of congruity provided a theoretical 
framework for isolating and measuring with 
2 considerable exactness the interaction between 
“tue source of an assertion about specific 
concepts. While it is difficult to generalize 
from the relatively simple experiment re- 
Ported here to specific problems in everyday 
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life, there is obvious possible extension to 
such situations as psychotherapy. 

It seems demonstrated that, as frequently 
maintained, hypnosis precipitates an especial 
rapport or transference reaction, albeit the 
exact nature of the relationship remains to be 
established. To many modern-day psycho- 
therapists the employment of suggestion, par- 
ticularly in association with hypnosis, is an 
anathema. Yet, psychotherapy is in essence 
only a professionalized effort to persuade 
people to adopt new ideas and different ways 
of behaving. Whether or not the clinician 
approves, suggestion, direct and covert, is 
an integral and important aspect of the thera- 
peutic relationship (Frank, 1961). Thus, 
Alexander and French (1946) have stated 
that, “Methodical psychotherapy is, to a 
large degree, nothing more than the system- 
atic conscious application of those methods 
by which we influence our fellow man [p. 
viii].” No substantial progress can be made 
in evaluating the efficacy of the many com- 
peting systems of psychotherapy until the 
pervasive common element of suggestion is 
isolated and controlled. This study presents 
a basic method of broad applicability for 
objectifying this important aspect of the 
psychotherapeutic transaction. 
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EVALUATIONAL REACTIONS OF JEWISH AND 
ARAB ADOLESCENTS TO DIALECT AND 
LANGUAGE VARIATIONS* 
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McGill University 


Jewish and Arab adolescents in Tel-Aviv and Jaffa were asked to evaluate the 
personality characteristics of speakers from taped recordings of their voices, 
some using different dialects of standard Hebrew (Yemenite and Ashkenazic), 
others using Arabic. The Ss were kept unaware that the speakers were bi- 
linguals who spoke in different dialects or languages during the sequence of 
recordings. Comparisons of evaluational reactions made to the matched guises 
were analyzed and compared with standard measures of attitudes toward 
Yemenite Jews, Ashkenazic Jews, and Arabs. A hypothesis is proposed that 
the matched-guise technique reflects stereotyped attitudinal and perceptual 
reactions of a more private nature than do standard measures of attitudes. 


Several recent investigations have demon- 
strated that one’s perception of another per- 
son is markedly modified when the latter 
changes his speech style from one dialect or 
language to another. Lambert, Hodgson, 
Gardner, and Fillenbaum (1960) showed that 
English-speaking Canadian college students 
judged the personalities of bilingual male 
speakers much more favorably when listening 
to taped recordings of them speaking English 
in contrast to the style of French typically 
used by French Canadians. This finding held 
for both monolingual English Canadians and 
those who were bilingual in French and thus 
able to understand the French as well as the 
English content of the recordings. On the other 
hand, college-age French-Canadian listeners 
perceived the same bilingual speakers more 
favorably when they spoke in English—a 
result that was interpreted as a minority 
group’s adoption of the majority group’s 
views. These negative views, however, were 
apparently restricted to Canadian-style 
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French since neither the English nor Frendi 
Canadian listeners downgraded to any notice 
able degree one speaker who used Europeatt 
style French, A more recent study by Anisfel 
and Lambert (1964) showed that youn 
French-Canadian subjects did not downgr 
the French-Canadian guises of _ bilingui 
speakers. In fact, 10-year-old monolingttl 
French-Canadian children had a much mo 
favorable view of the French Canadian M 
contrast to the English guises of young bi 
lingual speakers. On the other hand, Frendi 
Canadian children who had become ‘bili g 
in English, and who gave other indicati 
of being comfortably identified with b 
cultural groups, made few invidious compati 
sons one way or the other between the Frendi 
and English guises of bilingual speakers. 
Anisfeld, Bogo, and Lambert (1962) & 
amined listeners? reactions to speakers Wi 
changed from standard English to English § 
spoken by Jewish immigrants to North Amer 
ica. It was found that gentile listeners i 
acted less favorably to a speaker when ™ 
speech was accented than when it was j 
the standard English style, Jewish listen 
showed ambivalence in their evaluations ú 
persons with a Jewish accent, liking certai 
traits and disliking others. | 
The findings of these three studies indicat 
that speech variations play an important! at 
in social perception and that the techniatt 
of matching guises of the same speaker ™ 
be a useful means of examining the ster 
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typed thinking of members of one cultural, 
ethnic, or social group when they consider 
typical representatives of other groups. The 
major purpose of the present study was to 
apply the technique in Israel, a setting where 
” language and dialect variations are presumed 
. to have particularly important social and 
political implications. For example, with re- 
gard to language differences, one can examine 
the stereotyped images Jews and Arabs have 
of one another by comparing the changes in 
listeners’ perceptions of bilingual speakers 
whose Hebrew and Arabic guises are matched. 
Dialect differences are also of potentially 
great significance in Israel since people can 
be identified as Jews of European background 
by their Ashkenazic * style of Hebrew or as 
emigrants from Yemen by their distinctive 
dialect. In contrast to European Jews, the 
Yemenites in Israel are said to be regarded 


. „ as less sophisticated and possibly less cul- 


tured. Both the Arabic-Hebrew and the 
Ashkenazic-Yemenite contrasts were examined 
in the present investigation. 


i METHOD 
Subjects 


Jewish subjects. Forty high-school students, 15-16 
years of age, from a Tel-Aviv school noted for its 
high academic standards, volunteered to spend one 
morning of their 1961 summer vacation to partici- 
pate in the study. There were 28 girls and 12 boys 
in all. Most were born in Israel of European parent- 
age, except for 3 whose parents came from Syria 


few and Iraq. All students were taking a course in 


Arabic as part of their curriculum, The educational 
and economic backgrounds of the parents place the 
sample in Israel’s middle class. 

Arab subjects. The Arab students ranged in age 


=, from 13 to 18 years with a mean age of 14.5. They 


Yan belonged to a scout group in Jaffa. There were 
29 in all, 23 boys and 6 girls, They were from 
Giades 6 to 8 in Christian schools where instruction 
is conducted mainly in Arabic, although they also 


$» study Hebrew, English, and French. All spoke Arabic 


in their homes and half reported that they used both 


» Arabic and Hebrew with friends. Thus, the Arab 


subjects as a group were more bilingual in Arabic 


:, cand Hebrew than were the Jewish subjects. More 


than half of their parents were born in what is now 
Israel and the rest in other parts of the Arab world. 
The social class, estimated from what the youngsters 
stated their fathers’ occupations were, is generally 


_ ‘The Ashkenazic style of Hebrew speech refers 
in this paper to modern Hebrew as spoken in Israel 
by Jews of European descent, 
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lower than that of the Jewish subjects, making them, 
as a group, either lower-middle or upper-lower 
social class. 


Procedure j 


Jewish subjects. The experiment was conducted in 
an auditorium of the school normally attended by 
the subjects. All instructions and questionnaires were 
given in Hebrew by the second author, an Israeli. 
Tape recordings were made of speakers reading a 
2.5-minute philosophical passage (the same used in 
the previous studies mentioned) in either Hebrew 
or Arabic. Speakers were selected who were suf- 
ficiently bilingual in Hebrew and Arabic or in the 
Ashkenazic and Yemenite dialects of Hebrew to 
satisfy several competent judges that they would 
sound genuine to most listeners. One, who habitually 
used the Ashkenazic version of Hebrew, was able to 
imitate a Yemenite manner of speaking Hebrew so 
that it sounded nativelike rather than caricatured, 
A second speaker was a professional actor whose 
native dialect was Yemenite but who could imitate 
the Ashkenazic form quite well. Although it was 
possible to detect traces of Yemenite style in his 
standard Hebrew, his guises were still used in order 
to have one natural and one acquired manner of 
speaking the two dialects. 

Two speakers were chosen for the Hebrew-Arabic 
comparison. One was a senior high-school student 
whose native language was Hebrew and who had 
learned Arabic in school and through usage. He 
was a native of Israel of European parentage. The 
other was a young Arab who spoke Hebrew with 
an accent similar to that of Jews of Arabic-speaking 
background. They were judged to speak both lan- 
gauges well enough to be taken as native speakers 
in each of their guises by most listeners, 

The guises of any speaker were separated maxi- 
mally on the tape. The first voice heard on the 
tape—that of another speaker—was used as a filler 
to acquaint the subjects with the rating procedure. 

Arab subjects. The study was carried out in Jaffa 
in the building where the Scouts met regularly on 
Friday afternoon, with the Scoutmaster present. 
All instructions were administered by a professor 
of Arabic. For the Arab subjects, one speaker was 
the same Arab bilingual used with the Jewish sub- 
jects and the second was a Jew from Syria who 
was perfectly bilingual in Hebrew and Arabic, The 
two guises of speakers were maximally separated on 
the tape, and were preceded and interspersed with 
other speakers. Competent bilingual judges agreed 
that the variations in guises of the speakers sounded 
authentic. 

General. The instructional set given all subjects 
was to listen attentively to a series of speakers read- 
ing a standard passage and to imagine, much as 
one does when he hears a voice for the first time 
over a phone or on the air, what type of person 
each speaker was. Subjects were told that some 
would be speaking Hebrew and some Arabic, but 
they were to disregard the language differences, 
which purportedly were used to make the task more 


86 


interesting, and to concentrate on the speaker in 
order to make accurate judgments of his personality. 
They were given a rating sheet to be used for each 
voice listing the 20 traits presented in Table 1, with 
6-point rating scales, each labeled from “very little” 
to “very much,” They were to start rating whenever 
they got a clear notion of a speaker’s personality. 
About half of the subjects made their ratings of 
the traits in the order presented in Table 1 while 
the remainder had the traits listed in reversed order. 

At the bottom of each sheet, they were asked to 
express their feelings about the speaker on four 
social-distance scales: I would befriend this person; 
I would accept this person as a relative by mar- 
riage; I would accept this person as a neighbor; 
and I would help this person if he were in need. 
They answered “yes” or “no” to each question. 

It should be realized that the selection of speakers 
for both parts of the study was made with the help 
of Israeli teachers of Arabic and Hebrew. How- 
ever, we did not have the advice and assistance of 
professional linguists in this matter. There is no 
problem in the case of the speakers used with the 
Jewish subjects because an important control was 
used. After all voices had been rated, the tape was 
replayed for the Jewish subjects and they indicated 
what linguistic-ethnic group they thought each 
speaker belonged to. If they stated that an Ash- 
kenazic guise was that of an Ashkenazi or a Sabra 
(a native of Israel) and. that the matched guise 
was that of a Yemenite, their ratings for the pair 
were used for analysis in the Ashkenazi-Yemenite 
comparisons. For the Hebrew-Arabic guise compari- 
sons, the ratings of subjects were used if they 
identified one guise as Arabic and the other as either 
Ashkenazic or Sephardic (North-African Jews). 
However, we were not permitted to take the extra 
time for this check with the Arab subjects and 
therefore are open to the criticism of linguists who 
might question the judgments of language teachers 
about the adequacy of the speakers used. 

Standard attitude measures. This part of the study 
was carried out with the Jewish subjects only since 
we were advised not to administer standard attitude 
questionnaires to the Arab subjects. When the Jewish 
subjects had completed the rating of personality 
traits, they were given a second booklet that in- 
cluded three different measures of attitudes. 

1. The attitude rating scales made use of 18 bi- 
polar scales (see Table 2) for measuring attitudes 
toward Ashkenazic Jews, Arabs, and Yemenite Jews, 
Subjects were given instructions in the use of the 
scales, for example, to indicate whether Arabs in 
general were “interesting” or “boring” on a 6-point 
rating scale. These scales are similar in form to 
those used for the evaluation of speakers. 

2. An 18-item attitude questionnaire designed to 
measure attitudes toward Yemenites was then ad- 
ministered. Sample items were: Yemenite Jews 
contribute to the richness of the Israeli society; The 
intellectual capacities of the Yemenites are clearly 
Jower than those of Ashkenazic Jews. Subjects were 
given six choices for agreeing or disagreeing with 
each item in this and in the following questionnaire. 
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Eleven out of 18 were written with a negal 
theme. i 

3. Finally, a 7-item attitude questionnaire abo 
Arabs was administered. A sample item is: Aral 
must be considered a bad influence on Jewish 
ture and civilization. In this case, the Israeli ad- 
visors suggested that all the items have a negative 
theme. 


RESULTS AND DISCUSSION 


The algebraic difference (D score) betwe 
the ratings given to the two guises of 
speaker was obtained for each subject on each 
of the 20 traits. For the Jewish subjects, 
the ratings based on correct identification 
both guises of a speaker were employed in 
analysis. Thus, there were two D scores 
each trait for those subjects who correct 
identified the two guises of both speakers, 
only one D score for subjects who correc 
identified the two guises of one of the 
speakers. An average D score was taken for 
those subjects who had two D scores in or 
to obtain one entry for each subject on 
trait. For all Arab subjects, average D sco 
were used in the analysis since no data 0 
guise identification was available. 

For each trait, the sum of the D scores 
all subjects was tested for the statistical 
nificance of its departure from zero, Thi 
results of this analysis, using ¢ tests for 
correlated samples, are presented in Table 
Positive and negative signs were added to 
t values to indicate which guise was mi 
favorably evaluated. f 


Jewish Subjects 


Ashkenazic and Yemenite Jewish guise com 
parisons. The results are presented in the 
column of Table 1. The pattern of findi 
suggests that Yemenite in comparison to 4 
kenazic Jews were considered by Ashkeni 
Jews as less intelligent, ambitious, 
wealthy, as less socially acceptable (cl 
ness, prestige, potential neighbor), and as les 
honest and reliable. However, the Yemeni 
Jews were viewed as having a greater sé 
of humor. 

Hebrew and Arabic guise comparisons. 
results are presented in the second col 
of Table 1. The pattern suggests that t 


nod 
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TABLE 1 
COMPARATIVE EVALUATIONAL REACTIONS OF JEWISH AND ARAB SUBJECTS TO BILINGUAL GUISES 
OF SPEAKERS 
Reactions to Ashkenazic . 
and Yemenite Reactions to Hebrew and Arabic guises 
- guises 
A Jewish zablects Jewish gablects, Arable publents 
Trait 
Height 40 61 59 
Good looks 1.18 1.58 43 
Leadership * 1.36 35 —.20 
Sense of humor —2.73%* 2.25* .06 
Intelligence 350°" 1.08 —2.07* 
Honesty 3.35*** 3,06*** —1.83 
+.  Self-confidence —1.44 —.58 —2,22* 
*Friendliness —.14 1.96 —1.85 
Generosity 77 1.49 —.14 
Entertainingness —1.53 1,82 —.82 
Nervousness 49 —.13 —.82 
Good-heartedness 64 1.86 —2.64"" 
Reliability 3.41*** 1.48 -10 
Ambition 2.30* 71 —1.17 
Stability —.79 —.56 —.32 
Good character —.30 54 —.58 
Likability 1.01 .92 —1.45 
Wealth 2.60** —2.39* 07 
Cleanliness 4,746" AT —1.05 
Prestige 231%. 14 —.95 
Social distance responses 
1. Would befriend .82 2.16* —.25 
2. Would accept as relative by marriage 44 3.02*** —2.30* 
3. Would accept as neighbor 3.75%** 1,25 —.53 
„ 4. Would help if in need 1,00 65 —2.04 


Note.—Entries are two-tailed tests of significance based on a total N of 32 for the Jewish subjects and on N of 29 for the Arab 


Subjects, 


^ A positive sign indicates that the Ashkenazic guises were evaluated more positively than the Yemenite guises of the same 


speakers, Negative entries indicate the oj 


ppt, 
b Positive entries indicate that the Hebrew guise was judged more favorably and negative entries mean that the Arabic 


guise was judged more favorably. 
*p <.05, 


dD < .02, 
wry < 01, 


friendly, less honest, and less desirable as 
friends or potential marriage partners. 

It is not clear why the Arabs were viewed 
‘as more wealthy. This would not fit the social 
facts of Arab life in Israel, but it might 
reflect a wider view on the part of the sub- 

-Jects: they may have interpreted wealth in 
this case in terms of land possessions and 


» Seen their own country as small in comparison 


E to,the enormity of the Arab world. 


Arab Subjects 


Hebrew and Arabic guise comparisons. The 
Pattern of findings (third column of Table 1) 
Suggests that Arab subjects considered mem- 
bers of the Jewish group less capable (intelli- 
gence and confidence), less dependable (good- 


/ 


hearted, friendly, honest), and less desirable 
for marriage than their own people. They also 
tend to be more willing to help members of 
their own group than Jews in time of need. 

It is of particular interest that the Jewish 
and Arab subjects responded to representa- 
tives of one another’s group in mutually 
antagonistic manners in the sense that both 
samples of subjects saw their own group as 
more honest, friendly, good-hearted, and more 
desirable as relatives through marriage. Mu- 
tual distrust of this sort would certainly re- 
strict social interaction since members of 
neither group would initiate a friendly over- 
ture if they anticipated the others to react in 
a relatively dishonest, unfriendly, and selfish 
manner, 
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TABLE 2 


COMPARISONS OF JEWISH SUBJECTS’ ATTITUDE RATINGS 
OF ASHKENAZIC JEWS VERSUS YEMENITE JEWS 
AND OF ASHKENAZIC JEWS VERSUS ARABS 


Comparisons of | Comparisons of 
Ashkenazic and | Ashkenazic Jews 
Hale Yemenite Jews | and Arabs 

Interesting —5.01*** —.55 
Unprejudiced 80 1.64 
Courageous —1.17 2.33* 
Good looking 1.13 275" 
Friendly —2.50* 1,03 
Honest —4,25*** 1.03 
Intelligent 1.19 Seer 
Good-hearted —3.15*** 4.55*** 
Pleasant f —1.68 A117999. 
Good-mannered 2.:53** 5.19%** 
Truthful —1.60 2.58" 
Successful 10.50*** 9.279* 
Self-confident O77 4.49*** 
Permissive —1.19 3.67% 
Reliable 1.15 7.68%** 
Leadership 10.25*** 6.67*** 
Cheerful —1,82 2.52** 
Hard working —1.60 4.39%" 


Note.—The total N is 40. A positive entry signifies that 
Ashkenazi Jew is rated more favorably than the Yemenite Jew 
or the Arab. A negative entry signifies the opposite, 

Only the favorable adjective of any pair is presented here, 
The complete list of adjectives and traits used in this study, 
in both Dag may be obtained from the first author. 


Standard Attitude Measures and Evaluational 
Reactions to Matched Guises 


In Table 2 the comparative attitude rat- 
ings of Ashkenazic and Yemenite Jews, and 
of Ashkenazic Jews and Arabs are presented 
in detail for the Jewish subjects. It will be 
noted that a quite different pattern of results 


emerged when attitudes were measured by 
rating scales. In this case, the Jewish subjects 
considered Yemenite Jews significantly more 
interesting, friendly, honest, and good-natured 
than Ashkenazic Jews, but less good-man- 
nered, successful, self-confident, and leader- 
like. In the second column, it is apparent that 
Arabs were rated lower on nearly all traits, 
significantly so on courage, good looks, intelli- 
gence, good-naturedness, pleasantness, man- — 
ners, truthfulness, successfulness, self-confi- 
dence, permissiveness, reliability, leadership, 
cheerfulness, and capacity for hard work. 
Seven identical traits were used for rating 
both guises and attitudes toward the different 
cultural groups. These are listed in Table 3 
where the reactions evoked by the two pro- 
cedures are compared. There are remarkably 
sharp contrasts in these data. In terms of 
attitude ratings, Yemenite Jews were consid- 
ered by Jewish subjects as more friendly and 
honest, equally intelligent, reliable, and good 
looking, and less self-confident and leaderlike 
than Ashkenazic Jews. However, as revealed 
by the comparison of reactions to matched — 
guises, Yemenite Jews were viewed as Jess” 
honest, intelligent, and reliable, similar in 
friendliness and good looks, and somewhat 
more self-confident and leaderlike than Ash- 
kenazic Jews. In every case but one (good 
looks) where there is a reliable difference on 
one form of rating, it is not different oF 
reversed on the other form. The same type of 
contrasts are apparent in the reactions of 
Jewish subjects to Arabs. From the attitude 


TABLE 3 
COMPARABLE TRAITS FOR GUISE EVALUATIONS AND ATTITUDE RATINGS 4 
E 
Ashkenazic-Yemenite Jew comparisons Hebrew-Arab comparisons 
Trait Ashki ioy. i ma 
enazic- m, 
K eh aE ite eats: we ete Tews Hebrew-Arabic guises Ratings, ae pene 
t t 
Good looks 1.18 1.13 bar’ 
Friendliness —14 —2.50* 96 103 : 
Honesty 3.35e* 242s 3.06*** 1.03 “ 
Intelligence 3.50*#* 1.19 1.08 5.07 
Self-confidence —1.44 6.7744* TS 4.49 
Reliability Bait 115 1.48 7.68*** 
Leadership ~ 1.36 10.25*** 35 6.67*** 


Note.—Positive entries signify that the guise and attitude ratings for Ashkenazi H 'han for 
Arabs, and negative entries indicate the opposite. azim or Hebrews were more favorable ti 


Yemenite Jews or 
*p <.05. 
++p <01 
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ratings, Arabs were considered relatively less 
good looking, intelligent, self-confident, reli- 
able, and leaderlike people, whereas the view 
revealed in the guise ratings was that Arabs 
were less honest and friendly, but not reliably 
different as far as appearance, intelligence, 
self-confidence, reliability, or leadership is 
concerned. 

These contrasts may be due in part to 
procedural differences. The matched-guise 
technique directs attention to indiyiduals 
whose language style makes them representa- 
tive of a particular linguistic and ethnic group 
whereas the rating scale, like most standard 

‘measures, directs attention to an ethnic group 
in general, giving it a stereotyped label such 
as “Yemenite Jews.” Furthermore, in the case 
of the Hebrew-Arabic contrasts, the style of 
Arabic used by the individual speakers was 
classical, similar to the Arabic the subjects 
studied in school, whereas “Arabs” in general 
use the colloquial form of the language. 

Table 4 presents the intercorrelations of 
three measures: the evaluational reactions to 
matched guises, the attitude questionnaires, 
and the attitude rating scales. For the matched 
guises (Items 1 and 4 in Table 4), a mean D 

+ score was calculated for each subject by con- 
sidering his reactions to a particular pair of 
guises on all traits except height, nervousness, 
and wealth since they were not clearly evalua- 
tive in nature. The higher a subject’s score, 
the more favorable was his reaction to Ash- 
kenazic in contrast to Yemenite guises (Item 
1) or to Hebrew in contrast to Arabic guises 
(Item 4). 

For the attitude questionnaires (Items 2 


`, and 5 in Table 4) a total score was deter- 


mined for each subject by considering his 
reactions to all questions, the higher the 
score, the more favorable the attitude toward 
Yemenite Jews (Item 2) and toward Arabs 
(Item 5). Similarly, total scores for the atti- 
tude rating scale (Items 3 and 6 in Table 4) 


_ were determined by considering ratings on all 


18 scales, and again, the higher the score the 
more favorable the attitude toward Yemenite 
Jews (Item 3) and toward Arabs (Item 6). 
The correlations in Table 4 are low if one 
considers that all three procedures measure 
the same components of attitude. The fact 
that the rating scales and attitude question- 


TABLE 4 


CORRELATIONS OF EVALUATIONAL REACTIONS ANI 
ÅTTITUDE MEASURES: JEWISH SUBJECTS 


Item 1 PAETE, 
1. Ashkenazic-Yemenite 
Jewish guise comparisons | — | —.42* | —.39* 
2. Attitude questionnaire 
toward Yemenite Jews — Are 
3. Rating scales of attitudes 
toward Yemenite Jews == 
4 s 6 
4. Hebrew-Arabic guise 
comparisons — |—.36*| .00 
5. Attitude questionnaire 
toward Arabs — .40* 
6. Rating scales of attitudes 
toward Arabs — 


Note.—Comparisons of guises were determined by consider 
ing all traits except height, nervousness, and wealth, The highe 
the score, the more favorable is the evaluation on all 17 trait 
of the Ashkenazic or Hebrew guises over the Yemenite Jewis! 
or the Arabic guises, For the two attitude measures, the highe 
the score, the more favorable is the reaction to the group 

*p <.05,N = 32, 

HD < 01, 


naires correlate only in the 40s is difficult t 
explain; it may be due in part to difference 
of form and wording. The matched-guise an 
rating-scale techniques used the same ratin 
form but, as already noted, there are majo 
procedural differences that could account fo 
the low and inconsistent correlations. Th 
two measures correlate to a significant bu 
low degree in the case of the Hebrew-Yemen 


ite comparison (r = — .39), but not at all i 
the case of the Hebrew-Arabic compariso 
(r = .00). 


The correlations in Table 4 are similar i 
magnitude to those found in earlier studie 
using the matched-guise technique (Anisfel 
et al., 1962; Lambert et al., 1960) in that I 
scores based on ratings of matched guise 
showed low or insignificant correlations wit 
standard measures of attitudes. These lo 
correlations suggested that, in expressin 
evaluational reactions to the matched guise 
of speakers, subjects drew on stereotype 
about the groups in question in their attempt 
to make accurate judgments about the speak 
ers’ personalities and that community-wid 
stereotyped reactions may have played a mor 
important role in the personality judgment 
than did personal attitudes towards the group 
represented by the speakers, 
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The results of the present study, particu- 
larly the contrasts presented in Table 3, sug- 
gest a supplementary explanation for the low 
and inconsistent correlations noted in Table 
4. First, standard attitude items usually direct 
the respondent’s attention to ethnic or na- 
tional groups as a whole (e.g., The trouble 
with Jews is ....; or The more I get to 
know Negroes ....) while the matched- 
guise procedure directs the subject’s atten- 
tion to individuals whose distinguishable 
speech characteristics identify them as mem- 
bers of the groups in question. Furthermore, 
standard measures of attitudes are often 
transparent in the sense that subjects can 
comprehend what is being measured and can 
easily distort their actual attitudes by re- 
sponding in what they feel is a socially ap- 
propriate manner. The instructional set ac- 
companying the matched-guise technique, 
however, makes it unlikely that subjects will 
realize the true purpose of the procedure. In 
view of the marked contrasts between the 
two procedures revealed in the present study, 


the hypothesis suggests itself that the 
matched-guise technique, in contrast to stand- 
ard measures of attitudes, evokes more private 
emotional and conceptual reactions. This is, 
of course, only a hypothesis, for until further 
research directed specifically at this question 
is conducted there are as yet no empirical 
grounds for actually deciding which is the 
better procedure for evoking “private” or 
“uncensored” attitudes. 
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ATTITUDE INTENSITY AND THE SEMANTIC DIFFERENTIAL ' 


WILLIAM WEKSEL 
IBM Research Center, Yorktown Heights, New York 
anp JAMES D. HENNES 
Office of Instructional Resources, University of Illinois 


Polarization scores on the semantic differential have been said to represent 


attitude intensity (Osgood, Suci, & Tannenbaum, 1957). Logical instances can 
be cited where the degree of response or polarity will not reflect intensity, 


such as where an extreme attitude response is held with little intensity. Cor- 
relations between semantic differential polarization scores and independent 


intensity scores were obtained for 2 samples of college students, and samples 
of 10th- and 6th-grade students. The correlations were low (.31, 43, 47, and 
.62). It is concluded that polarity scores should not be equated with intensity, 
and that independent intensity measures should, in some cases, be used in 
conjunction with the semantic differential. - 


From its inception the semantic differential 
has attained great popularity among social 
psychologists and others interested in attitude 
measurement. The semantic differential has been 
linked to mediation theory and concurrently to 
a theory of connotative meaning. While the rela- 
tion between mediation theory and the semantic 
differential has been admittedly ad hoc (Osgood 
et al, 1957, Ch. 1), the sustained usage and 
popularity of the differential have created an 
unwarranted air of theoretical reputability. More- 
over, in the light of recent results in linguistics 
and semantics (Fodor, in press; Katz & Fodor, 
1963) it seems advisable to conceive of the se- 
mantic differential more simply as a fairly gen- 
eral technique for the measurement of attitudes. 
Séme empirical studies have suggested that the 
semantic differential is both valid and reliable as 
a measuring instrument (Miron, 1961; Osgood 
et al., 1957). However, most of these studies 
were not concerned with the fundamental as- 
sumptions underlying the semantic differential. 

The present paper examines one fundamental 
assumption which bears directly upon the valid- 
ity of the semantic differential as an attitude 
measuring instrument. According to Osgood et al. 
-, (1957, p. 192) the semantic differential yields 

information about the direction of an attitude 
(on a favorable-unfavorable continuum). This 


1The first author conducted part of this study 
while at the University of Illinois, and prepared the 
report while a National Science Foundation Post- 
doctoral Fellow at the Massachusetts Institute of 
Technology. 
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score is usually referred to as the polarization 
score or attitude polarity. Then the degree of 
favorableness or extremeness or polarity is 
equated with attitude intensity. 

Typically, attitude intensity is defined as the 
strength of feeling with which an attitude is 
held, The use of polarity as an intensity score is, 
in effect, utilizing the fold-over technique of 
intensity analysis frequently used with Likert 
items. The fold-over procedure eliminates ex- 
perimental independence of the two variables. 

This paper argues that the polarization score 
does not represent attitude intensity. 

The argument rests on the notion that there 
is no reason to believe that the degree of favor- 
ableness of an attitude determines the strength 
with which the attitude is held. By defining at- 
titude intensity in terms of the polarization 
score, Osgood et al. have essentially made the 
claim that degree of favorableness is equal to 
attitude intensity. This claim is logically unsup- 
portable. Obviously, it is quite possible to have a 
very favorable attitude toward a given attitude 
object without holding the attitude very strongly. 
For example, one may believe that ITALIANS are 
very happy (for example, +3 on the happy- 
unhappy scale) but not have any particularly 
strong feelings about this view. In the same way, 
a neutral attitude may be held quite intensely. 
For instance, one may feel quite strongly that 
BOULDER is neutral on the happy-unhappy scale. 

A fundamental requirement of any operational 
measure is that it allow us to represent con- 
veniently (i.e., in some reduced and shorthand 
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way) observable phenomena. As a minimum re- 
quirement, it must represent prior knowledge 
about the phenomena. The examples given above 
show that in many cases the semantic differential, 
in its present form, will fail to account for what 
is already known about attitude intensity. To 
the extent that it fails to account for intensity, 
the semantic differential does not meet this 
minimum requirement for an attitude measuring 
instrument which purports to measure both 
direction and intensity. 

The above argument does not rule out the 
possibility that in certain cases attitude intensity 
and polarity will be significantly related. Occa- 
sional positive relationships between polarity and 
intensity scores, however, do not provide a basis 
for assuming that polarity is identical with in- 
tensity nor that polarity can be used as a con- 
sistently effective measure of attitude intensity. 

If the polarity measure were to qualify as an 
adequate index of attitude intensity (i.e. if 
there were a satisfactory basis for assuming that 
attitude intensity equals polarity), then it would 
be expected that very high correlations would 
consistently occur between the polarization score 
and some independent measure of intensity. As 
is shown by the results of the two studies re- 
ported below, this is not the case. The low posi- 
tive correlations which generally characterize the 
relation between the independent measure of 
intensity and polarity suggest the extent to 
which the semantic differential fails to account 
for attitude intensity. 


PROCEDURE 


In the first study 73 college students were ad- 
ministered a series of semantic differential scales 
using certain stereotypes and abstract social cate- 
gories as concepts. In addition to the usual instruc- 
tions each subject was asked to place a number 
from 1 to 7 in a space provided alongside each 


TABLE 1 
Scares AND Concepts UsED IN First STUDY. 


Scale Concept 
Emotional-unemotional CUBANS 
Moral-immoral AMERICANS 
Disreputable-reputable ARABS 
Calm-excitable LATIN AMERICANS 
Delicate-rugged PACIFISTS 3 
Social-unsociable CONSERVATIVES 
Insensitive-sensitive PROSTITUTES 
Introverted-extroverted ITALIANS 

LIBERALS 
JEWS 
INTELLECTUALS 


TABLE 2 
SCALES AND CONCEPTS USED IN SECOND STUDY 


Scale Concept 
Good-bad TELEVISION 
True-false MY PARENTS 
Dull-interesting (actual name of subject 
Foolish-wise being taught in class) 


Unimportant-important 
Possible-impossible 
I like-I dislike 


INSTRUCTIONAL TELEVISION 
MY BEST FRIEND 
EDUCATION 


scale to “demonstrate how strongly you hold this 
particular view.” This score represents the intensity 
score for each item. For example: 


LATIN AMERICANS 
disreputable_:__:__:__:__:__:__reputable () 


It was explained to the subjects that they were to 
consider their intensity scores independently of 
their polarity scores. The 11 concepts and 8 scales 
shown in Table 1 were used in the analysis. The 8 
scales represent the four factors of “morality,” 
“excitability,” “sociability,” and “toughness” as de- 
termined in the development of the personality 
differential (Ware, 1961). 

The two scales loading on each factor and their 
respective intensity scores were summed for each of 
the four factors resulting in 44 polarity scores and 
44 intensity scores, The 44 correlation values òb- 
tained between intensity and polarity were then 
transformed into s scores. These scores were averaged 
and the resulting value was transformed into the 
corresponding mean correlation value. 

In the second study semantic differential scales 
were administered to samples of 69 sixth-grade stu- 
dents, 58 tenth-grade students, and 366 college 
freshmen and sophomores. In addition to the usual 
instructions for the semantic differential, each sub- 
ject was asked to indicate “how sure are you about 
the rating you just made?” This score represents 
cognitive certainty or intensity. For example: 


WORLD HISTORY 
good_:__:__ —:—bad 
sure 100 80 60 40 20 unsure 


The six concepts and seven scales shown in Table 2 
were used in the analysis. 

Correlations over all subjects in each grade be- 
tween each item and its intensity score were 0b- 
tained for each concept. The average correlation 
was then calculated for each grade level. Not a= | 
scales were used for each concept nor for all grade 
levels. The concepts my PARENTS and MY BEST — 
FRIEND were not used with the sixth grade. Thus 
the number of polarity-intensity correlations was 
different for each grade. For college freshmen there 
were 22 correlations, for tenth-grade students 26 
correlations, and for sixth-grade students there wet? 
20 correlations, 
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RESULTS AND Discussion 


In the first study the mean correlation for 
college students between polarity and intensity 
was .31. In the second study the mean correla- 
tion for college freshmen was .43, for tenth-grade 
students it was .47, and for sixth-grade students 
the mean intensity-polarity correlation was .62.? 

Two factors may be affecting the observed 
correlations, First of all, unreliability of the 
measures may be shrinking the true relationships. 
This is unlikely in the light of previous relia- 
bility studies on the semantic differentials Os- 
good et al. (1957) reported semantic differential 
reliabilities of .91, Miron (1961) found test- 
retest reliabilities of .86 to 1.00, and Weksel 
(1963) found alpha coefficients on factor scores 
for individual subjects of .80 to .99. There is 
little reason to assume that intensity scale re- 
sponses have lower reliabilities. Thus the “true” 
correlation, the observed correlations corrected 
for attenuation due to unreliability, would not 
change greatly. 

“A second factor that may be affecting the 
observed correlations is response set. The exist- 
ence of a response set would tend to inflate the 
true relationships. Peabody (1961) in his study 
of bipolar scales concluded that the consistency 
of response polarization he observed could more 
readily be explained asa response set than as an 
attitude intensity response. 

As a partial check on the degree of response set 
present, an additional analysis was performed. 
If a high degree of response set were present, 
then correlations run on each subject across his 
item polarity and intensity scores would tend to 


|, , be low. That is, the ratio of explained variance to 


error variance would be smaller within these 
subjects than across these subjects, so that the 
average within-subject correlation should be 
lower than the average across-subject correlation 
if response set were present. However, when 
correlations were obtained for each subject across 
his items, converted to z scores, averaged, and 
converted back to a correlation value the result 
in some cases was a correlation higher than the 
original across-subjects correlation. For the sixth- 
grade sample the across-subjects correlation was 
62 while the within-subjects correlation was .71. 


2The upper and lower 95% confidence intervals 
for the four correlations reported are as follows: 
College students .09-.50; freshmen .35-.51; tenth 
grade .24-.65; and sixth grade .46-.75. 

In a similar study at the University of Ilinois 
(1962 unpublished), M. Garrett obtained a correla- 
tion of 42 between polarization scores and attitude 
intensity scores, 


For the tenth-grade sample the across-subjects 
correlation was .47 while the within-subjects 
correlation was .55. But for the college freshmen 
sample the across-subjects correlation was .43 
while the within-subjects correlation was :40. 

This additional analysis suggests that response 
set was not inflating the obtained across-subject 
correlations. However, there is an indication that 
the magnitude of the relationship between po- 
larity scores and intensity scores increases with a 
corresponding decrease in age of subject. Whether 
this was an age effect or whether other factors 
external to the intensity-polarity relationship 
were operating was not explored in this study. 

Supporting the findings of the present report, 
Peabody (1961) found that almost 80% of the 
variance of bipolar scales could be accounted for 
by the direction of response and only 10% by the 
extremeness or polarity. That is, a bipolar scale 
score is essentially a direction score, not an in- 
tensity score. 

The implications of the present results are 
clear. They support the logical argument that 
there is a basic deficiency in the rationale under- 
lying the semantic differential which seriously 
affects its role as an attitude measuring instru- 
ment. The argument is not that polarity scores 
are unrelated to intensity scores, but that there 
exist logical reasons as well as empirical support 
for not accepting a model which equates attitude 
intensity with polarity. It seems that among 
younger (and perhaps less discriminating) re- 
spondents the magnitude of the correlation tends 
to increase. It should be emphasized that the 
magnitude of the correlation for the sixth-grade 
subjects does not nullify the logical argument 
which renders unacceptable the assumption under- 
lying the measurement model, although the de- 
crease in correlation with age may be revealing 
something about attitude formation among young 
subjects. 

For practical purposes, the deficiency of the 
semantic differential in measuring intensity or 
other components of attitude can be corrected by 
the addition of some independent measure of 
intensity to the differential form. This correction 
is especially important when one is concerned 
with attitude change since it is widely assumed 
that attitude intensity greatly determines atti- 
tude stability and that those individuals most 
intensely committed to an attitude have a greater 
tendency to act in accordance with that attitude. 


REFERENCES 


Fopor, J. Could meaning be an rm? Journal of 
Verbal Learning and Verbal Behavior, in press, 


94 BRIEF ARTICLES 


Katz, J., & Fovor, J. The structure of a semantic 
theory. Language, 1963, 39, 170-210. 

Miron, M. The influence of instruction modification 
upon test-retest reliabilities of the semantic dif- 
ferential. Educational and Psychological Measure- 
ment, 1961, 21, 883-893. 

Oscoop, C. E., Sucr, G. J., & TANNENBAUM, P. H. 
The measurement of meaning. Urbana: Univer. 
Illinois Press, 1957. 

Prasopy, D, Two components in bipolar scales: 


Journal of Personality and Social Psychology 


Direction and extremeness. Psychological Review, Í 
1961, 69, 65-73. i 

Ware, E. Progress report on the personality differ- 
ential. Paper read at Wayne State University, 1961, 

WexseL, W. A multidimensional approach to social 
perception. (Doctoral dissertation, University of 
Illinois) Ann Arbor, Mich.: University Microfilms, Í 
1963, No. 63-5589. 


(Early publication received October 16, 1964) 


1965, Vol. 


2, No. 1, 94-97 


MOTHER NURTURANCE AND GIRLS’ INCIDENTAL 
IMITATIVE LEARNING* 


PAUL H. MUSSEN anp ANN L. PARKER ? 
University of California, Berkeley 


This study, based on earlier research of Bandura and Huston (1961), was 
designed to test the hypothesis that girls with highly nurturant mothers will 
show greater tendencies to imitate their mothers’ incidental task-irrelevant 
behavior than girls with relatively nonnurturant mothers. The Ss were 30 
5-yr.-old girls to whom the Porteus Maze Test and the ITSC were adminis- 
tered. 3-4 wk. later, the girls took these tests again but, in the 2nd session, 
their mothers served as E-models. During this session, while her daughter 
watched, each mother solved the maze and, according to directions from the 
investigators, made certain irrelevant comments and certain marks on the maze 
tracings. Mother’s nurturance was evaluated on the basis of her responses to 
an intensive interview conducted at her home. The results showed that the 
nurtured and nonnurtured daughters did not differ in the extent to which they 
imitated the mother’s task-relevant, problem-solving responses, The hypothesis 
was confirmed, however, the nurtured group showing significantly more 
incidental imitative learning of the mother’s behavior than the other group. 


It has been demonstrated experimentally that 
the child’s incidental imitative learning of a 
model’s behavior increases immediately after 
nurturant interaction between the model and the 
child. There is also considerable evidence from 
correlational studies that a high degree of nur- 
turance toward the child by the same-sex parent 
is conducive to the child’s acquisition of ap- 
propriately sex-typed interests and attitudes 
(Mussen & Distler, 1959; Mussen & Rutherford, 
1963). In these studies, it is assumed that the 
appropriate sex typing is a consequence of identi- 
fication with the parent of the same sex, or in 
behavioral terms, the result of incidental imita- 
tive learning of that parent’s behavior..If this 
assumption is correct, it follows that parents who 


1 This study was supported by the National Insti- 
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facilitating effect on their children’s imitation | 


are generally warm and nurturant will have å, 
Í 
of parental behavior, even in the absence of spe 
cific instruction or of reward for such imitation. 

The research reported in this paper Wa 
designed to test this hypothesis directly. Mothers 4 
served as the experimenter-models, paired with 
their own daughters, the subjects, in a problem 
solving situation involving imitation. 


| 


METHOD | 
Subjects 


The subjects were 30 girls between 5 and 6 yeas 
of age enrolled in two kindergarten classes of a 
predominantly middle-class school. Their mo! 
had volunteered their daughters’ and their ow? 
Participation in the study. Each mother was intet 
viewed and each child was seen for two sessions ; 
one with one of the investigators (ALP) and, $0 P 
the second session, with this investigator and 
mother. 1 

During the first session, the procedure was e55°™ 
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tially like Rosenblith’s (1959, 1961). After being 
seated in the room with the investigator, the child 
was told that she would play a paper and crayon 
game, and she could choose any crayon she wanted 
to draw with from the four (red, orange, green, and 
blue) in front of her. The Porteus Maze Test 
(Porteus, 1950) was then administered, slightly modi- 
fied in instructions and procedure to render them 
more suitable to the age of the children. The test 
consists of a series of mazes, one maze for each year 
from ages 3 to 12, two trials being allowed for each 
maze. Testing is discontinued when the subject fails 
two consecutive age levels. 

Between 3 and 4 weeks after the first session, the 
child was again called from her classroom by the 
investigator, who explained that they would “have 
a second turn at the game.” Before entering the 
room where the testing was conducted, the child 
was told, “Today we have someone else here to 
play our games with us.” Upon entering the room, 
the child met her mother and was asked to sit next 
to her at the table. The mother and child each had 
her own set of crayons, arranged in identical order 
on the table, and they were given identical sets 
of mazes. 

Before she met with the subject in the experi- 
mental room, the mother had been given explicit 
instructions about the procedure to be followed. She 
had been carefully coached to draw slowly and to 
hesitate at the choice points, to make certain com- 
ments before each trial (“hm, hm, let’s see now,” 
and upon starting the maze, “here we go”) and to 
make some irrelevant marks while tracing the maze 
(a loop at any point in her tracing and, a definite 
‘final mark such as @). 

After instructions were given to the pair, the 
mother began by “casually” picking up a crayon of 
the color least frequently selected by the child in 
the first session, using this crayon to trace the first 
maze correctly at a deliberately slow pace. The child 
watched and then was given her own turn on her 
copy of the same maze. The same procedure was 
followed for each of the 10 mazes, two trials being 
allowed for each maze. 

Any improvement in maze-test performance be- 
tween the first and second sessions presumably re- 
sulted from direct imitation of the mother’s 
responses, and thus could be used as a measure 
of the child’s tendency to imitate the mother’s task- 
relevant performance. The major hypothesis of the 
study involved another type of imitation, however— 
incidental imitation learning, that is, duplication 
of the model’s responses that were not relevant to 
solving the problem. This type of imitation was 
scored in terms of the number of times the child 
Picked up a crayon of the same color as the one 
her mother picked, repeated the mother’s irrelevant 
utterances, and made loops or final marks like the 
mother’s in tracing the maze. 


Interviews 


Testing the hypothesis required some basic data 
on child-rearing practices, and more specifically, on 


the degree of the mother’s nurturance of the child. 
It was also essential to determine the extent of 
maternal fostering of dependent behavior in the 
child, for, according to some theories of identifica- 
tion, dependency itself facilitates imitative learning 
(Sears, Maccoby, & Levin, 1957). These data. were 
obtained from maternal interviews conducted at the 
mothers’ homes between the children’s first and 
second sessions. The interviews consisted of 15 open- 
ended questions taken from the Sears et al. inter- 
view schedule. The questions were related to 
maternal warmth and hostility, fostering of depend- 
ency, restrictions and demands on the child, per- 
missiveness, democracy in the home, and child 
centeredness of the home. The interviews were tape 
recorded. Overall ratings of both nurturance and 
dependence were made on the basis of the analysis 
of these interview protocols, These ratings were 
made by the junior author after she had been 
trained in rating comparable interviews and had 
achieved 85% agreement with an experienced rater. 

The distribution of the rating on nurturance was 
dichotomized as nearly as possible to the median, 
and the mothers above the median were consid- 
ered nurturant (V=17), while those below were 
considered nonnurturant (N = 13). 

It should be noted that the ratings of the mother’s 
interaction with the child were made before there 
was any knowledge of whether, or how much, the 
child would imitate the mother in the second session. 
Thus, there was no contamination of maternal rating 
and the child’s tendency to imitate. 


Teacher Ratings 


Further assessment of the children’s dependency 
came from teacher ratings of the children on four 
scales: frequency of attention-getting behavior, tend- 
ency to cling to the teacher, need for praise and 
approval, seeking help with tasks child is capable 
of doing. A rating of 1 on each scale represented 
the lowest degree of dependency, and a rating of 4, 
the highest. The child’s dependency score was the 
sum of his ratings on all four scales, 

It will be recalled that the subjects were in two 
kindergarten classes and had different teachers, Each 
teacher rated only her own pupils, of course, and, 
since the teachers probably had different standards 
of evaluating the children, their ratings had to be 
handled separately. One teacher rated 20 girls (10 
in each group), while the other rated 10 girls 
(7 daughters of nurturant mothers and 3 others). 


RESULTS AND DISCUSSION 


The major hypothesis was tested by comparing 
the imitation scores of two groups of girls, those 
with nurturant and those with nonnurturant 
mothers. The groups did not differ from each 
other in average age, ordinal position, number 
of siblings, or socioeconomic status. Since they 
did not differ significantly in their first session 
maze-test performance, it may be inferred that 
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TABLE 1 


Mean IMITATION Scores or GirLs with HIcHLY 
NURTURANT AND NONNURTURANT MOTHERS 


» Highly a Nore: À 
Variable nurturani nurturani U 
W=in | W=13) 
Imitation of goal-related 
responses (improve- 
ment in maze test 
performance) 5.2 6.4 54.58 
Incidental imitation 17.8 12.9 | 65* 


* In calculating U for this variable, we eliminated the scores 
of six girls who scored 16 or more (highest possible score =20) 
in the first testing, 

* p <.05, 


there were no significant intellectual differences 
between the two groups. 

Since maternal nurturance may foster depend- 
ency in the child, and this in turn, may promote 
imitative behavior, it was important to establish 
that maternal nurturance and child dependency 
were not closely correlated variables in this 
population. Analysis of the interview data re- 
vealed that the highly nurturant mothers in this 
study did not encourage dependency in their 
daughters. On the contrary, interview ratings of 
nurturance and encouragement of dependency, 
were slightly, though significantly, negatively 
correlated (r=—.33, p=.05). Apparently in 
this group, maternal nurturance was accompa- 
nied by some encouragement of independence 
in the child. 

There is also some evidence that this encour- 
agement of independence was, in fact, associated 
with a relatively low level of overt expression 
of dependency. According to teachers’ ratings, 
the daughters of nurturant mothers were signifi- 
cantly less dependent than the other girls. The 
average dependency score, based on the teacher’s 
ratings, of the 10 subjects with nurturant 
mothers was 8.2, the average for the other 10 
was 10.1 (¢=2.2, p<.05). In the other class, 
the 7 subjects with nurturant mothers had an 
average dependency score of 10.5; the other 3 
averaged 12.0 (t= 1.6, p = .10-.15). 

Table 1 shows that the mean scores of the 
two groups on two types of imitation—imitation 
of goal-related responses and incidental imitation 
learning. Since the number of subjects in each 
group was small, and distributions of the scores 
were nonnormal, U tests were used to compare 
rank transformation scores on all these measures 
of subjects in the two groups. The results of 
these tests and their significance are summarized 
in Table 1. 


The table shows that the two groups did not 
differ significantly in imitation of behavior di- 
rectly related to achievement of the goal in the 
maze test. The finding is analogous to the 
findings of Bandura and Huston (1961) that 
the experimenter-model’s nurturance did not | 
significantly affect the children’s tendency to 
imitate his choice (the correct and rewarded 
box) in a discrimination learning task. Children ' 
imitate the model’s operant responses that lead 
to the solution of problems, regardless of the 
degree of the model’s nurturance. Perhaps this 
is attributable to the power of the direct rewards 
involved, extrinsic (e.g., a prize for learning a 
discrimination task), intrinsic (such as the feel- , 
ing of satisfaction derived from successful maze 
tracing), or both. 

As Table 1 shows, the major hypothesis was ‘ 
confirmed. Maternal nurturance was found to — 
be related to the child’s incidental imitation 
learning, that is, to her tendency to match or 
imitate the mother’s behavior that was incidental 
or irrelevant to solving the problem or achieving . 
the goal. These results are clearly consistent with 
the conclusions of Bandura and Huston (1961) 
that 


children display a good deal of social learning of an 
incidental imitative sort, and that nurturance is one 
condition facilitating such imitative learning [p. 316]. 


Either an immediately preceding nurturant inter- 
action between the model and the child or a’ 
long-standing nurturant relationship between the 
two may have this facilitating effect. Thus, if 
the parent is generally nurturant toward the 
child, there is an increased tendency for the child 
to imitate aspects of that parent’s behavior 
spontaneously, that is, in the absence of an 
immediately preceding experience of nurturance 
or of direct tuition or specific rewards for this 
imitation. 

Tt may be inferred that the child’s assumption 
of certain aspects of his parent’s behavior may 
be explained in terms of the principles underlying 
imitation of a model’s incidental behavior, the - 
model’s (or parent’s) secondary reward value, 
and the self-rewards that consequently arise 
from this imitation, The child’s acquisition of 
appropriate sex-typed behavior, which is related 
to nurturance by the same-sex parent (Mussen 
& Distler, 1959; Mussen & Rutherford, 1963) 
may be considered a result of this kind of imita- 
tion that occurs without teaching or direct, im- 
mediate rewards. In short, it may be concluded 
that the data support the notion that 
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e 
the process subsumed under the term “identification” 
may be accounted for in terms of incidental learning, 
that is, learning that apparently takes place in the 
absence of an induced set or intent to learn the 
, “specific behaviors or activities in question... 
, [Bandura & Huston, 1961, p. 311]. 
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IMITATION AND RETENTION OF FILM-MEDIATED 
AGGRESSIVE PEER AND ADULT MODELS* 


DAVID J. HICKS 
Chico State College 


` The relative effect of peer and adult models as transmitters of novel aggressive 
+ responses was investigated, Children viewed either male or female adult or 
male or female peer models presented on film and a test for imitative aggression 
was made. 6 mo. after seeing the films the same children were reobserved in 
order to assess the long-term influence of the models. In addition, a test of 
retention was made after the 6-mo. interval. It was found that the male peer 
had the most immediate influence in shaping children’s aggressive behaviors 
while the adult male had the most lasting effect. A significantly greater number 
of the models’ behaviors were retained after 6 mo. than were performed. 


Research on the influence of film-mediated 
„aggressive models in shaping children’s aggressive 
behavior has utilized adults or cartoons as the 
Stimuli to be observed with the measurement of 
* subsequent aggression closely following exposure 
to a model (Bandura, Ross, & Ross, 1963a, 
1963b; Lovaas, 1961; Mussen & Rutherford, 
1961). Minor attention has been devoted to the 
.* effects of peer models on children’s performance 
. Of aggressive responses while the long-term ef- 
fects of aggressive models on children’s behavior 
has not been studied. The present study was 
aimed at investigating the immediate and post- 
interval effectiveness of film-mediated peers and 
*This investigation was supported in part by 


Research Grant MH-7401 from the National Insti- 
tutes of Health, United States Public Health Service. 


adults as transmitters of aggressive patterns of 
behavior. 


METHOD 
Subjects 


The subjects were 30 boys and 30 girls enrolled in 
the Chico State College Laboratory School and Child 
Development Laboratory. They ranged in age from 
41 to 76 months, with a mean age of 61 months. 
A female experimenter conducted the study for all 
60 children. 


Procedure 


Subjects were assigned to four experimental groups 
and one control group of 12 subjects each. One group 
of experimental subjects observed an adult male 
model while a second, a third, and a fourth group of 
experimental subjects were exposed to a female adult 
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model, a peer male model, and a peer female model, 
respectively. The models were presented to the sub- 
jects by means of 8-minute films projected on a 
plastic lenscreen in a television console. Children 
in the experimental conditions were equally divided 
into male and female subgroups so that half of the 
subjects in each condition were exposed to same-sex 
models, while the remainder viewed models of the 
opposite sex. 

Subjects in the experimental and control groups 
were matched on ratings of the amount of physical 
aggression, verbal aggression, and aggression toward 
inanimate objects as observed prior to experimenta- 
tion by two raters on the staff of the laboratory 
school and child development laboratory who were 
familiar with the behavior of the subjects. In addi- 
tion to sex and preexperimental aggressiveness, 
subjects were also matched on the basis of age. 

The children were brought individually by the 
experimenter to the exposure room. The experimenter 
seated the child at a table placed in front of a 
television console, The experimenter suggested that 
they watch a television program and walked over 
to the set and turned it on, A film containing the 
appropriate model condition appeared and the ex- 
perimenter moved to a chair placed to the side and 
behind the child. From this position the experi- 
menter measured the amount of time the children 
observed the films, The experimenter remained with 
the child during the exposure period. 

The films to which children in the experimental 
conditions were exposed began with a commentator 
introducing the model who was walking about a 
room looking at some toys which were on the floor. 
The toys consisted of a plastic bat, four plastic 
balls, and a pounding board with a mallet, During 
this time an inflated plastic doll was shown and 
introduced by the commentator. The model moved 
to the doll and then looked about the room to the 
toys placed on the floor. The model then picked up 
the plastic bat and started to hit the doll. Following 
the bat aggression the model picked up the plastic 
balls and threw them at the doll. The model started 
striking the doll with the mallet from the pounding- 
board set following the ball-throwing sequence. The 
final aggressive behavior performed by the model 
was to sit on the doll and punch it repeatedly in 
the nose. Each of the distinctive aggressive acts 
was shown for a period of 30 seconds before the 
next behavior in the sequence was presented. Each 
Sequence of four behaviors was repeated three times. 
The model made four verbally aggressive responses 
which were interspersed throughout the sequences of 
physically aggressive acts. The verbal stimuli used 
were “Batter up,” “Smacko,” “Take that,” and 
“Bingo.” The films were edited so the duration of 
the physical acts and the number of verbal Tesponses 
were the same in all the experimental conditions, 

Following exposure to a film subjects were mildly 
frustrated. This was accomplished by taking them 
to another room containing a variety of attractive 
toys which the children were told that they could 
play with. As soon as a subject demonstrated in- 


volvement with the toys the experimenter said th 
the child could no longer play with the materi 
for it had been decided to save them for ol 
children. The children were told that there wa 
toys in another room and that they could play wil 
any of those. The experimenter and subject tha 
entered the experimental room. i 
The experimenter remained in the experimenta 
room with the child. In order to minimize he 
influence on the children’s behavior the experimenta 
remained at a desk in the corner with her back to 
the subject and avoided any contact. { 
Children assigned to the control condition wett 
not shown films, but were mildly frustrated prio 
to entering the experimental room. 


Test for Imitation 


The experimental room contained a variety j 
play materials, some of which could presumably. 
used for imitative or nonimitative aggression 
others which would tend to elicit nonaggressive 
haviors. The aggressive toys consisted of a la 
inflated plastic doll, a mallet and pounding boa 
a plastic bat, four plastic balls, two dart guns, and 
a plastic rifle. The nonaggressive toys included 4 
farm set with animals, two small dolls with clothing 
a tea set, two cars, and a truck. These materials 
were arranged in a fixed order prior to the entra 
of each of the subjects. 

The subject was in the experimental room for 4 
period of 20 minutes during which time his behavior 
was scored by two judges on predetermined respons 
categories. The judges viewed the subject througt 
one-way vision glass in an adjoining observation 
room. The 20-minute period was divided in 5-seco 
intervals by use of a prerecorded tape which sounded 
the interval numbers, This procedure yielded a total 
number of 240 responses for each subject. 3 

A subject’s behavior was scored as an imitativë 
aggressive response when the child struck the dol 
with the bat, threw balls at the doll, hit the doll 
with the mallet, or sat on the doll and punched i 
in the nose. The number of nonimitative aggressivi 
responses and nonaggressive responses wás # 
scored. The degree of interjudge agreement on 
type of behavior performed was calculated with the 
resulting product-moment coefficient being .93. 

; 
f 


Retest Procedure 5 


Approximately 6 months after their exposure n 
a film and the initial observation of behaviors 44 
subjects were reobserved. The mean number of ani 
in this interval was 178 with a range of 166-1 
days. i 

The subjects were brought individually by= tht 
same experimenter to the instigation room. The © tf 
dren were not reexposed to the films. The frustrati 
procedure was identical with that used during vail 
initial phase of the experiment with the excepto! 
that a different set of attractive toys was presentet 
Following the frustration the experimenter reino 
duced the children: to the same experimental a 
which was arranged as it had been previously. A8 


|, the children remained in the room for 20 minutes 

during which time the subjects’ behaviors were 
scored by judges watching through one-way vision 
glass in an adjoining room. During this second 
20-minute period the experimenter remained with 
the children and avoided contact. 


Test for Retention 


. At the end of the second performance period the 
experimenter turned to the children and asked if 
they recalled the television show that they had seen 
together. When the children indicated memory of 
the exposure the experimenter produced a large candy 
- sucker and informed the children that she would 
give them the sucker if they could tell her all the 
things they had seen on the film. The sucker was 
withheld until a child was consistently incorrect or 
. obviously had terminated effort to recall. When a 
child had difficulty with verbalization or ceased to 
». verbalize the experimenter asked the child, “Show 
+me what you saw on the television program.” The 
number of descriptions of an imitative response 
and/or performance demonstrations made by the 
_ children were scored by the judges in the adjoining 
oom, 


TABLE 1 


INITIAL MEAN ĪMITATIVE AGGRESSION SCORES FOR 
EXPERIMENTAL AND CONTROL GROUPS 


Experimental conditions 


Response Control 
catezory | aauit’| Kade | pee | ieee nl = 
male | female male | female 

Imitative 

aggression 9.58 | 16.33 | 22.75 | 13.42 | .00 
Boys 14.00 | 27.50 | 29.83 | 20.33 | .00 

- Girls 5.17 5.17 | 15.67 6.50 | .00 
RESULTS 


» Tablé 1 presents the mean imitation scores 
for children in the four experimental conditions 
, and the control group obtained shortly after 
. exposure to the models. The original scores were 
transformed to the logarithm of the scores plus 
One to make the data amenable to parametric 
Statistical analysis. The transformation was ne- 
Cessitated by the presence of many zero imitation 
Scores found in the control group and used 
“because the increases in group means and vari- 
ances were essentially proportional. 

The results of the analysis of variance based 
On the initial imitation scores are presented in 
Table 2, The findings reveal that exposure of 
children to aggressive models is a relevant ante- 
, cedent in determining the subsequent form of a 
child’s aggressive behavior, The sex of the child 
was shown to be a significant main effect indi- 


| 


BRIEF ARTICLES 99 


TABLE 2 


ANALYSIS OF VARIANCE OF TRANSFORMED INITIAL 
IMITATIVE RESPONSES 


Source af MS Be ce 
Between subjects 11 
Sex (S) 1 2.41 12.78* 
Error (b) 10 1.88 
Within subjects 48 
Treatments (T) 4 9.56 10.54** 
SXT 4 2.02 2.23 
Error (w) 40 123. 
*p<.01. 
Ss. 


cating that the performance of imitative aggres- 
sive behavior is greater for boys than for girls. 

When compared with the control condition 
the adult-male (¢ = 3.79, p < .001), adult-female 
(t=4.61, p~<.001), peer-male (¢=6.24, p 
<.001), and peer-female (¢ = 4.13, p< .001) 
model conditions were all found to be highly 
effective in shaping the children’s aggressive re- 
sponses. The performance level of subjects 
who viewed peer-male models was shown to be 
significantly higher than those of either the 
adult-male (t= 2.45, p< .05) or peer-female 
(t=2.10, p<.05) model conditions. 

The mean imitation scores for the same chil- 
dren in the four experimental conditions and the 
control group 6 months after exposure to the 
models are presented in Table 3. 

Though the results of the analysis of variance 
based on the transformed imitation scores ob- 
tained from the reobservation of subjects failed 
to reveal any significant treatments, sex, or inter- 
action effects, a comparison of pairs of means 
from the performance of subjects during this 
subsequent test was performed. The results 
showed that only exposure to an adult-male 
model remains a relevant antecedent experience 
in shaping the form of children’s aggressive 
behavior (t= 2.49, p <.05). 


TABLE 3 


SUBSEQUENT MEAN IMITATIVE AGGRESSION SCORES 
FOR EXPERIMENTAL AND CONTROL GROUPS 


Experimental conditions 


RERNE f Control 
category `| Adult | Adult | Peer | Peer 
male | female | male | female 
Imitative 
aggression 8.92 | 3.92 6.42 4,33 33 
Boys 7.33 | 7.17 5.17 6.50 67 
Girls 10.50 .67 7.67 2.47 .00 
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In each of the films utilized in the experi- 
mental conditions the model performed four 
distinctive physical aggressive behaviors. In order 
to evaluate further the variety of the models’ 
behaviors which were transmitted to the sub- 
jects an analysis of the number of different 
model-provided behaviors found in the initial 
performances, subsequent performances, and the 
postexperimental retention tests of the experi- 
mental subjects was evaluated by the t-test pro- 
cedure for correlated means. The differences be- 
tween the initial and subsequent performance 
phases of the experiment were significant for 
boys (¢ = 6.39, p<.01) and for girls (¢ = 3.24, 
p< .01) indicating that the number of modeled 
behaviors found in a subject’s performance is 
greatly reduced 6 months after exposure to an 
aggressive model. * 

The evaluation of the differences of the num- 
ber of different model-provided behaviors found 
in the performance and retention of subjects 6 
months after exposure to the films showed that 
the children retained more elements of the 
model’s behavior than they performed. The dif- 
ferences found were significant for boys (¢ 
= 4.30, p < .01) and for girls (t= 4.02, p <.01); 


Discussion 


The results of the present study confirm the 
finding reported by Bandura et al. ( 1963a) that 
exposure to filmed aggression effectively shapes 
the form of children’s aggressive responses. 

In studies by Jakubczak and Walters (1959) 
and Bandura and Kupers (1964) it was found 
that an increased probability of receiving posi- 
tive consequences or preventing negative ones 
when adults rather than peers were modeled 
apparently determined the model choice and 
imitative performance levels of children. When 
aggression is the type of behavior under investi- 
gation the age-status of the model does not 
appear to operate as a discriminative cue which 
alters the amount of imitation. 

The relatively low imitation scores obtained 
by girls could be attributed to either response 
inhibition or inhibition of observational tenden- 
cies. Bandura and Walters (1963) have proposed 
that the probability of occurrence of observing 
responses may increase or decrease depending on 
the observer’s anticipation of positive or nega- 
tive reinforcement. The difference in imitative 
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responses between boys and girls could, then, } 
due to the increased tendency by male subjeth 
to observe what would be a sex-appropriate be 
havior and a decreased predisposition to observ 
a sex-inappropriate behavior on the part 4 
female subjects. A comparison of the observe 
tional responses of boys and girls does not sup: 
port this differential observation hypothesis, Boys 
did not differ significantly from girls in th 
amount of time spent in observing the models 
behavior (¢=.05). An observer’s willingness to 
perform a model’s behavior rather than a willing 
ness to observe would appear: to account for 
the significant difference in imitative response 
between sexes, 

The exposure of children to aggressive films 
appears to remain a relevant antecedent it 
shaping the form of their aggressive responses 
for a considerable length of time. The continuel 
performance of aggressive matching responses i 
most significant when the model is an adult male 
This finding would be of particular importante 
in light of the frequency with which aggressivt 
adult males are presented in movies and tele 
vision. 
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A laboratory experiment was conducted 


SOCIAL COMPARISON AND ORDINAL POSITION 


A. JOHN ARROWOOD anb DONALD M. AMOROSO: 


University of Toronto 


in which groups of Ss were led to 


believe, by means of fictitious distributions of opinion, that they either con- 
formed closely to or deviated markedly from the modal opinion of the group. 


Sociometric and self-report data provided strong support for the following 
hypotheses: (a) first-born conformers reject an opinion deviate more readily 
than do later-born conformers; and (b) first-born deviates change their 
opinions toward the modal group opinion more readily than do later-born 


deviates. These findings were intefpreted 
first borns and later borns. 


‘The concepts of social comparison (Festinger, 
1954) and social reality (Festinger, Schachter, 
& Back, 1950) have been powerful ones in recent 
social psychological theory and research, It has 
been established that, to the extent that objec- 
tive, nonsocial standards of comparison are not 
available, people will evaluate their opinions 


‘(Festinger, Gerard, Hymovitch, Kelley, & Raven, 


1952), their abilities (Hoffman, Festinger, & 
Lawrence, 1954), and even their emotions 
(Schachter & Singer, 1962) by comparison with 4 
those of others, 

It has been argued by Festinger and others 
that a discrepancy of opinion (or ability or 
emotion) in a group gives rise to pressures to 
Uniformity so that members will be better able 
to arrive at subjectively precise evaluations of 
the opinion (or ability or emotion) in question. 
Among the variables proven and postulated to 
be productive of pressures to uniformity in a 
social group have been group cohesiveness, rele- 


"vance of issue, presumed presence or absence of 


an “expert” or of a “correct” answer, and de- 
pendence. It is to some of the implications of this 
last that the present research is directed. 

Three studies provide the immediate frame of 
reference for the experiment to be reported. The 
relevance of these three studies to each other and 


~-to-the present study lies in the concept of de- 
Pendence—that is, “the extent to which the indi- 
‘vidual uses or relies upon other persons as 


Sources of approval, support, help, and reference 
(Schachter, 1959, p. 82].” In the first study, 
Schachter (1951) found that members who devi- 


- ated from the modal opinion in a group tended 


to be rejected—provided that the group was 
Cohesive and discussing an issue of relevance to 
it. Schachter interpreted this finding in terms of 
Pressures to uniformity and dependence. The 


MANOR, at McMaster University, Hamilton, On- 
rio, 
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in terms of differential dependence of 


second study, by Schachter (1959), was con- 
cerned with the sources of gregariousness. In the 
course of this research, a femarkable and con- 
sistent side-finding emerged—to the effect that 
ordinal position of birth was a particularly potent 
discriminator of the relative strength of the af- 
filiative tendency under various conditions of 
anxiety. Schachter interpreted this finding in 
terms of differential dependence of first borns 
and later borns.® In the third study, Hochbaum 
(1954) attempted to specify some of the con- 
formity-inducing forces which act on individual 
members rather than on groups as wholes. He 
found that group members whose self-confidence 
had been increased experimentally were more 
resistant to group influence than were those 
whose self-confidence had been decreased. This, 
too, can be interpreted in terms of dependence. 
These studies suggest that first borns are more 
dependent than are later borns, and therefore 
more susceptible to group pressures to uniform- 
ity than are later borns. It is expected that the 
more dependent a member is on a group, the 
more closely he will conform to the modal 
opinion of that group on relevant matters. Conse- 
quently, he will tend, more readily than will other 
members, to consider as an inappropriate referent 
any person whose opinion is divergent from that 
of the group; he will tend to reject such a 
person. Further, it is expected that if such a 
dependent member finds himself holding an 
opinion which is divergent from the modal 
opinion of a group, he will tend, more readily 
than will other members, to change his opinion 
to conform more closely to that of the group. 
These expectations are summarized in the 
form of two hypotheses: when first borns believe 
themselves to be conformers in a group, they 


2 Following Schachter (1959), we use “first born” 
to refer both to first-born and to only-born children. 
By “later born” we refer to all other birth ranks. 
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reject a deviate more readily than do later-born 
conformers; and when first borns believe them- 
selves to be deviates in a group, they tend, more 
readily than do later-born deviates, to change 
their’ opinions toward the modal opinion of the 
group. 


METHOD 


In the experiment designed to test these two 
hypotheses, an attempt was made to manipulate one 
variable—conformer-deviate status—while taking ad- 
vantage of an ecological variable—ordinal position 
at birth—for analytic purposes. Ninety-two female 
undergraduates, 46 first borns and 46 later borns, 
served as subjects in groups of from four to six 
members each. 

In order to maximize the pressures to uniformity 
acting on group members, instructions were given to 
create the impression of group homogeneity and to 
induce a condition of high cohesiveness. To afford 
this latter, each group was told that they were to 
discuss a problem among themselves, and that the 
group showing the best overall results would be 
selected to discuss a similar problem with a panel 
of authorities on television. A questionnaire filled 
out previously by the students in class facilitated 
the creation of these conditions. 

Subjects first read a condensed case history of a 
juvenile delinquent. They were asked to choose pri- 
vately one of seven graded alternative proposals for 
the rehabilitation of the delinquent, ranging from 
“all love” through “all punishment,” and to indicate, 
again privately, their degree of confidence in their 
choices, 

In order, ostensibly, to facilitate discussion, sub- 
jects were each given a tally purporting to show 
the distribution of opinions in the group. It was by 
means of these distributions that half the subjects 
were led to believe that they were conformers, while 
the other half were led to believe that they were 
deviates, The subjects were then told that the dis- 
cussion would take the form of writing notes to 
one another, rather than verbal form, and appropri- 
ate reasons were given for this procedure. A 15- 
minute period followed during which time the sub- 
jects wrote notes and the experimenter collected 
them from time to time—presumably sorting them 
and delivering them to the subjects specified. Un- 
known to the subjects, however, standard hand- 
written notes had been prepared beforehand, and 
it was these prewritten notes which were delivered. 

After the first 15 minutes of written discussion, the 
experimenter asked for another private census of 
opinion. A second set of fictitious distributions of 
opinion were given out, and a second 15-minute 
period of written discussion ensued. At the end 
of this time, subjects were again asked to indicate 
their opinions and confidence. 

Subjects were then asked to rank order the group 
members for their appearance on the television pro- 
gram, on the grounds that only four students would 
be required to balance off the panel of four authori- 
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ties. This ranking procedure provided a sociometric 
measure of rejection—that is, rejection is defined in 
terms of those members who are excluded from 
representing the group on television. 

Finally, two more questionnaire items were dis- 
tributed to the subjects, The first was designed to, 
obtain data on the subjects’ attraction to the group, 
and read: “If it involved any time and effort on 
your part, how much would you like to have: this 
discussion group meet again?”—the five alternatives 
for which ranged from “like extremely” through 
“dislike.” The second item was to obtain data on 
the effectiveness of the conformer-deviate manipula- 
tion,’ and read: “How great a difference of opinion 
would you say exists between you and the rest 
of the’ group?”—the alternatives for which ranged 
from “no difference” through “extremely great, 
difference.” a 

At this point, the experiment was over, In the 
Postexperimental discussion that followed, the sub- 
jects were advised of the several deceptions perpe- 
trated during the experiment, and the reasons for 
them were discussed at length. The subjects were 
asked not to discuss the experiment with their class- 
mates or other potential participants. As far as can, 
be ascertained, all subjects complied with this final 
request. ~ 


RESULTS 


Before presenting the main data relevant to 
the experimental hypotheses, it would be well 
to note briefly those data which verify inde- 
pendently the effectiveness of the experimental 
manipulations. Of the subjects given the con- 
former treatment, 85% of them reported only 
a negligible difference of opinion between them: 
selves and the rest of the group, while 83% of 
those given the deviate treatment reported 4 
sizable difference of opinion. We can be reason- 
ably confident, then, that the conformers per 
ceived themselves as conformers and the deviates 
as deviates. There was no difference between’ first 
borns and later borns in this respect. 

As far as attraction to the group was con: 
cerned, two-thirds of the conformers and three- 
quarters of the deviates reported a strong desire I 
for the group to meet again. Thus, whether 4 
subject liked or disliked the group appears to be 
independent of whether she was a conformer % 
a deviate. Again, there was no difference betwee 
first and later borns in this respect. 

The first hypothesis under test is that first 
born conformers will reject an opinion deviate 
more readily than will later-born conformers. 
The postdiscussion sociometric choices of ithe 
group members provide an index of rejection: 
Of the 23 first-born conformers, 15 of them 
(65%) rejected the deviate; of the 23 later-bor 
conformers, 9 of them (39%) rejected the devi- 
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ate. This difference is statistically significant 
(p=.04, chi-square, one-tailed). Parentheti- 
cally, there were two experimental groups for 

| -which there were only 4 members. In both these 
“groups the deviate was ranked in fourth place 

“by first-born conformers but not by later-born 
conformers. If these two cases were to be con- 
*sidered instances of rejection, the p value would 

- become .01.° 

There are three sources of data relevant to 

. the second hypothesis—that first-born deviates 
* are more likely to change their opinions toward 
the modal opinion than are later-born deviates. 
° The first, and by far the most direct, test of this 
hypothesis is through a comparison of the num- 
` ber, of first-born and later-born deviates who 
change their opinions toward the modal opinion. 
Of the 23 first-born deviates, 6 of them (26%) 
so changed; of the 23 later-born deviates, 1 of 
them so changed. This difference is statistically 

` significant at the .048 level via the exact test. 

A second source of relevant data is the con- 
‘. fidence with which first borns and later borns 
. hold their opinions as expressed on the first scale 
of opinion and confidence—that is, before the 
conformer-deviate manipulation was carried out. 
Underlying the hypothesis is the assumption that 
| „first borns are more dependent (less confident) 
than are later borns. Therefore, to generate this 
second test it is necessary only to compare all 
first borns and later borns on their initial level 
|` of confidence. The rationale here is simply that 
low initial confidence in one’s opinion is more 
likely to eventuate in opinion change than is high 

. initial confidence—regardless of whether or not 

the time span of the experimental session is 
| > sufficient to show so. The mean initial level of 
confidence of first-born subjects was found to be 
significantly lower than that of later-born sub- 

> jects (t= 2.69, p<.01). 
Finally, indirect confirmation of the hypoth- 

. ‘esis is afforded by the sociometric data, and lies 

in the tendency for deviates to reject themselves. 
. The measure of this tendency is simply the ex- 
tent to which deviates, when given the oppor- 
tunity to choose themselves to represent the 
group on television, exclude themselves—which 


|© For this and all other analyses presented, N is 

» defined as the number of subjects rather than the 
number of experimental groups. Since at no time 
ı during the experiment did the subjects in fact inter- 
_ act, and since, via the prewritten notes, subjects at 
comparable opinion positions in all groups received 
identical stimulus materials at the same times, the 
independent experimental unit to which the treat- 


Ment was applied is the individual rather than the 
group. 


x 
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is an index fully in accord with our specification 
of rejection. Of the 23 first-born deviates, 7 of 
them (30%) rejected themselves; of the 23 
later-born deviates, 2 of them (9%) rejected 
themselves, This difference is statistically sig- 
nificant at the .067 level via the exact test. 


Discussion 


The hypotheses having been confirmed, rather 
than reviewing the data relevant to their con- 
firmation, we shall attempt to suggest briefly 
some of the implications of those data. Con- 
sistent with a number of studies by Festinger 
and his associates in the early 1950s, we have 
shown that the existence of a discrepancy of 
opinion in a group produces pressures to uni- 
formity acting on the group members. This pres- 
sure can manifest itself in at least three ways: 
the individual can change his opinion to conform 
more closely to the modal opinion; he can at- 
tempt to change the opinions of others; or he 
can reject, or redefine the boundaries of the 
group. We have focused on the first and last of 
these. We have noted that the notion of depend- 
ence runs through these earlier studies, and that 
it arises again in Schachter’s recent work on 
anxiety and affiliation. 

Tf it is the case that dependence in this latter 
sense is the same order of phenomenon as de- 
pendence in the group or situational sense, then 
one would reasonably expect to find ordinal 
position differences in behavior in the group 
setting. The present research brings data to bear 
on this expectation. Prior research has demon- 
strated that experimental instructions and topics 
of discussion comparable to those used in this 
study are sufficient to induce a high level of 
cohesiveness and relevance of issue. The present 
study suggests that, with the customary factors 
making for high dependence on the group held 
constant and high, there is a consistent difference 
between the behavior of conformers and deviates, 
consonant with prior research; and within the 
conformer and deviate categories, there is a 
consistent difference between the behavior of 
first borns and later borns. 

That the experimental hypotheses are con- 
firmed suggests that dependence as manipulated 
experimentally and mediated by group cohesive- 
ness and relevance of issue, and dependence as 
brought .to the experimental situation and medi- 
ated by ordinal position of birth have similar, if 
not identical, behavioral implications. Thus, the 
present study demonstrates that dependence may 
be considered a unitary concept, in much the 
same way that Back (1951) has demonstrated 
that cohesiveness is a unitary concept. 
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SELF-CONCEPT, DOGMATISM, AND TOLERANCE OF 
TRAIT INCONSISTENCY 2 


DAVID FOULKES ? anp SUSAN HEAXT FOULKES 
University of Chicago and National Opinion Research Center 


A series of impression formation problems was given to a group of 62 college 
freshmen women, 2 hypotheses were tested: (a) that self-descriptions for a 


particular trait would be systematically related to reactions to inconsistency in 


as measured by the Rokeach Dogmatism Scale, would also be related to 


information presented along that trait dimension ; and (b) that dogmatism, | 
| 


tolerance of such inconsistency, Both hypotheses were partially confirmed, 
even when attempts were made to control such extraneous variables as intelli- i 


gence and general self-concept. 


Perhaps the most striking fact about impres- 
sion formation is not that first impressions gen- 
erally prove resistant to later modification 
(Asch, 1952), but rather that the same evidence 
leads different persons to different initial im- 
pressions or to different modifications of pre- 
viously formed impressions. Such divergent 
conceptual responses to constant stimuli would 
be puzzling if the only function of impression 
formation (or categorization in general) were to 
sort out objective stimulus properties and 
thereby facilitate efficient instrumental behavior 
toward stimulus objects, Psychoanalytic theory, 
however, has stressed that interpersonal cognition 
and interpersonal behavior may also serve the 
function of defense, that is, we perceive and act 
in ways which are thought to be likely to mini- 
mize threats to our own self-concept and present 
pattern of personality integration. 


1 This study was supported, in part, by the So- 
cial Science Research Council of Lawrence College, 
at which place the data were collected. 

? Now at the University of Wyoming, 


| 


Experimental research (see, e.g., Bruner, 1958) l 
has energetically, if often inconclusively, inves 
gated personal determinants of the ability 1 
identify accurately and describe nonhuman, bill , 
social, stimulus objects such as printed words, 
coins, etc., and the concept of defense has some: 
times been explicitly adopted in these studies 
(e.g., McGinnies, 1949). Interestingly a 
however, defense methodology and theory have 
not made much of an impact upon research My 
the area of impression formation, that 1s, 
study of the categorization and evaluation of hW 
man beings and their behavior. Although it ® 
known that human judges often project theif 


„own attributes into others (Bender & Hastoth 


1953), analyses of the interaction of projection , 
or other possible defensive mechanisms Wil! | 
either judge or stimulus-object variables hav! 

been lacking. In fact, there is a dearth of infot 
mation in general as to how self-conceptions a” 
other conceptual habits are related to perform ; 
ance in impression-formation tasks. i 


In the present study, we examine two hypotl . 


‘ 
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eses concerning such personal determinants of the 
impressions which are formed of others: trait 
self-conceptions have selective effects upon trait- 
relevant cognitions of others (Foulkes & Heaxt, 
1962) and a generalized trait of inability to 

. synthesize discrepant information (Rokeach, 
1960) is related to intolerance of trait incon- 
sistencies in others (Steiner, 1954; Steiner & 
Jolinson, 1963). 

In each of a series of impression-formation 
problems the subject was given a number of 
descriptions of behavioral episodes in the life of 
a person and was asked, following the presenta- 
tion of each such episode, to indicate, via rating 
scales, his impression of the person. Toward the 
end of each problem there was a reversal in the 
nature of the information given to the subject. 
If the initial items suggested shyness, for ex- 
ample, later items suggested that the person was 
not at all shy or reserved in his interpersonal 
relationships. 

It was predicted that different reactions to 

. information reversal for a particular trait would 
be a function of differences in self-description 
for that trait. It was expected that subjects with 
relatively low self-evaluations on socially desir- 
able traits would, by virtue of ego-defense mech- 
anisms necessitated by this low self-evaluation, 
react differently to changes in trait-relevant cues 
in others than would subjects with relatively 
-high self-evaluations on the trait in question, We 
had in mind, for instance, the hypothetical case 
of a person of limited self-assertiveness who 
becomes, on this account, highly sensitized to 
signs of assertiveness or nonassertiveness in 
others, and who either magnifies or minimizes the 
importance of such signs in order to defend his 
own self-concept. To test our prediction we 
compared reactions of self-described high- and 
low-shy subjects to reversals of trait information 
along an outgoingness-shyness dimension and 
reactions of self-described high- and low-insensi- 
tive subjects to reversals of trait information 

__along a sensitivity-insensitivity dimension. These 
“Particular traits were chosen because they seemed 
to be relatively central aspects of self-concept 
among college women, from which group our 
sample was drawn, and because they seemed to 
be conceptually (and subsequently proved to be 
in fact) independent of each other, thus afford- 
ing two separate tests of our hypothesis. 

It was further predicted that scores on Rok- 
each’s (1960) Dogmatism (D) Scale would be 
related to reactions to information reversal on 
all of the experimental problems. It was expected 
that highly dogmatic subjects would find such 

trait inconsistency” as threatening and frus- 
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trating as other ambiguous features of the social 
world, and thus would more frequently construct 
consistent impressions of the inconsistent per- 
sons than would low-dogmatism subjects. 


PROCEDURE 
Materials 


Five impression formation problems were em- 
ployed. Each problem was presented in a booklet 
containing 15 brief descriptions of behavioral epi- 
sodes from the life of a hypothetical person and 15 
sets of rating scales upon which cumulative impres- 
sions were to be recorded, In four of these five 
problems the first 9 items contained information 
designed to place the person being rated at one end 
of a specified rating continuum, while the last 6 
items contained information designed to place the 
ratee at the other end of the same continuum, The 
remaining problem was a buffer in that it contained 
no self-contradictory information and in that it 
was constructed without reference to any of the 
rating scales used in the study. To minimize aware- 
ness of the purpose of the test booklets, a number of 
items were constructed so as to be neutral with 
respect to the trait dimension being manipulated. 
Only Items 1, 3, 4, 8, and 9 were deliberately con- 
structed to create a particular impression, and only 
Items 10 and 12 were deliberately constructed to 
challenge that impression. 

The two dimensions of personal description chosen 
for experimental manipulation were sensitivity to 
others versus insensitivity to others and shyness 
versus outgoingness. In two of the experimental 
problems, information was varied so as to produce 
discrepant information concerning sensitivity-insensi- 
tivity and in two of the problems information was 
varied so as to produce discrepant information con- 
cerning shyness-outgoingness, The characteristics of 
each of the five problems, and the order in which 
they were administered to all subjects, are indicated 
below: 


Items Items 
Order Ratee 1-9 10-15 
i oan sensitive egy 
ancy y outgoing 
3 Ruth (buffer) 
4 Kath; insensitive sensitive 
5 if outgoing shy 


To minimize awareness of the dimensions being 
manipulated, rating scales were included for three 
nonmanipulated dimensions (emotional versus calm, 
unintelligent versus intelligent, and immature versus 
mature) as well as for the two manipulated dimen- 
sions. All five scales consisted of 7 points (1-7), 
with higher values associated with the more socially 
desirable of the two polar terms defining the scale. 

The following sample of items from the Joan 
problem will perhaps indicate most clearly the 
nature of the problem materials: 


1. Beth, a girl not well-liked by other girls in her 
dorm, stands in the doorway of Joan’s room. Joan 
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is at her desk, working. Beth says: I’m having 
such a terrible time with my paper, and no one 
seems to have any time to help me. 

Joan turns around to look at Beth, at first 

thinking she is too busy to help her. Seeing 
Beth’s look of anguish, however, she says: Why 
don’t you come in? Maybe I can help you. [Sensi- 
tive to others] 
2. The alarm rings at 7:30. Joan, with an eight 
o’clock class, turns over, shuts off the alarm, and 
falls back to sleep. She awakens several minutes 
later, jumps out of bed, and gets dressed. Finish- 
ing dressing at 7:50, she surveys the room, which 
is something of a mess, and decides that she'll 
clean it later. She picks up her notebook and 
hurries off to class. [Neutral] 

10. A white towel is on Linda’s door [local sign, 
meaning: do not disturb]. Joan knocks on the 
door. 

Linda: Oh hi, Joan, can I help you with some- 
thing? 

Joan; What are you doing? Studying? Well, I 
just wanted to ask you about that boy who’s in 
your French class, you know the one you men- 
tioned yesterday? 

Linda: Oh yeah, ah... (Moves feet around 
somewhat nervously, still holding on to the door), 
Will it take long? 

Joan: I know you're trying to study, but it 
shouldn’t take too long, I don’t think, Can I 
come in? [Discrepant: Insensitive to others] 


The problems were constructed in such a way as to 
facilitate identification for subjects; thus all five 
ratees were female, and item content was taken 
from the campus setting in which the study was 
conducted. 

Several pilot studies were run to pretest the im- 
pression-formation materials. In one such study it 
was established, through separate administrations of 
the two sections of each booklet, that these sections 
had the directional properties which we assumed 
that they had. In a second study, a small group of 
subjects took all five problems in their complete 
form, and then answered an “awareness of purpose” 
questionnaire, constructed in a funnel sequence 
(Kahn & Cannell, 1957). While there was, expectedly, 
some awareness of the contradictory nature of some 


TABLE 1 
MEAN RATINGS For Ires 9 Ann 12 


Item 9 Item 12 
n Problem 
M SD M SD 
62 Joan 6.61 61 2.55 ‘1.39 
61" | Kathy 1,16 45 4.84 1.57 
62 Carol 6.19 88 3.68 1.56 
62 Nancy 1,84 94 5.16 1.38 


* One subject received a faulty booklet for the Kathy prob- 
He She has, therefore, been excluded from all analyses of this 
problem, 
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of the portrait episodes, no subject indicated any- 
thing approaching a full awareness of the design of 
the problems or the purpose of the experiment. Our 
experience with the pilot subjects did indicate that 
written instructions were insufficient to get subjects 
to give cumulative ratings; hence, in the experi- 
ment proper, the administrator emphasized this point 
in additional, oral instructions. 

Following the completion of the five impression- 
formation booklets, subjects were given a variant 
of the Chicago Q sort (Corsini, 1956) and asked to 
perform, first, a self-rating, and then, an ideal-self- 
rating. Finally, subjects completed Form E of Rok- 
each’s, D Scale. All tests were administered at one 
sitting, with subjects generally taking from 75 to 
105 minutes to complete the test battery. 


Subjects 


Subjects were 62 freshman women volunteers who 
were paid $1.00 for their services. None of these 
subjects had served in the authors’ previous study 
(Foulkes & Heaxt, 1962), and most of them had 
had no prior contact with the Psychology Depart- 
ment, 


RESULTS 
Impressions: Group Data 


Mean ratings for Items 9 and 12, giving, re- 
spectively, the initially formed impression for 
the manipulated trait and the impression follow- 
ing two items describing discrepant behavior for 
that trait are presented in Table 1. From these 
results it may be concluded that the impression 
booklets produced the overall effects which were 
desired, namely: initial extreme placement with 
minimum variability and shifts, subsequent to 
trait-information reversal, to more moderate 
scale positions with greater variability in scale 
placements. The degree of variability of reac- 
tions to discrepant information, that is, for Item 
12 scalings, makes possible the demonstration of 
individual difference factors such as those speci- 
fied in the experimental hypotheses. 


Initial Impressions: Relation to Self-Concept 


and Dogmatism y 


The initial series of items seemed sufficiently 
extensive and unequivocal that little disagree- 
ment was anticipated as to placement on the 
manipulated scale by the time that the end of the 
series was reached. Item 9 variability scores tend 
to support this expectation. Since major interest 
attaches to shifts between Item 9 and Item 12 
scale placements, however, it will be well to 
examine whether groups whose shift scores are 
to be compared were comparable in their Item 
9 placements. That is, if major interest attaches 
to the manner in which discrepant information 1s 


BRIEF ARTICLES 


107 


3 TABLE 2 TABLE 3 
> Irem 9 Scores: SELF-RATING GROUPS Trem 9 Scores: DOGMATISM GROUPS 
ee 7 Lowin Highin Low in dogmatism | High in dogmatism| 
d manipulated trait | manipulated trait 
‘Problem t Problem “4 
$ AES, ee Item 9 n Item 9 n 
em n m n 
* "Foan | 67s | 32] Soar am Merete e e | goo” |\'3n% | A 
5 Kathy 1.09 32 1.24 29 1.25 Coat 6.03 31 635 31 145 
Carol 6.13 | 31 6.26 | 31 59 Nanc 177 | 31 190 | 31 54 
Nancy | 1.61 | 31 | 206 | 31 | 1.96" sl ; s 
Eg *p=.05, 


handled, then we must have some assurance that 
' . the initial impressions for the comparison groups 
were similar. 

Comparison groups came from dichotomizing, 
as close to the midpoint as possible, the distribu- 
tion of self-ratings for insensitivity and shyness 
and the D Scale distribution. Item 9 scores for 
the self-rating comparison groups are presented 

„in Table 2, along with an evaluation of the ob- 
served differences. Item 9 scores for the dogma- 
tism comparison groups, and evaluations of their 
differences, are given in Table 3. 

It will be observed that, in absolute terms, the 

. differences between comparison groups are not 
large. However, in several cases they attain or 
approach accepted levels of statistical signifi- 

cance. This means that controls for initial im- 
pression scores will have to be instituted in later 
shift-score analyses, since, other things equal, a 
group with more extreme initial impressions has 

„ 4 wider possible latitude for later shifts in im- 


pression. Only in this manner may we be sure 
that we are dealing with differences in “defend- 
ing” initial impressions as well as with differences 
in arriving at them in the first place. 

It is interesting to note that, with self-concept 
comparison groups, in three of four cases (includ- 
ing both which approach statistical significance), 
it is the group describing itself as low in a 
socially undesirable trait which takes the more 
extreme position. A somewhat unexpected find- 
ing is that the group with low dogmatism scores 
had a more extreme average scale placement in 
three of the four problems (including the one 
showing a statistically reliable difference) than 
did the group high in dogmatism. 


Shift Scores: Relation to Self-Concept 


Table 4 gives the distribution of shift scores 
(the absolute difference between Item 9 and 
Item 12 ratings) for subjects classified as high 
or low in self-ratings on the negative term of 


TABLE 4 
SHIFT SCORES AND SELF-RATING FOR MANIPULATED TRAIT 


Change 
Self-rating Joan Kathy 
j Maj Modi Mi 
Mais | Mee OED eo GDS (D 
N Low insensitive 18 9 5 12 14 6 
igh insensitive 7 13 10 6 10 13 
; AAY bid x = 5.22* 
Carol Nancy 
(4-6) (2-3) on (5-6) (3-4) (0-2) 
Low shy 7 13 11 11 12 8 
igh shy 9 12 10 7 12 12 
xt = 33 x? = 1.68 
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the manipulated scale dimension. The change 
score distribution was divided into three groups, 
labeled “major” changes, “moderate” changes, 
and “minor” changes. This division was accom- 
plished in that manner which yielded the closest 
possible approximation to equal-sized groups. 

The results provide support for the hypothesis 
that self-attribution for insensitivity to others is 
related to the handling of discrepant information 
for that trait, Major shifts from initial impres- 
sions were more likely to be shown by subjects 
low in self-described insensitivity, while minor 
shifts were more often made by subjects high 
in self-described insensitivity. The results of the 
“shy” groups, however, failed to confirm the 
hypothesis relating self-attribution to shift 
scores. 

Before the results for the Joan and Kathy 
problems are interpreted purely in terms of 
shifts in the face of discrepant information, it 
will be well to recall that for both of these prob- 
lems the low-insensitive group had a more ex- 
treme average score on Item 9, The greater 
incidence of major shifts for low-insensitive sub- 
jects might be interpreted, therefore, as a mere 
reflection of the fact that their initial scores made 
possible a greater range of subsequent variation. 
But, when initial impression scores are held con- 
stant, as through a consideration of only those 
subjects who gave the modal Item 9 score, dif- 
ferences between high- and low-insensitive sub- 
jects persist. For example, major change scores 
in the Joan problem were observed for 64% of 
25 low-insensitive subjects with an Item 9 scale 
placement of 7, while only 35.3% of 17 high- 
insensitive subjects with the same Item 9 score 
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showed major shifts from Item 9 to Item 12, 
Since no significant Item 9 difference between 
high- and low-insensitive subjects was observed 
in the Kathy problem, it is less likely that Item 
9 differences have contaminated these shift-score 
results. And, even here, for example, 41.4% of 
29 low-insensitive subjects with Item 9 ratings 
of 1 showed major shifts, while only 25% of 24 
high-insensitive subjects with the same Item 9 
score had such shifts. 

The fact that significant results were obtained 
only with self-descriptions along the sensitivity- 
insensitivity dimension raises a further question. 
This dimension refers to a general social-per- 
ceptual trait, and the significant differences with 
this classification may reflect a social-perceptual 
tendency of general application, rather than any 
interaction between this trait and particular 
problems in which it has been manipulated. It 
should be noted, therefore, that significant chi- 
square values were not obtained when the change 
scores of high- and low-insensitive subjects were 
compared on the trait-irrelevant problems (Carol 
and Nancy). 

Tf there is specificity on the dependent variable 
side, it may still be asked whether we are dealing 
with specific ratings on insensitivity, or with 
something more general which they reflect. For 
instance, insensitivity to others is obviously a 
socially undesirable trait. Perhaps subjects who 
rate themselves high in socially undesirable 
traits, including insensitivity to others, react 
differently to discrepant information than do 
subjects who feel they are lacking in socially 
undesirable traits. But shyness is also judged to 
be socially undesirable. Also, subjects who feel 


TABLE 5 
Suter SCORES AND DOCMATISM 


Change 
Joan Kathy 
Moderate Minor Major Moderate Minor 
8 11 11 8 11 
14 $ 4 7 16 
x? = 4.94 x? = 3,99 
Carol Nancy 
7 13 10 11 10 
| 18 A 8 8 13 10 
x? = 8.29 x? = 38 


f 
; 


¥ 
r 


y 


many socially undesirable traits 
likely, be dissatisfied with themselves 
this in low self-ideal-correlations on 
But a comparison of the shift scores 
low (dichotomy at the median) r 
ts failed to produce any chi-square 
hing statistical significance. 


Relation to Dogmatism 


5 gives the distribution of shift scores 
ts classified as high or low in dogma- 
e results in all but the Nancy problem 
subjects high in dogmatism show 
conceptual changes or very minimal 
faced with discrepant information. 
, interpretation of the shift scores 
complicated by differences in Item 9 
n the high and low D groups (al- 
differences generally failed to at- 
al significance). But, again, an 
f subjects with identical Item 9 scores, 
the modal response category, suggest 
are genuine differences in shifting, even 
tial scale placement held constant. In 
problem, for example, 41.7% of 24 
ubjects with Item 9 scores of 7 had 
“shift scores, while only 22.2% of 18 
Subjects with the same Item 9 score 
omparable shifts. In the Carol problem, 
17 low D subjects with an Item 9 
f 6 had moderate shifts while the com- 
‘figure for 10 high D subjects with the 
m 9 score was 20%. In the Kathy 
ñ, 50% of 28 low D subjects with Item 9 
‘1 had moderate change scores while 
of the 25 high D subjects with the 
9 placement had such change scores. 


Relation to Intelligence 


attitudinal or personality variables are 
d to performances on cognitive types 
S, there is always the risk that these 
‘are being confounded with intellective 
emed wise, therefore, to consider the 
y that differences in intelligence might 
results which have been attributed 
ifferences in self-concept and dogma- 
l Mathematical and Verbal scores were 
for all subjects. These score distribu- 
dichotomized at the median, and 3 X 
tables set up, casting SAT scores 
ft scores, The same form of analysis 
led significant results when self- 
and dogmatism were variables failed in 
“cases to produce a chi-square value 
hed statistical significance. 
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Dogmatism: Related Variables 


The data collected in this study make possible 
the examination of some of the correlates of D 
Scale scores. The following Spearman rank-order 
correlation coefficients come from the total sám- 
ple of 62 subjects: r for D Scale and r(S-I) = 
—.12; for D Scale and SAT-M=—.18; and 
for D Scale and SAT-V = — .30. The correlation 
with SAT Verbal scores is significant at the .025 
level, two-tailed test. The value of this correla- 
tion is considerably larger than that Rokeach 
(1960, p. 190) has reported between D Scale 
scores and intelligence test scores, Although 
insignificant statistically, the direction of the 
correlation between D and r(S-I) is interesting— 
high dogmatism subjects were more likely to 
admit self-ideal-discrepancies than were low dog- 
matism subjects. 


Discussion 


The hypothesis of trait-specific self-conceptual 
influence upon impression formation was con- 
firmed in part by the results. Some tendency 
was noted for subjects who felt they possessed a 
socially undesirable trait to make less extreme 
evaluations along that dimension than did per- 
sons who felt themselves to be relatively lacking 
in that trait. The basic confirmative finding, how- 
ever, was that self-described insensitive subjects 
reacted much differently to discrepant informa- 
tion along the sensitivity-insensitivity dimension 
than did self-described noninsensitive subjects. 
The initial impressions of “insensitives” tended 
to be more resistant to major change when dis- 
crepant information was later presented along this 
dimension, On the other hand, “sensitives” and 
“insensitives” did not react differently to dis- 
crepant information along the outgoing-shy di- 
mension, Thus the difference between these 
groups on the sensitive-insensitive problems is 
not attributable to accurate self-perceptions of a 
generalized sensitivity to others, but rather to a 
specific interaction between the perception of 
this trait in the self and reactions to indices of 
its presence in others. 

The self-descriptions of “insensitives” indicate 
variance from the culturally valued (especially 
for women) trait of interpersonal sensitivity. 
Their resistance to changing initial impressions 
came in a situation in which the experimental 
materials presented relatively striking shifts in 
behavior along the manipulated dimension. Thus 
there is an element of denial of trait change in 
the reactions of a group whose members have 
uncomplimentary evaluations of their own status 
on this socially desirable trait. Perhaps the ad- 
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mission of the possibility of major changes 
along a dimension in which a person is himself 
insecure would prove too threatening to the 
rationalizations which justify his own position on 
thgt dimension. In any event, an interpretation 
in terms of some form of defense seems plausible 
since “insensitives” are not generally insensitive, 
only where their own self-perceived imperfection 
is manipulated in others, and since their response 
to this manipulation is a denial of change in the 
face of clear-cut evidence that such change has 
occurred, 

This line of reasoning fails, however, to ex- 
plain the lack of any significant difference be- 
tween low- and high-shy subjects in responding to 
the manipulation of shyness cues in others. It is 
possible that insensitivity was a more central 
aspect of self-concept for our subjects than was 
shyness and hence it alone provoked self- 
defensive reactions. Or, it may be that the sensi- 
tivity dimension functions uniquely, even among 
central aspects of self-concept. Only future 
research can resolve this problem. The results 
with the sensitivity dimension seem promising 
enough to justify both such extended experimen- 
tation upon the interaction of trait-specific self- 
perceptions with reactions to cues of this trait 
in others and increasing attention to the self- 
defensive functions which may be served by the 
impressions which we form of others. 

With respect to the hypothesis that the 
generalized conceptual trait of personal-social 
dogmatism would be negatively related to “tol- 
erance of trait inconsistency” in impression- 
formation problems, the evidence was basically 
confirmative, In the face of newly received, dis- 
crepant information, highly dogmatic subjects 
tended to avoid compromise solutions. They 
either changed greatly or adhered very closely 
to their original impressions. These findings may 
put in a new perspective the recent findings by 
Steiner and Johnson (1963) that high scorers on 
the California F Scale (Adorno, Frenkel-Bruns- 
wik, Levinson, & Sanford, 1950) prove resistant 
to changing initially formed impressions when 
they receive moderate reversals in information 
about the object of their impressions. When high 
Scorers on an admittedly imperfectly related 
scale, the D Scale, are presented with striking 
reversals in the information they receive about 
the object of their impressions, their resolution 
is either resistance to changing the initial im- 
Pression or a complete shift from their initial 
impression to acceptance of the newly received 
discrepant information—black or white, but not 
grey. One question obviously Tequiring further 
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g 
exploration is why some high D subjects choose | 
the “black” resolution while others choose {i 
“white” resolution. Once again, the need seem 
clear for further research upon personal, as 
as stimulus, determinants of impression fot mas | 
tion and for greater attention to the function 
which the impressions we form of others ma 
serve for the maintenance of stability within our 
own conceptualizations of self and world. 

Finally, it should be noted that there are sev 
eral indications in the results that the properti 
of the D Scale and the construct(s) it assesses 
are still not very well understood. For examp e, 
it was found that subjects low in dogmatis 
tended to form more extreme initial impressioni 
than did subjects high in dogmatism. Also, the 
correlation with SAT Verbal scores indicates 
that, at least in some populations, dogmai 
may not be as independent of intellective va 
ables as has been assumed. And, the direction of 
the nonsignificant correlation between the D 
Scale and r(S-I) is not what dogmatism theo 
might lead one to expect. 
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49 ist-grade boys were variously exposed to social deprivation or social 


satiation, and were subsequently either socially reinforced or nonsocially rein- 
os forced. In a probability learning paradigm, only those Ss who had been socially 
os deprived and then socially reinforced matched input or maximized. Results 
are interpreted in terms of social-drive theory. Implications for the development 

of self-mediated reinforcement are discussed. 


It has been commonly observed that young 
children are more sensitive to direct social rein- 
‘forcement—a smile or verbal encouragement— 
than they are to impersonal abstract reinforcers 
whose effects greatly depend upon self-mediating 
cognitive processes. Several theoretical treat- 
ments which have. been concerned with the 


® problem of internalized self-reinforcing patterns 


of behavior (Hill, 1960; Mowrer, 1960, Ch. 10; 
Sears, 1957) have incorporated this observation. 
These theories all point to the initial dependence 
of young children upon direct social reinforce- 
ment and, with increasing age, to their greater 
reliance on mediating cognitive processes. 


jr Until recently, this progression from depend- 


ence upon external social reinforcers to self- 


` . reinforcement, which is more contingent upon 


own responses, lacked empirical test. Recently, 
however, Lewis, Wall, and Aronfreed (1963) 
demonstrated experimentally the relative effec- 


nce 1Data for these studies were collected by the 
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tiveness of these qualitatively different rein- 
forcing events at different age levels. Specifically, 
the authors found that for 6-year-old boys, social 
reinforcement was more effective in probability 
learning than was nonsocial reinforcement, while 
among 1l-year-olds, no difference obtained 
between the two reinforcement conditions. 

The present study examines this process in 
greater detail. In the first place, pilot studies 
by the authors and their colleagues, which 
followed carefully the experimental protocols 
employed by Lewis et al. (1963), failed to 
replicate their findings on either female or male 
populations. This failure to confirm led to a 
careful inspection of the relationships estab- 
lished between the experimenter and the subject 
in the Lewis et al. study and in our experi- 
ments. Our attention was drawn to a difference 
in preexperimental conditions between these 
studies. Specifically, Lewis et al. established a 
taciturn preexperimental relationship with the 
subject; the experimenter was a stranger who, 
after a brief introduction to the class, led each 
subject to the experimental room. The experi- 
menter did not attempt to establish a warm rela- 
tionship with the subject on the way to the 
experimental room, but rather maintained a pre- 
experimental environment best characterized as 
aloof. In our pilot studies quite the opposite 
environment existed. Prior to warmly introducing 
the experimenter to the class, the teacher in- 
formed the subjects reassuringly that they would 
be playing a game, that it would be fun, and 
that no one would do poorly or be marked ac- 
cording to his performance. The experimenter 
maintained the reassuring quality of the pre- 
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experimental environment by chatting with the 
subject on the way to the experimental room, 
addressing the subject by name, and inquiring 
into his interests. 

These differences in preexperimental environ- 
ment are strongly reminiscent of the studies by 
Gewirtz and Baer (1958a, 1958b) in which sub- 
jects who were relatively deprived of the pres- 
ence of an adult were subsequently more re- 
sponsive to reinforcers administered by that 
adult than were subjects who had been pre- 
viously satiated with such reinforcers. True, the 
period of deprivation employed by Gewirtz and 
Baer was considerably longer (20 minutes) than 
it was in these studies, where approximately 2-3 
minutes elapsed between the introduction of the 
subject to the experimenter and the time the 
actual experiment began. But the conditions were 
analogous enough to suggest that even brief inter- 
action with a taciturn adult might have effects 
similar to the lengthier social deprivation. The 
analogy between social deprivation and social 
aloofness was buttressed by Gewirtz, Baer, and 
Roth (1958) who noted the similar effects of low 
social availability of an adult (defined as an 
adult present but paying no attention to the 
child’s behavior) and the 20-minute social 
deprivation period, 

In view of the above, the following hypothesis 
was suggested: the relative effectiveness of social 
over nonsocial reinforcement found by Lewis et 
al. (1963) is contingent upon the exposure of 
these children to a period of social deprivation, 
since social deprivation is antecedent to the 
need for social reinforcement. Under conditions 
of social satiation, however, no differences in the 
effectiveness of the reinforcer will be apparent. 
The hypothesis was tested with a male population 
in order to provide a replication of the Lewis 
et al. experiment, 

In addition, the experiment served to answer 
the following question: Does a subject who has 
been socially deprived respond subsequently only 
to social reinforcers, or will a nonsocial reinforcer 
have the same effect? 


METHOD 
Subjects 


Forty-nine boys from two middle-class elementary 
public schools participated in this experiment. Sub- 
jects from three first-grade classes in one school 
experienced a period of social satiation ‘and were 
then rewarded either with social reinforcement 
(10 subjects) or nonsocial reinforcement (12 sub- 
jects). In the second school, subjects from four 
first-grade classes experienced a period of social 
deprivation and were subsequently rewarded with 
social reinforcement (17 subjects) or nonsocial 
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reinforcement (10 subjects). Six subjects alternated 
responses for more than 75 trials. Their data were 
excluded because it had been previously determined 
that such alternation reflected subjects’ failure to 
understand the experimental task. Data for an N 
additional subject who failed to complete all of the 
training trials were also excluded. 


“ewes T 


Apparatus j! 


The apparatus, identical to that used by Lewis et 
al. (1963), consisted of a mechanism with which } 
each subject “played,” and a monitoring device on f 
which the experimenter could follow the course of | 
the game. The mechanism was a black box (7 X 12 
X 7 inches) with two toggle-type, automatic-retun 
switches along the lower edge of the front panel. A 
large red light, which could be operated from the i 
monitor, was centered on the front panel. wi 
monitor also activated a signal buzzer inside the } 
subject’s machine. A 


Procedure 


Social deprivation. Subjects were given no infor- 
mation about the experiment from their teacher. 
Rather, the teacher simply asked the subject to “go 
with this man” when the experimenter appeared in 
the classroom. During the walk to the experimental 
room, which took about 3 minutes, the experi- 
menter engaged in minimal conversation with the 
subject. 

Social satiation, The teacher told the subjects that # 
they would be playing a “fun” game in which no I 
one could do poorly. She warmly introduced the l 
experimenter to the class. On the way to the exper- 
mental room, the experimenter maintained this 
reassuring atmosphere by chatting with the subject | 
in a quite friendly fashion, addressing the subject 
by name and inquiring into his interests. p 

Once in the experimental room, the procedure was 
identical to that employed in the Lewis et al. (1963) 
study. The subject was told that he would be f 
playing a game in which he was to press one of the 
two levers each time the buzzer sounded. The instruc- 
tions implied that the object of the game was to 
try to produce the reinforcing event, but not that 
there would always be a correct lever on every 
trial. Reinforcement, therefore, was contingent upon 
the subject’s response. A 

Prior to the training trials, the subject was Pré 
sented with four sample trials for which he was 
instructed to press the left lever for the first al 
trials and the right lever for the last two 
Reinforcement was administered on the first and 
last trials, The subject then began the series of 150 
training trials. Events were randomized within each 
block of 10 trials, with the probabilities of the 
left and right levers being correct in the ratio of 
70:30, respectively. During the training period, 4 
second experimenter, hidden entirely from the sub- 
ject, monitored and recorded the subject’s responses: 
Subjects were rewarded with one of the two rein- 
forcers: 
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al reinforcement—The experimenter, seated 
the subject during the entire series of trials, 
either, “Good” or “Fine” whenever the sub- 
Tesponse was correct. The light was not used. 
ocial reinforcement—After the four sample 
i, the experimenter told the subject that for the 
inder of the game the subject was to respond 
e heard the buzzer and the red light would 
to the subject whether his response was 
The experimenter then left the room, ex- 
that he had some other work to do and 
Would return shortly. 
> 


RESULTS 


1 presents the mean proportion of re- 
o the more frequently reinforced side 
in blocks of 10 trials for all subjects. In- 
reveals a marked tendency by those 
who experienced social deprivation to 
the more frequently reinforced re- 
Among the remaining groups, no differ- 
tformance is evident. The Kruskal- 
test (Siegel, 1956), applied to the sub- 
Mean proportions of correct responses 
all 150 trials, indicates that these four 
are not drawn from the same population 
1). The Mann-Whitney U test was ap- 
“Siegel, 1956) to the asymptotic propor- 
S.10 responses over the last three blocks 
als. These findings, presented in Table 1, 
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BLOCKS OF 20 TRIALS 
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BOYS — FIRST GRADE 


*———* = SATIATED SOCIAL 
@-s--==-0 = SATIATED NONSOCIAL. 
4—-—--* = DEPRIVED SOCIAL 
eee = DEPRIVED NONSOCIAL 


5 6 7 8 


(Block 1 = 10 Trials) 


Fic. 1. Mean proportion of response to the more frequently reinforced side. 


confirm the need for social reinforcement on the 
part of those subjects who had been socially 
deprived prior to the learning task. 


Discussion 


It is clear that the superior asymptotic per- 
formance found by Lewis et al. (1963) among 
young boys who had been socially reinforced was 
not due merely to the effectiveness of social 


TABLE 1 
Comparisons OF EXPERIMENTAL CONDITIONS AT 
ASYMPTOTIC PERFORMANCE 
Trials SD-SR SD-NSR | SS-NSR 
141-150 
SD-NSR O18 
SS-NSR -001 ns 
SS-SR -001 ns ns 
131-140 
SD-NSR oi 
SS-NSR 01 ns 
SS-SR OL 05 ns 
121-130 
SD-NSR -001 
SS-NSR 01 ns 
SS-SR 001 


ns ns 


NSR =nonsocially rein- 


Note.—SR =socially reinforced, 
forced, SD =socially deprived, SS =socially satiated. 
forall p-values are (orone tailed Mann-Whitney U tests, 
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reinforcement. Rather, it resulted from the com- 
bined effects of social deprivation followed by so- 
cial reinforcement. And, while the data presented 
here do not permit extrapolation to older chil- 
dren, it would seem more reasonable to interpret 
their findings in terms of differential capacities to 
tolerate social deprivation among younger and 
older children, rather than in terms of the rela- 
tive effectiveness of social and nonsocial rein- 
forcement per se. Thus, it might be argued that 
for older children the need for social reinforce- 
ment is not so intense, perhaps because they have 
been satiated on that kind of reinforcement or be- 
cause other reinforcements have become domi- 
nant in their reinforcement hierarchy. Older chil- 
dren would, then, be more able to tolerate so- 
cial deprivation than younger children, for whom 
the need for, and consequently the potency of, 
social reinforcement is relatively greater. 

These hypotheses are, of course, tentative 
formulations amenable to, and currently receiv- 
ing, experimental verification. They are not, 
however, necessarily contradicted by the strong 
Positive correlation found by Gewirtz and Baer 
(1958b) between deprivation and age, since in 
that study the authors used very young (nursery 
school) subjects. Conceivably, very young chil- 
dren show a marked responsiveness to social 
deprivation which tapers off and then actually 
decreases with increasing age. 

Gewirtz and Baer (1958a), in discussing the 
implications of their study for a theory of social 
drives, suggest that the existence of such a 
theory would gain support if it could be shown 
that following social deprivation, subjects are 
more responsive to specifically social reinforce- 
ment than to other kinds of reinforcement. This 
experiment, along with Erickson’s (1962), would 
appear to offer that support, in that a light, in 
this case, and marbles, in Erickson’s—both non- 
social reinforcers—appear to have no differential 
effect following deprivation or satiation, while 
Social reinforcements following deprivation do 
have their effects. 

Indeed, these data and Erickson’s (1962) 
invite some discussion regarding the controversy 
over whether the condition brought about by 
social deprivation is best seen as creating a social 
drive (Gewirtz & Baer, 1958a, 1958b) or anxiety. 
Walters and his co-workers (Walters & Karal, 
1960; Walters & Quinn, 1960; Walters & Ray, 
1960) have taken sharp issue with the concept of 
social drive (as opposed to the operationally 
defined social deprivation) on both theoretical 
and experimental grounds. As a matter of theo- 
retical logic, they object to the evidential basis 
of any state whose existence is imputed solely 
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from the behavior it is called upon to explain, 
Experimentally, they demonstrate that it may | 
be to the anxiety component of social deprivation | 
that the subsequent reinforcement is addressed) 
and they, therefore, prefer to interpret the find 
ings in this area in terms of anxiety arousal and 
reduction. 

The findings of this study and of Erickson’ 
(1962) offer no support to the anxiety position 
but implicitly support the interpretation of 
Gewirtz and Baer (1958a, 1958b). For if, indeed, 
it is anxiety that is at the root of the phenom 
enon elicited by social deprivation, then it mus 
be an anxiety of a very special sort, one whichi 
is reduced not by all reinforcers but by a limited] 
class of reinforcers of which, thus far, only, the 
social reinforcer is known. And, while it may be 
argued that nonsocial reinforcement—lights otf. 
marbles—is not as powerful a reward as soci 
reinforcement, it is certainly more powerfill 
than no reward at all. The fact that these nom 
social reinforcements had no effect may indicate 
that the anxiety which arises from social deprivad 
tion requires a specific class (or kind) of 1@ 
ducer, to which nonsocial reinforcement does not 
belong, and to which, perhaps, only social reits 
forcers do belong. Further defining that class 
that is, specifying which reinforcers affect social 
deprivation, is an interesting matter for futur) 
investigation. 3 

The findings reported here are evidence {ot 
the operation of the social deprivation variable 
in a new experimental context—probability) 
learning—and under conditions of considerably] 
briefer deprivation than was previously employe} 
With regard to the latter, an issue of some 
methodological importance emerges, which ii 
that apparently small and trivial differences ™ 
experimental procedure can yield large different 
in experimental outcomes. The findings presented 
here, which indicate that preexperimental al00t| 


Suggest that greater attention needs to be giv 
to the preexperimental environment in whi | 
studies with human subjects take place. Certain 
some comment regarding that environment mig 
well be incorporated in the Method sections % 
research reports. 
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APPROVAL-SEEKING MOTIVATION AND THE VOLUNTEER SUBJECT 


JOHN W. McDAVID 


University of Miami 


Students were recruited under 4 degrees of external inducement to participate 
as Ss in psychological research. Their performances on a test of approval- 
seeking motivation differed in an orderly manner. Captive Ss, tested without 
forewarning, received the lowest and most widely dispersed scores; those who 
i participated as an academic requirement and volunteers who received extra 
grade credit for participation showed intermediate scores; and Ss who volun- 
teered with minimal external inducement obtained the highest mean and least 


the approval-seeking scores. 


Experimenters have often expressed skepticism 

' about ‘generalizations from behavior of volun- 
teers. Concern about “demand characteristics” 
_ and eagerness of subjects to please the experi- 
menter originally expressed decades ago (€.g., 
Pierce, 1908), has become a more central prob- 
A experimental design recently (Orne, 1959, 
It is tacitly assumed that volunteers differ in 

_ Several ways from those who do not volunteer. 
The nature of these differences is largely unspeci- 
“fied, however. Frey and Becker (1958) identified 
erences in scores on four scales of the Guil- 

ord personality inventory between volunteers 
Who notified the experimenter of their inability 
to keep an appointment and volunteers who 
Simply failed to show up at the appointed time. 
rien and Rosenbaum (1955) demonstrated 
at social pressures from observation that a 


dispersed scores, The t tests indicate that the groups differed significantly as 
a function of the conditions of recruitment: the more voluntarism, the higher 


colleague has volunteered increase the probability 
that the observer himself will volunteer. 

The observations reported here were made 
incidentally in connection with development of a 
device for assessing the strength of an indi- 
viduals motivation to seek interpersonal ap- 
proval. Differences between several groups of 
subjects recruited under different conditions sug- 
gested that approval-seeking motivation is an 
important inducement to the volunteer subject. 


METHOD 
Subjects + 


One-hundred and thirty-six male students, mostly 
sophomores, in Jower-division university courses in 
psychology served as subjects in the preliminary 
standardization of a scale for assessing strength of 
approval-seeking motivation. The manner of pro- 
curement of these subjects for experimental partici- 
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TABLE 1 


SRS Scores ror SUBJECTS PARTICIPATING UNDER 
Four CONDITIONS OF INDUCEMENT 


* Condition N M SD 
Captive 31 53.90 10.54 
Required 25 53.68 14.19 
Bonus point 61 58.20 6.47 
Gratis volunteer | 19 63.26 7.63 
Total 136 57.10 9.98 


pation varied such that they fell into several cate- 
gories; 

_ 1. “Captive audience” subjects: Thirty-one sub- 
jects were tested in two classes during regular class 
periods, No advance warning that the class period 
would be used for research purposes was given. Such 
subjects were “captive” in that they had no volun- 
tary choice about participation. 

2, “Required” subjects: Another 25 subjects came 
to a scheduled testing session outside regular class 
meetings. They were under professor’s instructions 
that participation was mandatory. No penalty for 
nonattendance was mentioned. 

3. “Volunteer” subjects: Eighty subjects were 
from another large class. Volunteers were recruited 
under promise of extra grade credit (0.5% added to 
the final course grade per hour of experimental 
Participation, up to a limit of 3.0% bonus credit). 
Volunteering involved willingness to attend any one 
of several testing sessions at various hours every 
weekday during a 10-day period. Testing was 
scheduled in the same classroom building. 

Minimum inconvenience was required for partici- 
pation, These subjects were distributed as follows: 
61 subjects eligible to receive extra grade credit at 
the promised rate (c); 19 subjects who were aware 
they had already achieved the maximum bonus 
credit allowed (3.0%); therefore they volunteered 

gratis? without the additional inducement of 
Promised grade credit (ca). 


Procedure 


The assessment instrument used is the Social 
Reinforcement Scale (SRS; McDavid, 1962). It is a 
Paper-and-pencil test, with standardization extended 
to include several hundred additional subjects. The 
respondent answers 25 items indicating the extent of 
his agreement with each by marking one of four 
symbols from “Strong and unqualified agreement,” 
to “Strong and unqualified disagreement.” These 
responses are weighted 4-3-2-1 for 15 items scored 
for agreement and 1-2-3-4 for 10 items scored for 

agreement ; thus scores range from 25 to 100. 
Each item describes either a high degree of sensi- 
tivity, or conversely a high degree of indifference, to 
a variety of social reinforcement events (interper- 
sonal praise, social acceptance, social desirability, 
verbal reward, etc.). Test-retest reliability for male 
subjects is approximately .90. Exploration of social 
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scores and the social desirability variable as well as 
acquiescent response set (McDavid, 1962). Scores 
on the SRS were obtained for these subjects, and 
the mean scores for each of the subgroups of sub- 
jects described above were compared. 


RESULTS 


The mean scores for each category of subjects 
are reported in Table 1. The ¢ tests of differences {i 
between various pairs of these means revealed 
statistical significance as follows. 

The captive subjects scored significantly lower 
on the SRS than either the bonus-point volun- 
teers (t = 2.07, p < .05) or the gratis volunteers 
(t = 3.58, p <.01). However, this group did net 


differ from the required subjects. The required | » 


subjects scored significantly lower than the gratis 4 


volunteers (t= 2.81, p < .01), but did not differ | 


from the bonus-point volunteers. Finally, the 
two volunteer groups differed, with the gratis’ 
volunteers scoring significantly higher on the 
SRS than the bonus-point volunteers (t= 2.56, 
p<.05). 

These observed mean differences are consistent 
with differentiations among the four groups in 
terms of the character of external inducement 
applied to elicit participation. The less coercive | 
pressure applied, the higher the scores on the 
SRS. The fact that the variability of scores 
within the volunteer groups, as reflected in theit 
standard deviations, is considerably smaller that | 
that within the captive and required groups 
further supports the assumption that individuals 
who appeared in the volunteer groups are selec 


tively homogenous in their stronger approval: R 


seeking tendencies. 

It appears, then, that an important determi- 
nant of willingness to volunteer as a subject for 
psychological research involves motivation to 
seek social approval as a reward or goal. Ordi- 
narily, the conditions of recruitment of subjects | 
depict volunteering as a socially desirable form 
of behavior or even as a potential act of ingi% 
tiation toward the experimenter or course W |. 
structor, Since these data suggest that voluntee! 
subjects represent a selectively homogenous 
group of individuals with relatively strong nee 
for social approval, any generalization of fn% | 
ings must be made cautiously and with recogni d 
tion of this kind of situationally induced bi): 
upon the performance of such subjects. The im- 
plications of such bias are especially crucial for 
research dealing with the issue of social te™ » 
forcement itself, in that limiting our observation | 
to such selected volunteers may lead to exagéet 


desirability has indicated independence of total : 
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9 ° 
ation of the role of approval seeking or accep- 
_ tance seeking in human behavior. 


3 REFERENCES 


Braxe, R. R, & ROSENBAUM, M. Volunteering as a 

' function of field structure. Journal of Abnormal 
and Social Psychology, 1955, 50, 193-196. 

Frey, A. H, & Becker, W. C. Some personality 
correlates of subjects who fail to appear for ex- 
perimental appointments. Journal of Consulting 
Psychology, 1958, 22, 164. 

. McDavm, J. W. The incentive and reward value of 

social approval and disapproval. Technical Report 


' «Journal of Personality and Social Psychology 
Vol. 2, No. 1, 117-121 


117 


No. 2, 1962, University of Miami, Contract Nonr 
840(22) NR-177-303, Office of Naval Research. 
Orsz, M. T. The demand characteristics of an ex- 
perimental design and their implications. American 
Psychologist, 1959, 14, 424. (Abstract) ' 

Orne, M. T. On the social psychology of the psy- 
chological experiment. American Psychologist, 
1962, 17, 776-783. 

Prce, A. H. The subconscious again. Journal of 
Philosophy, Psychology, and Scientific Methods, 
1908, 5, 264-271. 


(Received December 9, 1963) 


SOCIAL ROLE AND ACADEMIC ACHIEVEMENT * 


ROBERT S. WYER, Jr.? 


University of Iowa 


AND 


GLENN TERRELL 
University of Illinois, Chicago Circle 


This study was designed to investigate the relationship between social role 
and academic achievement in college students. A sample of 28 males and 28 


females were selected from a large sample of college sophomores and classified 


into 4 groups on the basis of their academic aptitude and academic perform- 


ance. Analyses were made involving 


academic recognition, the certainty o! 


sufficiency or social independence, 


nificant triple interactions of sex, performance, y r ast 
choice and desire to receive social recognition; 


of certainty of occupational 


these Ss’ desires to receive social and 
f their occupational choice, their self- 
and their persistence or self-control. Sig- 


and ability occurred on measures 


sex interacted significantly with performance on the measure of self-control. 
Social group dependence ae negatively to academic performance among 


males but not among females. 


variables relating to their academic per! 
differences in their prescribed social roles] 


American colleges and universities provide 
means for students to attain goals of two general 
types. Intellectual or academic goals are provided 

_ by the educational structure of the institution and 
May be attained through satisfaction of the spe- 
ie academic requirements and through partici- 
Pation in other campus cultural and intellectual 
events, Social goals are provided by extracurricu- 
‘lar activities and social interaction, peripheral to 
the formal educational system. These are attain- 


.; able through interpersonal relations and recogni- 


_ tion from peer group members. 


aes study was performed under the sponsorship 
a he United States Office of Education and the Uni- 
th aay of Colorado, with whom the authors were at 
a time affiliated. Thanks are extended to Cath- 
hel e Felknor and Wayne Herschberger for their 

Ip in organizing and collecting data for this study. 


2 
ee the Institute of Child Behavior and Develop- 


Differences between males and females in the 
formance were interpreted as due to 


Marked differences in motivational patterns 
between males and females of varying ages have 
been observed. Cobb (1954) found that wishes 
of young boys exceeded those of girls in areas of 
personal achievement and possessions, while girls 
had stronger wishes than boys in matters pertain- 
ing to social and family relationships. Todd, Ter- 
rell, and Frank (1962) report sex differences in 
goal patterns related to academic achievement of 
college students. These results suggest that the 
development of motivation and personality char- 
acteristics is influenced by factors pertaining to 
the prescribed sex-role definitions in our society. 
Tf this is true, goal-directed behavior should vary 
among individuals with the perceived relevance of 
these goals to the fulfillment of their present or 
anticipated social roles. Applying this reasoning to 
the scholastic performance of college students, 
variation among persons in academic achieve- 
ment may be viewed in part as a result of dif- 
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ferences in the degree to which academic goals are 
perceived as facilitating attainment of sex-defined 
social roles. It may be anticipated that among 
males, students who occupy a dominant career- 
oriented social role to which academic competence 
may be directly relevant, academic goals relate 
positively to scholastic achievement, as would 
personality characteristics which facilitate chan- 
neling of efforts into academic areas (e.g., perse- 
verence, strong willpower, etc.). Social goals in 
male students would be expected to conflict with 
academic pursuits and consequently decrease aca- 
demic achievement. Females generally come to 
occupy a non-career-oriented role, that of house- 
wife or mother, to which academic competence 
is not directly relevant. For them academic effec- 
tiveness may be inspired by desires and interests 
which are not of a purely academic or career- 
oriented nature but rather which reflect the par- 
ticular social role which they would ultimately 
be expected to fill. 

The study reported here, which is part of an 
extensive investigation concerned with the rela- 
tionship between nonintellective variables and 
academic achievement of college students, was 
concerned primarily with the above propositions. 


METHOD 


Data were obtained from 576 sophomores and 1144 
first-term freshmen from various colleges of the Uni- 
versity of Colorado for whom measures of both aca- 
demic aptitude and academic performance were 
available. The results to be reported here were ob- 
tained primarily from the sample of sophomores 
which included 225 males and 351 females. 

Academic performance was measured by cumula- 
tive grade-point average (GPA). Academic aptitude 
was measured by the combined Verbal and Mathe- 
matics scores on the Scholastic Aptitude Test (SAT) 
of the College Entrance Examination Board. Both 
GPA and SAT were converted to z scores, From the 
sophomore sample 28 males and 28 females were 
selected such that 7 subjects of each sex Possessed one 
of four combinations of aptitude and performance. 
Criteria for selection were as follows: 


1. High aptitude-high performance—z scores of 
greater than .50 for both GPA and SAT, and an 
absolute difference between GPA and SAT = scores 
of less than .25, 

2. Low aptitude-low performance—z scores of less 
than —.50 for both GPA and SAT, and an absolute 
difference between GPA and SAT z scores of less 
than .25. j 

3. High aptitude-low performance—a SAT z score 
of greater than .50, a GPA z score of less than TACEN 
and a difference between SAT and GPA z scores of 
greater than 1.50. 

4. Low aptitude-high performance—a SAT z score 
of less than —.50, a GPA z score of greater than 50, 
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and a difference between GPA and SAT z scores of Í 
greater than 1.50. 3 


Subjects in the high aptitude-low performance 
group were considered to be underachievers; those in 
the low aptitude-high performance group were con- s 
sidered to be overachievers. 


Academic and Social Motivation 


Academic and social motivational characteristics ' 
were assessed primarily through use of a modified ^ 
form of the Goal Preference Inventory developed by | 
Liverant (1958). Motivation was assessed in three 
general areas: 


1. Academic recognition—acknowledgement of 
competence in academic situations, approval and ad- 
miration by others for academic behaviors, attain-, 
ment of academic status, etc. oe 

2. Social recognition—acknowledgement of compe- ' 
tence in social activities, approval and admiration by 
others for social behaviors, attainment of social | 
status, etc. 1 

3. Social love and affection—acceptance and posi- 
tive regard by others in a social atmosphere, being 
valued as a friend in social activities, affiliation with 
a social group, etc. 


Liverant’s original questionnaire was modified to 
eliminate forced choice between pairs of items and 
thus facilitate detection of equally high or low 
motivation in all areas. In addition, the inventory 
was expanded into 12 subscales to assess desire to 
seek goals in each of the three areas, expectancy t0 -1 
seek and expectancy to receive them. In the present 
study two scales were selected which were felt to 
assess most directly academic and social motiva- 
tional characteristics: desire to receive academic | 
recognition and desire to receive social recognition. | 

The inventory consisted of 60 items; for each a 
item, subjects were asked to record on a 5-point t 
scale the extent of their desires and expectancies i 
the area referred to. The questionnaire was presente { 
in the following form: 


How much would you like and how much do you” 
expect: ' 


Like Expect- 
1. to do favors for others ae 
2. to be praised for a high quality . 
term paper —— — . 
3. to be active in campus politics atid 


In addition to this measure, an indication of the 
subjects’ vocational goal directedness was obtained. , 
by asking them to respond on a 4-point scale to the" 
following question: How certain are you in yout 
choice of the profession or occupation (e.g, lawye 
housewife, journalist, teacher, etc.) which you it 
tend to pursue? 

Personality characteristics were assessed throug | 
use of the Sixteen Personality Factor Questionnaire | 
constructed by Cattell, Saunders, and Stice (1987). 
Factor Qs (high self-sentiment formation; controlled, 
exacting willpower) was selected for analysis in this 
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TABLE 1 


, Mean Desire To RECEIVE ACADEMIC RECOGNITION 
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TABLE 3 
MEAN CERTAINTY OF OCCUPATIONAL CHOICE 


Performance Performance 
Males Females Aptitude Males Females 
Low High Low High Low High Low High 
56.5 58.4 58.0 55.4 High 2.3 2.6 3.6 2.3 
53.3 55.9 57.6 54.5 Low 2.6 3.1 2.3 3.4 


, study due to its potential bearing on academic effi- 
ciency and Factor Q» (self-sufficiency, lack of social 
group dependency) was selected due to its potential 

assessment of social dependency and need for social 
approval, factors which could conflict with academic 
goal attainment, 
i RESULTS 


were analyzed using a three-way analysis 
Variance factorial design. Tables 1 and 2 show 
e mean desire to receive academic and social 
Tecognition acknowledged by males and females 
at each of four combinations of aptitude and per- 
mance. No main or interaction effects were sig- 
nificant regarding the desire to receive academic 
Tecognition. However, the Aptitude X Perform- 
ance X Sex interaction was significant in the anal- 
‘of the desire to receive social recognition 
43, df= 1/48, p< .05, MS, = 66.15). 
le 3 shows mean certainty of occupational 
ce of subjects of each of the four ability- 
Ormance combinations. Both the Aptitude X 
ormance interaction (F = 7.05, df= 1/48, 
105, MS,=.916) and the Aptitude X Per- 
ce X Sex interaction (F= 4.39, df= 
< 05) were significant. Analyses of sim- 
ects (Winer, 1962) show that the Apti- 
X Performance interaction was significant 
females (F = 11.27, df = 1/48, p < .01) 
among males (F = .15). Furthermore, not 
Were high ability-low performing (under- 
leving) females significantly more certain of 
Occupational choice than both low ability- 
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TABLE 2 
DESIRE to Recetve SOCIAL RECOGNITION 


low performing females and high ability-high 
performing females (F = 5.79, df=1/48, p< 
-05 in both cases), but low ability-high perform- 
ing (overachieving) females also were signifi- 
cantly more certain of their choice than the lat- 
ter two groups (F = 4.99, df = 1/48, p < .05 in 
both cases). 

Tables 4 and 5 show mean scores on 16 PF 
Factors Q, and Q3. A high score on Factor Qə 


TABLE 4 
MEAN SELF-SUFFICIENCY (16 PF Factor Qe) 


Performance 
Aptitude Males Females 
Low High Low High 
High 10.0 13.0 10.7 9.7 
Low 9.4 11.9 8.4 9.7 


suggests high self-sufficiency and low social group 
dependence, while a high score on Factor Q3 
suggests a highly controlled, exacting willpower 
(Cattell et al., 1957). On Factor Qo, main effects 
of both performance (F = 4.28, df = 1/48, p< 
.05, MS, = 6.67) and sex (F = 4.28, df = 1/48, 
p<.05) were significant while the Perform- 
ance X Sex interaction approached significance 
(F = 3.48, df = 1/48, p < .10). Analyses of sim- 


TABLE 5 
MEAN CONTROLLED WILLPOWER (16 PF Factor Q;) 


Performance Performance 
Males Females Aptitude Males Females 
Low High Low High Low High - Low High 
eles alg Ey fe || 3 
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ple main effects indicated that self-sufficiency in- 
creased significantly with performance among 
males (F=7.73, df = 1/48, p< .01) but not 
among females (F = 1.37). 

On Factor Q, the Performance X Sex inter- 
action was significant (F = 5.48, df = 1/48, p< 
.05, MS, = 10.96), Simple effects analyses showed 
that indications of a controlled willpower in- 
creased significantly with performance among 
males (F= 4.33, df= 1/48, p< .05) but not 
among females. 


Discussion 


Differences between males and females in the 
factors relating to performance and aptitude may 
be interpreted as a function of the different roles 
aspired to by males and females and the effect of 
these role aspirations in channeling motivation. 
Males should perceive themselves as ultimately 
attaining responsible positions in vocations to 
which a college education may be directly rele- 
vant. If this is true, then for males a primary 
incentive for high academic performance would 
be the opportunity it provides for obtaining jobs 
after graduation which require academic compe- 
tence. In contrast, the female student may often 
perceive herself as eventually filling a social role, 
that of housewife or mother, to which academic 
success is not directly relevant. She may there- 
fore be primarily oriented toward attaining goals 
in college relevant to this role; for example, 
marriage. (In this regard, 21% of 619 incoming 
freshmen women reported marriage as a primary 
goal upon graduation.) Therefore, although both 
males and females may strive for high academic 
performance to the extent it is felt to facilitate 
vocational goal attainment, a substantial propor- 
tion of females, unlike males, may feel that aca- 
demic competence is not relevant to their long- 
range vocational interests and hence may engage 
in nonacademic pursuits more directly related to 
their primary interests. 

Data in Table 3 are of interest in this regard. 
Both overachieving and underachieving females 
were more certain of their occupational choice 
than subjects whose performance was commen- 
surate with their ability. This suggests the possi- 
bility that underachieving females are those who 
have definitely decided upon homemaking as an 
ultimate vocational role and therefore have little 
desire to pursue academic goals which are irrele- 
vant to this role. On the other hand, overachiev- 
ers may be those who have decided upon a career 
outside the home which requires academic compe- 
tence, It is noteworthy that among males, whose 
` career goals may be predominantly those requir- 
ing academic competence, overachievers were also 
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higher than other groups in the certainty of their 
occupational choice while underachievers were A 
slightly lower than other groups. 

Indications of a controlled exacting willpower 


related significantly positively to academic per- ` 


formance among males but tended to have the 
opposite effect among females (Table 5). This 
suggests that among males who may be inspired ” 


by a desire for long-range vocational goals, 


perseverence and self-control are beneficial to 
academic effectiveness. However, similar charac- 
teristics in females may direct them toward the 
pursuit of nonacademic goals more relevant to 
their social role, resulting in a decrement in aca- 
demic performance. Additional research is re- 
quired to substantiate the latter interpretation, ~ 

It should not be inferred from this discussion 
that no male college students are socially moti- 
vated. High social motivation may in fact in- 
hibit academic striving in males for at least two 
reasons: social involvement may conflict with 
active striving for the high academic perform- 
ance required for long-range vocational goal 
attainment, and some socially motivated and 
socially conscious males may perceive active © 
pursuit of academic goals (e.g., studying) as not 
socially acceptable. Particularly indicative of the 
latter possibility is the fact that, along the per- 
sonality dimension “studious,” self-acceptance 
correlated .542 for 250 randomly selected fresh- 
man females but only .342 among 250 freshman 
males, The difference between these correlations, 
which is statistically significant (z= 2.77, p< 
01), implies that while females tend to accept 


themselves more as they perceive themselves as~[ 


being more studious, a substantial proportion of 
males may see active academic striving (studious 
ness) as not acceptable, or else may accept them- 
selves even though they perceive themselves 9% 
low in this characteristic. Therefore, while desire 
to receive academic goals may be similar in all 
male students due to the value of these goals in 
vocational success (as results shown in Table 1 
indicate), desire and expectancy to seek acadęmic 
goals may vary inversely with the degree to which 
a social stigma is felt to accompany active put: 
suit of academic goals. 


may feel a conflict between social and academic 
goals is provided in the data shown in Tables 2 
and 4. Underachieving males were found to have 
higher desire to receive social recognition than 
subjects described by any other aptitude-pet 
formance combination, while underachieving fe- 
males had lower desire to receive social recogni- 
tion than subjects not fitting this description. 
Furthermore, low performing males acknowl- 


l 


; 


| 
b 
Further indication that males but not females. | 
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TABLE 6 


MEAN GRADE 


-POINT AVERAGE 


Desire to seek academic recognition 


Desire to seek Males Females 
social recognition 
Low High Low High 
(below the mean) (above the mean) (below the mean) (above the mean) 
High (above the mean) 2.07 (118) 2.10 (174) 2.19 (134) 2.30 (208) 
Low (below the mean) 1.96 (132) 2.29 (98) 2.20 (132) 2.33 (137) 


Note.—N in each cell is given in parentheses. 


edged significantly less self-sufficiency and more 
social group dependence than higher performers, 
while a parallel relationship among females was 
not found. 

If this reasoning is correct, a decrement in aca- 
demic performance should occur primarily among 
males for whom social and academic goal at- 
tainment are in conflict. At a given level of so- 
cial motivation, males feeling little conflict be- 
tween pursuit of academic goals and social goal 
attainment may strive for higher performance 
than subjects who feel that conflict does exist. 
Data obtained from incoming freshman males 
Suggest that this conflict is predominant among 
students highly motivated to seek goals in both 
academic and social areas. Table 6 shows that 


among male students high in desire to seek aca- 


demic recognition, a high desire to seek social 
recognition appeared to have an adverse effect 
upon performance. However, among males with 
low desire to seek academic recognition, a high 
desire to seek social recognition appeared to 
facilitate performance. This interaction, which 
Was significant (F = 4.30, df =1/518, p<.05, 
MS. = 00519), using a two-way analysis of vari- 
ancé approximation technique for unequal cell 
frequencies (Walker & Lev, 1953), suggests that 
males with high desire to seek goals in both areas 
may have greater conflict in deciding in which 
area they should direct the weight of their ener- 
gles. Their performance may therefore suffer 
relative to students high nly in academic recog- 
nition needs who can channel their efforts com- 
pletely toward academic goal attainment. On the 
other hand, a social stigma may be attached to 


. Performance below a certain minimal level; so- 


cially 


i oriented males may strive to attain at 


east this performance level, a “gentlemen’s C,” 


despite lack of intrinsic desire to seek academic 
goals. The fact that a similar interaction was 
not found in analysis of the data on females 
(F=.11) suggests that for these subjects high 
social motivation may not conflict with academic 
pursuits, The finding that social group dependence 
relates negatively to performance only among 
males (Table 4) is also consistent with this in- 
terpretation. 

Although the above interpretations should be 
validated with additional measures of the con- 
structs involved, it is apparent from the results 
that a complete understanding of the factors 
affecting academic achievement in college stu- 
dents must take into account the differential 
effects of social role in channeling motivation and 


values. 
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SOME EFFECTS OF ANXIETY ON THE INTELLIGIBILITY OF 


VERBAL COMMUNICATION IN PSYCHOTHERAPY * 


ARTHUR PLATZ ann GILBERT HONIGFELD 


Veterans Administration Central Neuropsychiatric Research Laboratory, 
Perry Point, Maryland 


6 speech samples from psychotherapy interviews were mutilated by deleting 
every fifth word and presented to 30 college students with instructions to fill 
in the blanks. A content measure of the patient’s tension level during each 
speech sample was significantly related to the difficulty of the passage (p < 
001), The anxiety level of the students was not related to the accuracy with 
which the blanks were completed, although high-anxiety Ss gave fewer different 
responses to each blank, and used the most common response more fre- 
quently than did low-anxiety Ss (p < .05).The results are discussed in terms 
of both an interference theory of anxiety and the Hull-Spence hypothesis 
of an energizing effect of drive. Experimental studies on the effects of stress 
and anxiety on verbal behavior are considered in relation to the possible 
effects of patient and therapist anxiety on the communication process during 


psychotherapy. 


Psychotherapy is a communication process 
based upon the shared language habits between 
patient and therapist. This process may be 
analyzed into two sets of interlocking habits: 
the encoding habits of the communication 
source, and the decoding habits of the com- 
munication receiver (Osgood, 1957, 1959). 
Osgood and Walker (1959) have argued that 
the effects of drive on these verbal habits are 
similar to its effects on nonlanguage behavior. 
If drive states do modify language behavior, they 
may also affect the process of communication in 
psychotherapy. The present study is concerned 
with two related questions suggested by the 
Osgood-Walker study: Is the intelligibility of a 
patient’s speech related to his drive level? Is the 
ability to comprehend what the patient says 
related to the receiver’s drive state? 

Previous research and theory suggest con- 
tradictory predictions concerning the effects of 
anxiety and stress on verbal behavior. Some 
theorists emphasize the disruptive effects of 
anxiety on behavior. In such theories high drive 
states are assumed to elicit either task-irrelevant 
responses (Sarason, 1959) or to break down the 
“differentiation” between learned stimulus-re- 
sponse connections (Saltz, 1961). Both these 
theories suggest that anxiety would also disrupt 
ongoing verbal behavior. Anxiety in the speaker 
might, therefore, be reflected in reduced in- 
telligibility of his speech and anxiety in the 
listener in lowered comprehension of speech. 

1The writers wish to thank C. James Klett for 


his helpful suggestions on the first draft and Roderic 
Gillis for scoring and analyzing the data. 


The Hull-Spence theory (Spence, 1958) seems 
to lead to a contradictory prediction. The 
momentary reaction potential of any response 
within a habit-family hierarchy is assumed to be 
a multiplicative function of habit strength and 
drive. Since drive multiplies with habit strength, 
the effect of increasing drive should be to in- 
crease the probability of the stronger of the 
response alternatives available. This suggests 


that, under some conditions at least, verbal be- 


havior would become more stereotyped and 
redundant and consequently, more predictable. 
The research findings provide some support 


for both points of view. In regard to encoding ~ 


habits Osgood and Walker (1959) report that 
letters written under high drive show evidence of 
greater stereotypy as measured by vocabulary 
diversity, number of repetitions, and use of 
qualifying phrases, but do not show more fre- 
quent structural disturbances such as gram- 
matical errors or awkward constructions. Feld- 
stein and Jaffe (1962) have reported a substantial 
negative relation between measures of affective 
arousal and vocabulary diversity in speech 
samples of psychotherapy patients, primarily due 
to a greater number of word repetitions under 
high tension. However, the fact that nonlexical 
measures of tension level in speech samples 
(pauses, repetitions of isolated words or phrases, 
intrusions, etc.) generally reflect a breakdown in 
linguistic structure suggests that speech may 
become less intelligible under high tension 
(Dibner, 1956; Mahl, 1959). In regard to de- 
coding habits, little work has been done on the 
effects of anxiety in the receiver on his ability 
to comprehend verbal messages. 
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technique used for assessing the intelligi- 
J lity of verbal material in the present study was 
| the Cloze procedure developed by W. L. Taylor 
953). Verbal productions with every nth word 
eted are presented to a panel with instruc- 
is to fill in or predict the missing word from 
t maining context. The ease and accuracy of 
filling the deletions is related to the com- 
‘munality between the language systems of the 
source and receiver and thus provides both a 
ure of the intelligibility of the message and 
omprehension level of the receiver (Osgood, 
). Previous research has found the* Cloze 
edure sensitive to the effects of mental illness 
Salzinger, Portnoy, & Feldman, in press) and 
on the intelligibility of speech samples 
(Honigfeld, 1963b). The ability of the receiver 
‘to fill in deletions has been related to mental 
‘illness (Honigfeld, 1963a), and the extent of 
iliarity with the source as estimated by the 
igth of psychotherapeutic contact (Feldstein 


METHOD 


In the selection of the patient’s speech samples 
content measure of tension level was used in 
ference to the “nonlexical” indices favored by 
il (1959). The latter measure is based primarily 
Signs indicative of a breakdown in linguistic 
S ucture and thus, by definition implies reduced 
i telligibility. Consequently, it was felt that a 
Measure of tension would constitute a more 
ative test of the hypothesis. 
le passages selected for the speech sample were 
cen from published therapeutic interviews in 
tsch and Murphy (1955, pp. 186-246, Case 3). 
| Passages over 12 lines in length (N =15) 
‘Scored independently for tension level by the 
authors using a modification of the Dollard- 
er Discomfort Relief Quotient (Dollard & 
wrer, 1947). The score was obtained by simply 
the number of words indicative of tension 
tring in each speech sample. With the scores 
ected for differences in passage length the inter- 
€ reliability was .89. Of these, the passages 
h the three highest and three lowest scores were 
the study. 
Students, enrolled in an introductory psy- 
course at University College, University of 
yland, served as subjects. Prior to the experi- 
t the subjects had completed the Taylor Mani- 
fiety scale (Taylor, 1953) and had been 
into high- and low-anxiety groups. The 
-anxiety subjects had scores ranging from 
2 with a mean of 8.0; the high-anxiety sub- 
Tanged from 13 to 33 with a mean of 18.13. 


Ocedure 


th 6 psychotherapy passages were mimeographed 
every .fifth word deleted and replaced by a 


BRIEF ARTICLES 


123 
TABLE 1 
ANALYSIS OF VARIANCE OF CLOZE SCORES 
Source df MS F 
Between subjects 29 
Anxiety 1 9.60 <1.00 
Error (b) 28 32.39 
Within subjects 30 
Passage 1 1,904.07 85.77* 
Interaction 1 11.27 <1.00 
Error (w) 28 22.20 
* p <.001. 


blank. The passages were roughly equivalent in 
length, averaging about 170 words, of which 34 
were deleted. Both the three high-tension and three 
low-tension passages had a total of 102 deletions. 

The deleted passages were completed by the 
students in class. They were told that the passages 
were taken from psychotherapy interviews, and that 
their task was to try to guess the patient’s words 
which had been deleted, filling in only one word in 
each blank space. 


RESULTS 


The analysis of variance due to passage dif- 
ferences and the subjects’ anxiety level is pre- 
sented in Table 1. The differences in intelligibility 
between the speech samples with high-tension 
scores and those with low-tension scores is highly 
significant (F=85.77, p<.001, df= 1/28). 
Subjects were able to supply the correct missing 
word for only 42% of the deletions in the 
high-tension passages as compared to 53% cor- 
rect completions for the low-tension passages. 
There was no relation between the anxiety level 
of subjects and their comprehension score (F < 
1.00), nor was the interaction between the ten- 
sion level of the source and the anxiety level of 
the receiver significant (F < 1.00). 

Since previous research suggested that drive 
states frequently reduce response variability, the 
verbal response distributions elicited by each 
blank were tallied separately for the high- and 
low-anxiety groups. Consistent with this view the 
high-anxiety subjects tended to give fewer re- 
sponses to each blank than the low-anxiety sub- 
jects. The means were, respectively, 4.53 words 
and 4.92 words, and the difference significant 
beyond the .01 level (ż= 3.01). A related 
measure of reduced variability indicated a con- 
sistent tendency for the high-anxiety subjects to 
use the most common response given to each 
blank more frequently than low-anxiety subjects. 
For the high-anxiety group an average of 8.86 
subjects gave the same response within each 
blank; the mean for the low-anxiety group was 
8.50 subjects. This difference is significant beyond 
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the .05 level (¢=2.15). The use of fewer re- 
sponses and the most common response more 
frequently both reflect the greater stereotypy in 
verbal responses associated with high anxiety. 

Since the high-anxiety subjects were more 
likely than low-anxiety subjects to give the 
same response within each blank, this suggested 
that where this response was correct they would 
be superior to the low-anxiety subjects, but on 
difficult items where the most frequent response 
was incorrect, they would be inferior. To test 
this the difficulty of each deletion (measured 
by the number of subjects who filled it correctly) 
was correlated with the difference between the 
number of high- and low-anxiety subjects guess- 
ing correctly. The obtained correlation of .10 
was not significant. 


DISCUSSION 


The present study was concerned with the role 
of personality variables on the intelligibility and 
comprehension of verbal communication. These 
data indicate that communication in psycho- 
therapy becomes more difficult and less intel- 
ligible when the patient discusses anxiety-arous- 
ing material. Tension in the speaker reduces his 
intelligibility, but there is no indication that 
anxiety in the receiver affects his ability to 
“cloze” deleted passages. Apart from task differ- 
ences, two possible reasons for this difference 
may be suggested. For the patient the tension is 
acute and specific to the material being discussed; 
for the college students the anxiety level was 
chronic and not related to the stimulus situa- 
tion, In addition, it is quite probable that the 
effect of anxiety is dependent upon the level of 
arousal (cf. Malmo, 1959). Moderate levels of 
drive may reduce response variability because of 
its multiplicative relationship with habit strength. 
However, high levels of drive, sufficiently strong 
to elicit non-task-oriented responses, would be 
interfering and reflected in a performance dec- 
rement, 

The use of fewer responses by the high-anxiety 
group is consistent with this interpretation. The 
related prediction that a higher drive level should 
be associated with improved performance for easy 
items and poorer performance for hard items 
was not supported. However, drive has a dif- 
ferential effect on performance in tasks eliciting 
competing responses depending upon the ‘extent 
to which the habit strengths of the alternative 
responses are above threshold (Spence, 1958). 
Since these thresholds cannot be estimated from 
these data, this does not constitute a critical test 
of the Hull-Spence hypothesis. 

The greater communality in the responses of 
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the high-anxiety college students is at variance 
with the related data from studies using a free 
association task (Jenkins, 1959; Saltz, 1961), 
but consistent with the absolute judgment data 
of Eriksen and Wechsler (1955) showing greater 
bias in the use of response alternatives under a 
fear of shock condition. Anxiety apparently exerts 
a constraining effect on behavior in tasks demand- 
ing a correct solution which it does not exert in 
free response situations. In the latter situation 
almost any response may be equally good since 
the task demands only that a response be made 
and rot that it attain any specified end result. 
This would seem to be in agreement with 
clinical data reported by Shaffer (1947) to the 
effect that situationally aroused anxiety is much 
less apt to be disruptive if the individual is 
engaged in task-oriented behavior than if he is 
not, 

An interesting problem remains concerning 
precisely what linguistic changes occur under 
high tension which mediate the lowered intel- 
ligibility found in this study. Undoubtedly part 
of the effect is due to breakdowns in linguistic 
structure such as pauses, repetitions, and in- 
trusions which are associated with reduced pre- 
dictability of word sequences (cf. Goldman- 
Eisler, 1958). A second possibility is that high 
drive states interfere with the normative use of 
language. Although diversity of vocabulary de- 
creases under tension (primarily because of 
repetition of phrases), it also seems likely that 
when patients are talking about events which 
elicit conflict, their usage of words will become 
more idiosyncratic. Under these conditions words 
may be given personal meanings, which are not 
determined by conventional social usage. The 
extreme example of this occurs in schizophrenic 
speech samples which generally have lower 
type/token ratios than speech samples from 
normals (Salzinger et al., in press) but are also 
much less predictable because of the private 
meanings assigned to many of the words. 
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PERSUASIVE IMPACTS OF EARLY, LATE, OR NO MENTION 
OF A NEGATIVE SOURCE * 


T. R. HUSEK 


University of California, Los Angeles 


k 20-min. talks designed to improve attitudes toward mental illness phenomena 
were given to 498 Ss. For some of the talks, the communicator said she was 
an ex-mental patient at the beginning of the talk. For other groups, this in- 
formation was given at the end or not given at all. Late presentation of the 
information about the speaker resulted in more positive attitudes expressed 

toward mental illness phenomena than did early presentation. No mention of 
the information was better than early mention. No differences were found 

£4 between personal and impersonal presentations. 


Th any attempt to change the attitudes of a 
Soup by means of a persuasive communication, 
© reactions of the group to the communicator 
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are of importance (Hovland, Janis, & Kelley, 
1953). In addition, where two conflicting views 
are presented in a communication, the order of 


committee of the University of California, Los 
Angeles. The author also wishes to acknowledge the 
Western Data Processing Center, University of 
California, for use of its facilities to assist in the 
analysis of the data reported in this paper. 
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presentation of the sides has been the subject of 
considerable investigation (e.g., Hovland, Man- 
dell, Campbell, Brock, Luchins, Cohen, McGuire, 
Janis, Feierabend, & Anderson, 1957). Both the 
role of the communicator and the primacy- 
recency problems are rich fields of inquiry. 

In an attempt to clarify the individual issues, 
the two problems have been dealt with separately. 
In general, it has been found that positively 
valenced communicators are associated with more 
positive attitude change than negatively valenced 
communicators (e.g., Kelman & Hovland, 1953), 
and there is evidence which suggests that, 


when contradictory information is presented in a 
single communication, by a single communicator, 
there is a pronounced tendency for those items 
presented first to dominate the impressions received 
[Hovland et al., 1957, p. 133]. 


This paper describes an experiment the design 
of which allowed the two effects to interact. 
Talks on mental health were given to subjects 
in an attempt to improve attitudes toward mental 
health phenomena. In some of the talks the 
speaker identified herself as belonging to a nega- 
tively valenced group. Sometimes the identifica- 
tion of the speaker was made at the beginning of 
the speech, and sometimes at the end. In other 
talks, the identification was not made at all. In 
addition, the effects of impersonal versus personal 
presentations were examined. 


METHOD 
Subjects 


The subjects were 498 high school students from 
two schools matched in terms of size and socio- 
economic nature of the communities from which 
they drew their students. The smallest sampling 
unit used was a single class and 24 of these were 
used, 12 in each school. All of the classes were social 
science classes and none of them were special classes 
for exceptional students, advanced or retarded. 


Experimental Materials 


Talk, A 20-minute talk was composed which at- 
tempted to promote favorable attitudes toward 
some concepts relating to mental illness and to cor- 
rect some common misconceptions about mental 
illness and mentally ill persons. The content of the 
talk ranged widely over topics concerning mental 
illness, mentioning such items as the fact that cured 
mental patients do not stand out in a crowd and 
usually make good adjustments to society, the fact 
that a vacation is not a cure for mental illness, the 
fact that mental patients are not just pretending to 
be sick, and others. 

Speakers. To control for the specific effects of any 
particular speaker, two persons were used as com- 
municators. Two women staff members, about the 
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same age, with similar training, were selected. The 
speakers did not completely memorize the talk, but 


were able to present the material with only an * 


occasional glance at the notes. They practiced for 
several weeks and watched each other’s presentation 
with a view to standardizing the talk as much as 
possible, 

Source identification variable. For one-third of 


the presentations, the speaker mentioned at the ` 


beginning of the speech that she was a “former 
mental patient.” For another third of the presenta- 
tions, the point was made at the end of the talk. 
For the last third, no mention was made about the 
background of the speaker. Neither speaker actually 
had been a “mental patient.” 

At each of the two high schools, the role of 
each of the two speakers was held constant. That 
is, for all of the classes tested in one high school, 
one of the speakers played the role of the former 
mental patient (sometimes announcing the fact at 
the beginning of the talk and other times announc- 
ing the fact at the end of the talk). In that high 
school, the second speaker consistently made no 
mention of being a former mental patient. Roles 
were reversed in the other high school. 

Personal versus impersonal variable. Half of the 
talks were presented in an impersonal manner; the 
other half were made as personal as possible, The 
information presented in the two versions was the 
same, The personal talks attempted to make this 
information appear closer to the subjects and more 
relevant to their everyday existence than was the 
case depicted in the impersonal talks which were 
written and given as no more or less than factual 
presentations, 

Attitude measures. A form of the semantic dif- 
ferential (Osgood, Suci, & Tannenbaum, 1957) was 
used to measure attitudes, The subjects rated the 
concepts “me,” psychotherapy, neurotic people, 
mental hospital, and ex-mental patient. The scales 
worthless-valuable, dirty-clean, insincere-sincere, 
foolish-wise, unpredictable-predictable, sick-healthy; 
and dangerous-safe were used. 


Procedure 


The talks were given during the subjects’ regular 
class periods on a Friday. The speakers were in- 
troduced as people who were giving talks to 
high schools on the topic of mental health. On the 
following Monday, different experimenters came to 
the classes and administered the tests to the sub- 
jects. During the experiment, the speakers and the 
tests were not described as being connected. After 
the experiment was completed, the study was €*- 
plained to the subjects and any questions they had 
were answered frankly. 


RESULTS 


A single mean rating was obtained for each of 
the 24 classes, averaging across subjects, scales: 
and concepts (the concept “me” was not use! m 


is). This was done as follows. First 
side of each scale was defined; it 
cided that valuable, clean, sincere, wise, 
jle, healthy, and safe were the positive 
their respective scales. Then the average 
for each class was computed across all the 
nd all the concepts and all the subjects in 
ss. These class means represent an average 
T diverse set of scales and the actual 
the means are difficult to interpret. 
since the important part of the experi- 
cerns the differences among the means, 
resented in Table 1. An appropriate 
cal treatment is that for a simple random- 
design (see Lindquist, 1953, p. 177). The 
were treated as consisting of six groups 
versions of contact and personal versus 
onal) with four entries for each cell. 
ount of variance attributable to inter- 
and the personal versus impersonal 
ble is so small (interaction F = .026; per- 
impersonal F = .046) that one is tempted 
beyond the limits of conclusions permitted 
analysis of variance and suggest that the 
mal versus impersonal variable did not have 
effect. At any rate there was definitely no 
ficant interaction or personal-impersonal F 


‘the other hand, the main speaker identifi- 
m variable was quite effective, with an F of 
(p= .001 level). Since this main effect was 
ant, it was decided to make some com- 
ms among the early, late, and no-speaker 
fication groups. The Mann-Whitney U test 
, 1956) was used to compare the three 
, and each of the three possible com- 
ons was significant beyond the .05 level of 
ance, 


DISCUSSION 


findings reported above are consistent 
the conclusion that the differences between 
Subjects’ attitudes were related to their 
experienced a given combination of two 
communicator effects and a primacy 


l illness is generally negatively regarded, 
mentally ill are usually rated as “bad” 
ly, 1961). Thus an ex-mental patient 
Presumably be a negatively valenced com- 
tor when compared to an “ordinary” 
Previous research on communicator ef- 
iggests that negatively valenced com- 
Ors are associated with less positive at- 
ges than positively valanced com- 
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TABLE 1 


OVERALL SEMANTIC DIFFERENTIAL MEANS FOR 
THE 24 CLASSES 


Presentation | Early mention | Late mention | No mention 


Impersonal | 4.803 | 4.792 
4,632 | 4.776 


4.832 | 4.774 
4.732 | 4.692 


5.105 | 5.016 | 4.988 | 4.890 
5.113 | 5.090 | 4.901 | 4.790 


5.112 | 5.201 | 4.905 | 4.902 


Personal 
: 5.015 | 5.118 | 4.862 | 5.001 


¥=4.754 | =5.096 | X=4.905 


This reasoning would lead to the prediction 
that the identification of the speaker as an ex- 
mental patient should result in a communicator 
effect involving less positive attitude changes 
than would not identifying the speaker at all. 
The findings for the early-mention group versus 
the no-mention group show this expected dif- 
ference between the groups. 

However, the late-mention group displays a 
more positive set of attitudes than the no-men- 
tion group. The findings would here suggest that 
a second kind of communicator effect was operat- 
ing in contrast with the communicator effects 
identified above. 

Comparison of the early-mention and late- 
mention groups suggests that the differences in 
attitude changes between these groups are in 
part related to primacy effects. This inter- 
pretation is consistent with the general findings 
in previous investigations of the order of presen- 
tation of materials and relative persuasive impact 
of contradictory communications. 

In the comparison of all three groups one 
may interpret the findings to conclude that the 
positive attitude changes manifested in the late- 
mention group is due in part to the interaction 
of communicator and primacy effects. It is sug- 
gested that early mention of the speaker’s 
identification marks the speaker as having a 
negative valence. However, late mention of the 
speaker's jdentification allows the talk to in- 
fluence and improve the subjects’ attitudes toward 
ex-mental patients. Then, when the speaker is 
identifed at the end of the talk, subjects mark 
him as being more positively valenced than 
early-mention speakers or no-mention speakers. 
In this way, the results are consistent with 
previous research on communicator effects, but 
only after the effects of the time of speaker 
identification are taken into consideration. 

Thus not only were primacy and communicator 
effects found, but also the interaction of the 
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two types of effects produced different attitudes 
in the various treatment groups. The attitudes 
toward the communicator were affected by the 
information presented in the speech, even though 
the speakers for the various conditions were 
not separate “positive” and “negative” com- 
municators, but the same persons giving almost 
identical talks. 
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EFFECTS OF SEVERITY OF THREAT AND PERCEIVED AVAILABILITY 
ON THE ATTRACTIVENESS OF OBJECTS + 


ELIZABETH ANN TURNER 2? AnD JOHN C. WRIGHT 
Institute of Child Development, University of Minnesota 


2 experiments are reported on changes in children’s evaluation of play objects 
as a function of several types of interference with free play. Exp. I successfully 
replicates a previous study where coercion not to play with a desirable toy 
produced relative increases or decreases in rated attractiveness depending on 
whether strong or mild threat, respectively, was employed. In Exp. II the effects 
of 5 ways of preventing play with the critical toy between rankings were in- 
vestigated. When the critical toy was secretly removed and returned, when the 


E removed it “in order to fix it” for S, 


and when the E removed it but 


promised that it would be available later, its attractiveness increased. When 


E removed the toy without explanation, 


and when E removed it indicating it 


would never be available for Play, its relative attractiveness declined. The 
results are interpreted in terms of satiation-novelty and cognitive-dissonance 


Processes, 


Jn a recent study, Aronson and Carlsmith 
(1963) found that mild, but successful coercion 
was more effective than strong coercion in in- 
ducing a child to devalue an object previously 
rated as attractive. Preschool children ranked 
several toys by a paired-comparison procedure, 
Each child was then threatened with either mild 
or severe consequences if he played with the 


1 Experiment I is a portion of an honors thesis 
submitted by the senior author to the Institute of 
Child Development, University of Minnesota, Experi- 
ment II was supported by Grant NSF-GS-202 from 
the National Science Foundation to Elliot Aronson. 
The authors wish to thank Elliot Aronson for his 
advice and assistance in the design of Experiment II. 

2 Now at Cornell University. 


second ranked toy during the play session. After 
leaving the subject alone for 10 minutes to play 
with the toys, the experimenter had the child 
rank the toys again. Each child was tested in 
both strong and weak conditions, with a period 
averaging 45 days between the two conditions. 
It was hypothesized that mild threat as com- 
pared with strong threat would arouse greater 
dissonance between the rated attractiveness 0 
the critical toy and the subject’s avoidance 0 
it during the play session and, therefore, that 
subjects would tend more to devalue the critical 
toy in the postexperimental ranking. This 
hypothesis was confirmed. 

Much of the difference found between the 
strong and weak conditions, however, was due 


à BRIEF ARTICLES 


° 
to the fact that for many subjects the strong 
" threat produced an increase in the rated attrac- 
tiveness of the critical toy. This portion of their 
results is not readily explained by dissonance 
| theory. Aronson and Carlsmith suggested that the 
‘tendency of the critical toy to increase in attrac- 
‘tiveness was not attributable to the strong threat 
. but rather to some other base-line factor present 
| . in the general experimental situation, As a test 
of this hypothesis a “no-threat” control condition 
was added in which the experimenter simply took 
‘the critical toy with him when he left the room, 
without issuing any threat. In this condition, as 
“predicted, the attractiveness of the critical toy 
increased about as much as in the strong-threat 
‘condition, indicating that the increase could not 
_ be ‘attributed to the strong threat itself. The 
dissonance-produced derogation of the critical toy 
`- in the mild-threat condition may be seen as over- 
| riding a general base-line tendency for the critical 

| toy to increase in attractiveness. 
_ , The present study was designed primarily to 
‘investigate what factors might be responsible for 
the general increase in attractiveness of the 
critical toy. As a preliminary investigation, an 
attempt was made to replicate the Aronson and 
Carlsmith results using separate groups of sub- 
jects under strong- and mild-threat conditions 

(Experiment I), In Experiment II three alterna- 

Ñ tive explanations for the general tendency of the 
Critical toy to rise in the strong-threat and no- 
| threat conditions were investigated. 

One explanation proposed by Aronson and 
Carlsmith was that in the absence of dissonance, 
Subjects had an opportunity of becoming satiated 

n with the more attractive toys during the play 
period, but not with the critical toy, since the 
latter was either avoided (strong-threat condi- 
tion) or unavailable (no-threat control). Data 
| Consistent with this hypothesis have been ob- 
tained by Harris (in press), Maddi (1961), and 
ae (1960) in studies of the incentive value 
relative novelty.” This explanation will be 
tefetred to as the satiation-novelty hypothesis. 
ae Second possibility suggested by Aronson and 
“Usmith is that the distinctiveness and attrac- 
aoe of the critical toy may be enhanced 
Bi S by an adult’s calling special attention to 
Pais it as exceptional. This explanation 
Ea referred to as the attention hypothesis. 
6 hate explanation would suggest that an 
ean attractive in proportion to its 
te Ae ility. As something becomes unavailable, 
: Poe ty correspondingly increases if one 
diene that the unavailability results from the 
Thc! having been selected by someone else- 
© experimenter removes something from the 


= 
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subject’s choice array, the subject may assume 
that the experimenter wants it for himself. An 
alternative set of assumptions leading to the 
same conclusion has to do with restriction of 
freedom of choice (see Brehm, 1961). If one 
believes he has freedom of choice and this 
freedom is unexpectedly reduced, a possible 
consequence is motivation to regain that freedom, 
or to demonstrate that it has never been lost, 
by an increased evaluation of the unavailable or 
prohibited alternative. 

In the strong-threat and control conditions the 
critical toy is made unavailable and hence more 
attractive. In the mild-threat condition, however, 
the toy is not so definitely unavailable; freedom 
of choice is not so forcibly restricted; and dis- 
sonance, which would have opposite effects, is 
aroused, This explanation is referred to as the 
unavailability hypothesis. 


EXPERIMENT I 
Method 


Subjects. The subjects, 33 girls from the nursery 
school of the Institute of Child Development at 
the University of Minnesota, ranged in age from 3.9 
to 5.1 years. They were randomly divided into two 
experimental groups, a mild-threat condition and a 
strong-threat condition, with 16 subjects in each 
experimental group. 

Procedure. The experimental room contained a 
one-way mirror and a low table, Along the wall 
opposite the door eight toys were placed in a row. 
They were: a gun with rubber darts, a stiff plastic 
doll with no accessories, a battery-powered Army 
tank, a puzzle with 100 pieces, a large truck carrier 
with four smaller trucks, a homemade stuffed cat, 
a plastic rocket ship, and a ball and jacks. As a 
part of a separate research design the four most 
attractive toys were masculine, and all the subjects 
were female. However, these considerations do not 
invalidate the test of the present hypothesis which 
concerns the effects of strong versus mild threat on 
the child’s attitude toward the critical toy. Before 
the study was initiated, the experimenter spent 3 
weeks playing with the children in the nursery school 
so that they would be familiar with the experimenter, 
an adult female, when the study began. 

The experimenter led the subject into the experi- 
mental room and pointed out the toys on the floor. 
The subject was invited to kneel on the floor next 
to the experimenter as she demonstrated each of 
the toys. Then the subject was allowed to manipulate 
each toy very briefly. The experimenter pointed out 
each toy again and said: “Look at all the toys very 
carefully. Now of all these toys which one would 
you like to play with best of all? Which one do 
you like the very best?” When the subject made a 


3 One subject in the strong-threat condition: was 
replaced because she played with the forbidden toy 
while the experimenter was absent from the room. 
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TABLE 1 


CHANGES IN RERATING THE ATTRACTIVENESS OF THE CRITICAL Toy (EXPERIMENT I) 


Condition Increase No change Decrease Total ; 
Mild threat 
‘4 5 (10) 9 (8) 16 (22) 
% Bs 182) 31.2 VA 56.3 (36.4) 100 (100) d 
Strong threat 16 (2) 
14) 5 8) 3 (0) 
% 50.0 (636) 31.2 RA 18.8 (0) 100 (100) 
Total 
10 (18 10 (18) 12 (8) 32 (44) 
% 31.3 io) 31.2 (40.9) 37.5 (18.2) 100 (100) 


Note.—Comparison data from Aronson and Carlsmith (1963) are shown in parentheses. 


choice, she was requested to pick it up and to 
put it on the other side of the room. The experi- 
menter said: “Now of all the toys that are left, 
which one would you like to play with best of all? 
Which one do you like the very best?” This pro- 
cedure was repeated until the child had chosen seven 
out of eight toys. In this way a rank order of 
preference of the toys was obtained, 

The experimenter then randomly Placed the toys 
around the room, except for the second ranked toy, 
if it were a masculine toy, or if the second choice 
were nonmasculine, the next highest ranked mascu- 
line toy.* As in the Aronson and Carlsmith study, 
the second ranked toy instead of the first ranked 
toy was chosen as the critical toy on which the 
prohibition was placed, because this permitted the 
subject to change the evaluation of the toy in either 
direction. The toy was placed on the table as the 
experimenter said, “I have to leave the room for 
just a minute to do an errand. While I’m gone you 
can play with any of these toys you want except 
this boy toy [indicating the toy on the table].” In 
the mild-threat condition the experimenter added: 
“You shouldn't play with the [naming the 
toy]. TI be right back—goodbye.” In the strong- 
threat condition, the experimenter substituted: “Tf 
you play with it, it would be very naughty and 
you wouldn’t be able to play with any of the toys 
again. Remember it would be very naughty to play 
with the —__ [naming the toy]. I'll be right back 
—goodbye.” If the subject asked why she could not 
play with the toy on the table, the experimenter said 
that they would talk about that when the experi- 
menter got back. When the subject asked specifically 
if she could play with a toy other than the critical 
toy the experimenter said, “Yes.” 

The experimenter then left the room and observed 
the subject for 5 minutes from behind the one-way 


*Th eight instances the critical toy was one ranked 
third or lower instead of the one ranked second, 
This variation stemmed from the requirement for 
other purposes that the critical toy be also a mascu- 
line toy. Three of these instances occurred in the 
mild-threat condition and five in the strong-threat 
condition, 


mirror. When the experimenter returned, the forbid- 
den toy was taken from the table and placed on 
the floor with the comment, “Now, let’s play witi 
all the toys.” After allowing the subject to play 
briefly with all the toys, including the forbidden toy, 
the experimenter suggested that the “game” be played 
again. The order of placement of the toys along the 
wall was reversed and the subject was asked 10) 
choose the toys again according to her preference. 
After this ranking, the subject was allowed to play 
with all the toys again before returning to the 
nursery school, 


Results 


Table 1 shows the number of subjects who 
decreased their liking for the critical toy, me 
creased their liking for it, or did not change their 
liking for the critical toy from the first ranking 
to the second, together with comparable fre 
quencies from the Aronson and Carlsmith study: 
In the mild-threat condition there was a tendenty 
for the subjects to decrease their liking for the! 
critical toy, while in the strong-threat condition 
subjects tended to increase their evaluation, a 
predicted. By a chi-square test the differentt 
between the strong- and mild-threat cont 
was found to be significant at the .05 lea 
(x? = 6.60, df = 2). Excluding the cases in which 
there was no change in either direction in w 
position of the critical toy, on the assumptio 
that the experimental manipulation had no effet 
on these subjects, the chi-square was 
(df= 1), significant at the .01 level. l 

These results are very close to those of the 
Aronson and Carlsmith study and confirm the 
dissonance explanation of the difference betwee! 
the effects of mild and strong threat. Aga 
however, some of the obtained difference betwee 
the strong- and weak-threat conditions was du? 
to the tendency of subjects in the strong-thre# 


condition to increase their evaluation of t° 
critical toy, 
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EXPERIMENT IT 


< The conditions in Experiment II were designed 
to test the satiation-novelty hypothesis, the 
attention-distinctiveness hypothesis, and the un- 
availability hypothesis as explanations of the rise 
“nm attractiveness of the critical toy. In one 
condition (disappear) the toy was removed and 
_ Teturned secretly so that a rise would be pre- 
dicted only by the satiation-novelty hypothesis. 
A second condition (fix) called attention to the 
stoy, but did not imply restriction of the child’s 
freedom of choice. Here a rise would be pre- 
' dicted by both the satiation-novelty and the 
attention-distinctiveness hypotheses, but not by 
the unavailability hypothesis. In the third condi- 
_ tion (removal), there was a moderate restriction 
of the subject’s freedom of choice. In this condi- 
tion a rise would be predicted by all three 
. hypotheses. 
. The inclusion of the last two conditions (never 
and later) represented an attempt to clarify the 
„ temoval manipulation, which it was believed 
could be interpreted by the child as either that 
he could never play with the critical toy or that 
he could definitely play with it later. 


Method 


Subjects. The 82 subjects, ranging in age from 6.0 
| to 83 years, attended art classes at the Institute of 
Child Development during a special summer session.® 
ey were randomly divided into five experimental 
groups without regard to the sex distribution. 
Procedure. The study was conducted in an experi- 
mental room in which six toys were placed along one 
eu The toys were: a plastic tennis racket with 
; a attached, a power-operated robot, a “slinky,” 
4 ma end scoop, a four-way periscope, and a 
Ne eel-lo.” The ranking procedure was essentially 
thc as in Experiment I. After the child had 
i R five of the six toys, the experimental condi- 
Rare introduced. The five conditions were 
“all fiv disappear, fix, removal, never, and later. In 
for iu conditions the experimenter left the room 
Peers nine. Withee third ranked 
criti za third ranked toy was designated as the 
i cal toy instead of the second ranked toy used 
y peru experiments in order to allow a wider 
tgin for increases in relative attractiveness. 
a aa a disappear condition the experimenter re- 
VF he the critical toy from the room in the pocket 
ae he smock, making sure that the subject did not 
tea do so, When she returned, she restored the 
kier toy, again with secrecy. In the fix condition, 
“This eour picked up the critical toy and said, 
s looks kind of dusty. I think PI go clean it 


5 
oe Subjects were replaced, one because she cried 
as By e experimenter left the room, and the other 
the t esult of the experimenter’s failure to remove 
°y without detection in the disappear condition. 
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TABLE 2 


CHANGES IN RERATING THE ATTRACTIVENESS OF THE 
Criticar Toy (Experiment II) 


Condition | Increase | No change| Decrease Total 
Disappear 10 4 2 16 
Fix 9 5 2 16 
Removal 2 4 10 16 
Never 1 13 16 
Later 10 3 3 16 

Total 33 17 30 80 


up for you. TIl be right back.” Before leaving in the 
removal condition the experimenter said, “You can’t 
play with this toy now, maybe later.” 

In the never condition the experimenter left the 
room carrying the critical toy, after telling the child 
that he could play with any of the toys except the 
critical toy, which he could not ever play with. In 
the later condition the experimenter said, “You can’t 
play with this toy now, but when I come back 
and we finish the game, I promise that you'll get a 
chance to play with it.” 

When the experimenter returned to the room the 
child was told that he would get a chance to play 
with the toys again a little later, but first they 
would play the “choosing game” again. In the never 
condition, the instructions varied: the child was 
told that after he had reranked the toys he could 
have another chance to play with the toys except, 
of course, the forbidden toy. In all conditions after 
the second ranking, the child was given an oppor- 
tunity to play with all the toys for a few minutes 
before he returned to class. 


Results 


As shown in Table 2 the data from the dis- 
appear, fix, and later conditions were similar. In 
these conditions the subjects tended to increase 
the position of the critical toy from the first 
ranking to the second. In contrast to these three 
conditions, the removal and never conditions 
showed a tendency for the critical toy to decrease 
in the ranking. By a chi-square test these condi- 
tion differences were found to be highly signifi- 
cant (x? = 31.18, df=8, p<.001). Excluding 


TABLE 3 


MEAN CHANGE IN ATTRACTIVENESS OF CRITICAL Toy 
(ExpermMent II) 


Condition N M SD 

i r 16 +75 1.34 

eee | E | 
l 1 e, 1 

N 16 —1.38 1.31 

Later 16 + 38 1.50 


‘All conditions 80 


132 


the cases in which there was no change in rank- 
ing did not affect the significance of the test. 
Table 3 shows the mean change of the critical 
toy for each of the five conditions. The pattern 
of the results is the same as in Table 2. The 
difference between the disappear and fix condi- 
tions was not significant (¢=.133, df= 30). 
However, the difference between the disappear and 
removal conditions (t = 3.64, df = 30, p < 001) 
and the difference between the fix and removal 
conditions (t= 3.76, df =30, p< .0005) were 
highly significant. Finally, the difference between 
the never and later conditions was found to be 
very significant (t= 3,52, df = 30, p < .005). 


Discussion 


The results of Experiment I were quite con- 
sistent with those obtained by Aronson and Carl- 
smith (1963). The tendency of subjects to 
devalue the critical toy in the mild-threat condi- 
tion of both studies is in accord with the dis- 
sonance-theory prediction. Moreover, the effect 
operates in spite of a general tendency for the 
critical toy to rise as seen in the strong-threat 
and control conditions, 

In Experiment II it was expected that subjects 
in the disappear, fix, and removal conditions 
would show an increase in the position of the 
critical toy if the satiation-novelty hypothesis 
were correct. If the general rise were due prima- 
rily to the special attention called to the critical 
toy by the experimenter, then only the fix and 
removal conditions would produce a rise. Finally, 
it was expected that only the removal and never 
conditions would produce a rise if the increase 
could best be explained on the basis of the 
unavailability hypothesis (loss of freedom of 
choice), 

Subjects in the disappear and fix conditions 
tended to increase their liking for the critical toy 
while subjects in the removal condition tended 
to decrease their liking for the toy, Although 
there was not a tendency for the critical toy 
to rise in the removal condition, the rise in the 
disappear and fix conditions still seems to suggest 
that satiation with respect to the noncritical toys, 
producing relative novelty for the critical toy, 
is primarily responsible for the tise. The 
“attention” hypothesis is weakened by the failure 
of the fix condition to elicit a greater rise than 
the disappear condition, and the unavailability 
hypothesis is directly contradicted by the decline 
obtained in the removal condition. There Temains 
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only the problem of explaining the decline jn | 
the removal condition, since the satiation-novelt 
hypothesis tentatively accepted above, predicts 
a rise. i 

The devaluation of the critical toy found in 
the removal condition resembles the dissonance i 
effect found in the mild-threat condition of" 
Experiment I. It is possible that the removal + 
condition also aroused dissonance, in this case 
between the high rating and the unavailability 
of the critical toy. Because of the ambiguity of 
the instructions concerning future availability of 
the toy in the removal condition, the never 
and later conditions were devised. The results 
clarify the findings for the removal condition. 
When the subject knows with certainty that 4 
highly evaluated toy is permanently unavailable 
(never condition) he tends strongly to devaluate l. 
it. This devaluation may be attributed to the at- 
tempt by the subject to reduce dissonance be- 
tween the desirability and the unavailability of 
the critical toy (the sour-grapes effect). Finally, 
the later condition, like the fix condition, fails 
to arouse dissonance and is hence susceptible to 
the satiation-novelty effect—a rise in evaluation 
of the critical toy. 

Tt is concluded that the temporary unavailabil- 
ity of a desired object tends to increase its | 
desirability in relation to other available objects 
as predicted by the satiation-novelty hypothesis. 
However, if the unavailability is clearly perma- 
nent or if the object is voluntarily avoided as & | 
result of barely sufficient coercion, then cognitive f 
dissonance is aroused which can only be reduced | 
by devaluation of the desired object. 

y 
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OF THE CONFIRMATION OR 


NONCONFIRMATION OF EXPECTANCIES AND DESIRES 


EDWARD E. SAMPSON anp LINDA B. SIBLEY 
University of California, Berkeley 


This study replicates and extends the basic concepts of the Carlsmith and 
Aronson study involving the disconfirmation of expectations and its effect 


on judgments of sweet and bitter solution. 40 Ss were divided into 4 experi- 
mental conditions by independently varying the factors of expectation and 


motivation. 4 of the Ss had their expectations confirmed and 4, disconfirmed. 


To provide a motivational preference for a given solution, in 4 of each of the 


and 4, each time a sweet solution was 
predicted its occurrence. Our results o) 


above conditions, Ss were paid $.25 each time a bitter solution was presented 


presented, whether or not they correctly 
n expectation confirm the original study 


with respect to judgments of the solution which was judged sweeter when 


sweetness was the expected quality. Payment of $.25 produced no significant 


change in ratings. 


In an attempt both to replicate the basic theo- 
retical scheme presented in the Carlsmith and 
. Aronson (1963) study, and to examine an aspect 
Suggested but not systematically considered in 
that work, the present experiment was under- 
taken. 

Based upon Festinger’s model of cognitive dis- 
Sonance (Festinger, 1957; also see Brehm & 
Cohen, 1962), Carlsmith and Aronson deduced 
that the nonconfirmation of one’s expectation 
_ about the environment would produce the un- 

pleasant state of cognitive dissonance which in 
turn would result in “a hedonically negative 

State.” In their experiment, involving subjects’ 

Making judgments as to the sweetness or bitter- 

hess of tasted solutions, they found general con- 

firmation for their expectation. That is, the bitter 

Solution tasted more bitter when it occurred 
Unexpectedly and the sweet solution tasted Jess 

Sweét when one’s expectation was not confirmed, 

While reading their recent article and the 
| generalizations based upon it, we were struck by 
What appeared to be the possibility of systemati- 
cally exploring the effects of disconfirmation of 
expectation and its complex relationship with 
Motivation or desire within the context of their 
experiment. As both Carlsmith and Aronson and 
Others have pointed out (e.g., Harvey, Hunt, & 
Schroder, 1961, pp. 50-58), if one’s expectations 
disconfirmed by an action which is in the 

rection of one’s motivations or desires, this is 
Not particularly unpleasant. For example, the 
Student who expects to fail a course but then 
Passes probably does not feel very unpleasant by 
es his expectations disconfirmed. Passing in 
he instance is in line with his motivations and 
ould lead him to feel pleasant. 


The preceding suggested a variation of the 
Carlsmith and Aronson technique, independently 
varying expectancy and desire in an experimental 
situation comparable in many of its essentials 
to theirs. 


METHOD 


Although the general format originally employed 
by Carlsmith and Aronson was followed, there were 
sufficient variations to suggest that this is not a 
simple replication of their design, but rather a 
replication and extension of their concepts. 


General Procedure 


As with their original study, the experimenter 
met the subjects individually and informed them that 
they were going to participate in two studies. One 
study presumably involved their sensitivity to inter- 
personal cues. The other presumably was concerned 
with their perception of taste. As part of the first 
study, the experimenter was to give them subtle 
cues involving idiosyncratic movements (e.g. scratch- 
ing her head) which would permit them to predict 
whether the liquid which they were to taste was 
sweet or bitter. As part of the second study, the sub- 
jects were to compare each solution they tasted with 
a standard sweet and a standard bitter solution. 
These comparisons were to be made on the basis of 
evaluating the standard solution as 10 and judging 
the degree to which the solution tasted was more or 
less bitter or more or Jess sweet than the standard 
solution rated 10.1 


1Jn their study, Carlsmith and Aronson used a 
quinine sulfate solution to produce bitter and sac- 
charine solution to produce sweet. Our pretesting 
with these solutions suggested that the saccharine 
was not sweet tasting to our subjects. Therefore we 
employed a sweet solution containing regular granu- 
lated table sugar mixed with water. We mixed 12 
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Up to this point, the procedure and instructions 
were parallel to those employed by Carlsmith and 
Aronson, However, in order to test the effects of 
motivation as an independent factor, it was neces- 
sary to vary their procedure somewhat. It was 
decided that one way to induce a desire for either 
bitter or sweet involved offering a monetary reward 
for one of the solutions. To introduce money into 
the picture, the following instructions were employed: 


There is one more thing we would like to find 
out while we are at it. We are curious whether 
various external circumstances will have any ef- 
fects on these taste ratings. Therefore we have 
several groups, all of which are being paid, but 
under different circumstances. Your group is being 
paid a quarter [$.25] every time you taste a bitter 
[sweet] solution. This is not connected with the 
prediction part of the experiment. That is, you 
will receive the money simply because you received 
a bitter [sweet] solution, As I mentioned before, 
the number of bitter [sweet] solutions you will 
receive and the order in which they will come 
were determined randomly. 


Thus one half of the subjects were to receive $.25 
each time a sweet solution was presented and the 
other half, $.25 each time a bitter solution was 
presented. It was reemphasized that this payment 
would be given regardless of whether or not they 
correctly guessed which solution was to be presented. 

At the conclusion of these instructions, all subjects 
were given a series of trials. Each trial consisted in 
the experimenter’s giving the subject a relevant inter- 
personal cue (the experimenter scratched her head 
to indicate that the next solution was sweet and 
chewed her pencil to indicate that the next solution 
was bitter), the subjects’ making their prediction of 
Sweet or bitter just as they actually tasted the 
solution, and their rating the solution with respect 
to the standard. They were then either paid $.25 
or not depending on the condition they were in, An 
intertrial interval of 1 minute was maintained. 

All subjects were given 6 training trials before 
being confronted with Critical Trial 7, at which 
point the expectancies established during the prelimi- 
nary 6 trials were either confirmed or not. During 
the initial 6 trials, half the subjects were paid when 
bitter occurred (on 3 of the 6) and half were 
paid when sweet occurred (also on 3 of the 6). As 
far as the subjects knew, they were to receive over 
20 trials as over 20 cups with prepared solutions 
were placed in front of them, 

The present study differs from the original there- 
fore both with respect to the payment factor and 
the establishment of a specific set of training trials 
Prior to a single critical trial, In addition, we were 
able to establish independent experimental conditions 
to study the effects of disconfirmation and motiva- 


level tablespoons of table sugar per liter of water 
to obtain the sweet solution. The bitter solution 
remained .08 gram of quinine sulfate to 1000 cubic 
centimeters of distilled water. 
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tion, rather than the single treatment condition sepa- | 


rately analyzed by trials as in the original by 
Carlsmith and Aronson.? F 


As a function of varying expectancy and motiva-, — 


tion, the preceding yielded four major experimental 
conditions. In each condition there was a further 
variation as a function of whether Critical Trial 7 
involved a sweet or a bitter solution: 


Condition I. Expectancy confirmed and motiva- 
tional desire confirmed: (a) expect sweet, paid for 
sweet, get sweet; (b) expect bitter, paid for bitter, 
get bitter. 

Condition II. Expectancy nonconfirmed and moti- 
vational desire nonconfirmed: (a) expect sweet, paid 
for sweet, get bitter; (b) expect bitter, paid for 
bitter, get sweet, 

Condition III. Expectancy nonconfirmed but moti- 
vational desire confirmed: (a) expect sweet, paid for 
bitter, get bitter; (b) expect bitter, paid for sweet, 
get sweet. 

Condition IV. Expectancy confirmed but motiva- 
tional desire nonconfirmed: (a) expect sweet, paid 
for bitter, get sweet; (b) expect bitter, paid for 
sweet, get bitter. 


Forty male and female college subjects, divided 


into 10 in each condition, 5 in each subgroup, 
were run. 


RESULTS AND Discussion 


Three major sets of analyses were conducted. 


| 


Each set consisted of a two-way analysis of ' 


variance, with two levels of expectancy (con- 
firmed versus nonconfirmed) and two levels of 
motivational desire (confirmed versus noncot- 
firmed), computed separately for those who 
received a sweet solution in Critical Trial 7 and 
those who received a bitter solution. The three 
sets differ with respect to the particular depend- 
ent measure employed. In the first set we used 
the simple scale value rating the solution. In the 
second set we used a prepost discrepancy ŝcore. 
We took as the prerating the average rating 0n 
the first 6 trials of the solution actually received. 
As there were only 3 sweet and 3 bitter, this 
average was based on these three ratings. We 
took as the postrating the value given on Trial 7. 


*It should be noted that as a function of this 
their original design used a greater number © 
observations than our study. It is difficult to know 
whether or not this factor by itself would lead to 
significantly different results in comparing the two 
studies, but at least, this additional point © 
difference should be kept in mind. 

S Four subjects were eliminated either because they 
failed to learn the cues given by the experimenter oF 
they correctly assumed they were receiving exactly 
the same concentration of bitter or sweet on eal 
trial and thus were not subjected to the necessary 
experimental deception. 


| 
| 
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, 
In the third set we converted all scores to stand- 
ard scores with a mean of 10 and a standard 
“deviation of 1 as in the original article. Because 
sall three analyses yield the same results, we shall 
+ only present the simplified first measure. Table 1 
? contains the means and Table 2 the summary of 
the analysis of variance based on these values. 
» As is apparent from this analysis of variance, 
. the only significant effect is that attributable to 
expectation (p< .05). This effect is significant, 
however, only with respect to the ratings of the 
sweet solution. The sweet solution is rated as 
tasting less sweet when the subjects’ expectancy 
is nonconfirmed, regardless of whether or not the 
subject was being paid for receiving a sweet 
' solution. The ratings of the bitter solution are 
not affected by either payment or expectancy. 


TABLE 1 


ı Mean Ratincs or SUBJECTS RECEIVING A SWEET AND 
A BITTER SOLUTION ON TRIAL 7 


Expectancy: 


Confirmed | Disconfirmed 


Sweet 
Paid for what is expected 


(desire confirmed) 15.4 6.2 
Paid for what is unexpected 
(desire not confirmed) 12.8 9.6 
Bitter 
Paid for what is expected 
desire confirmed) 14.0 15.8 
aid for what is unexpected 
(desire not confirmed) 18.0 17.0 


The findings with respect to sweet provide 
strong confirmation for the original findings of 
arlsmith and Aronson. That disconfirmation of 
expectation has no apparent effect on the ratings 
of bitter, however, does not support their original 
study. In an effort to explain this latter finding 
peel as the absence of the effect of motiva- 
ional desire which we predicted, we reexamined 

, Some aspects of the data. 
secu though we attempted to manipulate the sub- 
4 Motivation or desire for a given solution 
j soe ating money with the solution, the re- 
» UMS Suggest that either motivational factors are 
ina Portant or that payment of the $.25 was an 
ea puste way of increasing the subjects 
a re for a given solution. Accepting this latter 
iq pretation, we next assumed that the intrinsic 
ciate between bitter and sweet is itself a 
is oent motivational factor—one which, in fact, 
Undoubtedly more significant than the mone- 
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TABLE 2 
ANALYSIS OF VARIANCE OF RATINGS OF 
SWEET AND BITTER 
Source df SS MS F 
Sweet 
Expectation (A)| 1 | 192.2 | 192.2 | 6.03* 
Motivation (B) 1 8 8 025 
AXB 1 45.0 45.0 1.41 
Within 16 | 510.0 31.88 
Bitter 
A 1 8 8 016 
B 1 33.8 33.8 714 
AXB 1 9.8 9.8 .207 
Within 16 756.8 47.3 
*p <.05. 


tary manipulation we employed. According to 
this assumption, the nature of the solution itself 
determines its desirability such that the sweet 
solution is intrinsically more desirable than the 
bitter solution. 

Although we have no independent check on this 
rather plausible assumption, by analyzing the 
subjects’ prediction on the first trial of their 
learning series, we are able to get some indica- 
tion of its validity. On this first trial, the subjects 
had not yet formed any association between the 
experimenter’s interpersonal cues and a particular 
solution thus their prediction was simply a 
guess. However, all subjects knew that they were 
to be paid for either a bitter or a sweet solution. 
By examining their predictions on this first trial 
therefore, we could get an indication of what 
kind of solution they “really” wanted or wished 
for. If the payment of $.25 added significantly 
to the valence of the solution, it seems plausible 
to maintain that subjects would predict that the 
first solution to be tasted would be the one they 
were being paid for: for example, a sort of wish- 
fulfilling guess. On the other hand, if one solution 
were intrinsically more desirable than the other 
(as it very well might be given the fact that 
each subject had already tasted a standard sweet 
and a standard bitter solution and thus might 


TABLE 3 


TRIAL 1 PREDICTIONS FOR SUBJECTS Pam FOR 
SWEET AND BITTER 


Paid for: 


Predict sweet on Trial 1 18 8 
Predict bitter on Trial 1 2 12 
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have already developed a distaste for the bitter), 
this solution might be predicted as the first solu- 
tion in this somewhat wish-fulfillment approach 
to prediction in the absence of other cues. 
Table 3 presents the revelant data. 

As is rather plainly evident, although they 
were being paid for the bitter solution, 40% of 
the subjects predicted that the first solution 
would be sweet not bitter. On the other hand, 
only 10% of the subjects being paid for sweet 
predicted bitter for the first solution.* This is 
taken to provide indirect, yet rather compelling 
support for the intrinsic difference in desirability 
between the sweet and the bitter solution. 

Given the preceding, it becomes possible to 
reanalyze the data; substituting “desires sweet” 
for the motivational component in each experi- 
mental group. With this substitution, it is ap- 
propriate to compare those groups in which sweet 
was expected and received (Ia and IVa: mean 
rating of 14.1) with those in which bitter was 
expected but sweet received (IIb and IIIb: mean 
rating of 7.9), In this comparison, we will assume 
that since the subjects receive sweet, they are 
receiving what they want: for example, their 
motivations are confirmed, and the only differ- 
ence lies in the confirmation or nonconfirmation 
of expectancies. In addition, we may next com- 
pare those groups in which bitter was expected 
and received (e.g., Ib and IVb: mean rating of 
16.0) with those in which sweet was expected but 
bitter received (e.g., IIa and IIIa: mean rating of 
16.4). As we have assumed that sweet is the de- 
sired solution, in this comparison we are examin- 
ing the effects of confirmation-disconfirmation of 
expectations involving nondesired elements. 

A t test of the difference between the means 
indicates that only the difference between the 
first set is significant (t= 2.49, p<.05), with 
the unexpected sweet tasting less sweet than the 
expected sweet. As this is just another way of 
looking at the data, this finding, of course, is 
consistent with the original finding reported in 
Table 2. The interpretation based on the finding, 
however, is somewhat different. 

Given our assumption about the intrinsic de- 
sirability of the sweet solution, the above findings 
Suggest that the disconfirmation of one’s expec- 
tation about an event is affectively unpleasant 
only when it involves an event which is desired 
(eg., in line with one’s motivations). On the 
other hand, when the event itself is undesirable, 


* Of course, the general trend in the overall table 
significantly demonstrates the effect of the $.25 on 
influencing initial guesses (p< .01 by Fisher’s exact 
test), However, this effect is mainly attributable to 
those paid for and receiving sweet. 


Brier ARTICLES 


the further increment of unpleasantness produced 
by having one’s expectations about it disconfirmed_ 
makes little or no difference. It is as if an already 
unpleasant or undesirable event cannot be made 
further unpleasant by adding to it the unpleasant 
quality of disconfirmation of expectation, whereas 
a pleasant event is more vulnerable to the un- 
pleasant impact of expectancy disconfirmation. 
The preceding interpretation in many respects 
parallels a just-noticeable-difference effect in psy- 
chophysical judgments. If we assume, as both we 
and the original authors have, that the discon- 
firrhation of an expectation about the world pro- 
vides an increment of unpleasant affect, this 
increment is more noticeable when placed against 
a background of positive or pleasant affect than 
when placed against an already affectively ún- 
pleasant ground. Therefore, one would expect 
that it would require a more potent violation 
of expectation than that involved in the present 
context to provide a just noticeable difference 
of unpleasantness when judging the already unde- 
sirable or unpleasant event (e.g., the bitter solu- 
tion). On the other hand, of course, a less potent 
disconfirmation should be effective in providing a 
just noticeable difference of unpleasantness when 
judging a desired event (e.g., the sweet solution). 
This leaves unanswered, however, why our re- 
sults for the bitter solution do not replicate 
Carlsmith and Aronson’s results. Although our 
experimental situations, including the subjects 
used, differed in many respects, perhaps the most 
striking point of difference lies in their use of 
sums of money to reward correct responses an! 
to punish (by fining the subjects) incorrect 
responses. Unlike our study in which the money 
was given regardless of the subjects’ correctness 
and merely as a function of the particular solu- 
tion which came up, their emphasis on being 
correct may have sufficiently engaged the subject 
to create the effective “potent violation of 
expectation” mentioned above. That is, in their 
study, disconfirmation of expectation was ° 
seemingly greater strength (being linked to cor- 
rectness, rewards, and fines) and therefore greater 


unpleasantness than in our study. As such, the. 


unpleasantness would have been of sufficient 
Magnitude to be a just noticeable difference 
against the already unpleasant background of the 
bitter solution. Although the preceding is specu- 
lation, it is consistent with the findings of both 
studies as well as with the restricting conditions 
laid down by Carlsmith and Aronson in discussing 
the generality of their results, 

Based on our results, we suggest that the 
direction of motivation, involving what the indi- 
vidual desires, wants, or finds pleasant, in con- 


| 
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$ . 
junction with the strength of the disconfirmed 
expectation are two factors requisite to under- 
= standing the affective consequences we have 
, dealt with. There does not appear to be any 
a very simple relationship between disconfirma- 
tion of expectation and negative affect as meas- 
~" wed by evaluative changes in judging events 
, or situations, As our study has suggested, the 
, disconfirmation of an expectation would have to 
jnvolve more than disconfirmation per se to 
produce a noticeable change in judgment in an 
jnitially undesirable or unpleasant context: for 
example, it would have to be an ego-involving 
disconfirmation. In many respects, we might 
conclude that under certain conditions (e.g., the 
ı „absence of a high degree of self-involvement over 
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being correct or incorrect and the absence of re- 
wards and fines), bitter is bitter is bitter... . 
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ization effect was found. 


McGinnies and Sherman (1952) performed an 
experiment which appeared to demonstrate the 
Stneralization of the perceptual defense effect. 

Their experiment entailed the initial presenta- 
tion of taboo and neutral words for periods of 2 
Seconds. Each of these “pretask” words was 
followed by the rapid exposure of a “task” word, 
always of neutral connotation, which the subject 
attempted to recognize. They found that the 
Subjects took longer to recognize those neutral 
Words that followed taboo words than those that 

‘cllowed neutral words. Thus, perceptual defense 


mee experimental work on which this paper is 


ased was carried out while all three authors were, 


at Bedford College, London. 


3 experiments were performed. In Exp. 
McGinnies and Sherman (1952) and found, as they did, higher recognition 
thresholds for neutral words following taboo words than for neutral words 
following neutral words. However, in a further experiment (Exp. II) of the 
same design nonsense words were substituted for taboo | words as pretask 
material and very similar results obtained. We suggest that in both experiments 
an interruption of set is produced by the occurrence of taboo (or nonsense) 
words and that it is unnecessary to hypothesize the generalization of a defensive 
process, Further support for this position is derived from Exp. III in which 
3 practice trials with taboo material were given and no evidence for a general- 


I we repeated an experiment by 


was said to have “generalized,” that is, the 
avoidance reaction triggered off by a taboo word 
persisted long enough to interfere with the rec- 
ognition of the neutral word which immediately 
followed. 

Although criticism of this explanation appeared 
soon after (Postman, Bronson, & Gropper, 1953) 
on the grounds that it was premature to hypoth- 
esize the generalization of an effect before that 
effect had been definitely demonstrated, no alter- 
native explanation was offered, and further re- 
search appeared to lend support to McGinnies 
and Sherman’s position. Walters, Banks, and 
Ryder (1959) obtained similar results when the 
pretask words were presented three times in 
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succession and subliminally. Although they re- 
quest replication of their experiment in view of 
an uncertainty in their timing procedure, this 
deficiency does not seem likely to be a potent 
source of error, and their results can be taken to 
support the findings of McGinnies and Sherman. 

In recent years the whole concept of perceptual 
defense has been seriously challenged by those 
investigators who claim that differential threshold 
effects should be attributed not to a directly 
perceptual process but to the response bias of the 
observer (e.g., Ericksen & Browne, 1956; Goldia- 
mond, 1958). In experiments on the generaliza- 
tion of perceptual defense the familiarity and 
emotional tone of the task words are rigidly con- 
trolled, only the pretask words being varied. Thus, 
the data cannot be reinterpreted in terms of re- 
sponse suppression or response probability, and it 
would seem that the original conception of per- 
ceptual defense as an avoidance response, au- 
tonomic in nature and capable of persistence 
over time, must be retained. 

It was the purpose of the present series of 
experiments to test a third possible explanation: 
namely, that the appearance of taboo words in 
a series of neutral words led to an interruption 
of “set” and that this interruption in turn was 
responsible for the difficulty experienced in rec- 
ognizing the subsequent task words. We consid- 
ered the interruption of set brought about by the 
introduction of dissimilar material to be crucial 
to the effect, rather than the emotional nature of 
that material. Thus, we supposed that the same 
effect would be produced by the introduction of 
dissimilar material which was not taboo in nature. 

First, an attempt was made to replicate Mc- 
Ginnies and Sherman’s results in another labora- 
tory. 


EXPERIMENT I 
Method 


Subjects, Twenty undergraduate women volun- 
teered for Experiment I. Although some were 
first-year students of psychology, none was aware of 
the purpose of the experiment. 

Apparatus. The words used by McGinnies and 
Sherman served as material. Twenty-two words in 
all were used, 18 neutral and 4 taboo words: 
neuttal—BRAND, CABLE, FLUSH, FROCK, GLIDE, GRIND, 
HOUND, LEGAL, LUCKY, OUTER, PHONE, QUEST, RANCH, 
RIDER, SCENT, SPRAY, TOWEL, WEAVE; taboo—srrcH, 
PENIS, RAPED, WHORE. j 

The neutral words had been equated by McGinnies 
and Sherman for frequency of usage on the Thorn- 
dike-Lorge (1944) word count (between 18 and 28 
per million). 

The words were printed singly on slides and 
projected onto a screen, where they appeared in 
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block capitals 2 inches high. The subject sat at a 
distance of 5 feet from the screen which was illum- 
inated at all times at an intensity of 1 footcandle = 
when the projectors were off and 3 footcandles when 
a pretask word was being shown. 

Two Liesegang projectors were used, each fitted 
with a timing device which enabled the experimenter 
to present the pretask word via one projector for 
a period of 2 seconds; immediately this shutter 
closed, a shutter on the other projector was simul- 
taneously opened to expose the task word for .75 
second. The brightness of the latter projector could 
be altered by means of a Variac. 

Procedure. Three pairs of words were chosen at 
random for each subject from the neutral series and 
used to determine the subject’s approximate recog- 
nition threshold. The shutter speed was held constant 
at .75 second and the brightness adjusted to a point ` 
at which the subject was just able to identify the 
words. 

Four of the remaining 12 neutral words were 
chosen at random for each subject and with the 4 
taboo words made up the pretask series. The re- 
maining 8 neutral words served as task words and 
were paired with the pretask words at random. The 
order of presentation of the 8 pairs was again 
determined randomly for each subject. 

For the experiment proper the task words were 
initially presented at a Variac setting 8 volts lower 
than the threshold value, an increase of 2 volts 
occurring on each trial until the words were cor- 
rectly recognized. 

Certain differences between our procedure and 
that of McGinnies and Sherman should be noted. 
During a series of pilot trials in which the appa- 
ratus was tested, it became apparent that an after- 
image of the pretask word frequently interfered with 
the attempt to recognize the task word. In order to 
overcome this the task word was projected 2 inches 
above the position where the pretask word had 
occurred. A fixation point was provided here. 

Second, thresholds were estimated by alterations 
in voltage, and therefore indeterminately in bright- 
ness, and not in exposure time. Both these changes 
from McGinnies and Sherman’s procedure were neces- 
sitated by the unavailability of a mirror tachisto- 
scope. 

A third change was introduced after observations | 
of the pilot subjects had convinced us that they were 
not bothering to read the pretask words at all but ` 
were concentrating solely on the task words. These 
subjects were frequently unable to report the pre 
task word at the end of a trial. We attempted t0 
overcome this difficulty by first presenting the pre- 
task word alone and asking the subject to read it 
silently to herself. It was then presented again 2 
seconds later while the subject watched the fixation 
point where the task word would appear. Tbis 
double presentation of the pretask word occurred 0n 
each trial of the experiment. 


Instructions to the subject were as follows: 


` 
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When I say “Ready” a word will appear in the 
center of the screen here for a long enough period 
for you to recognize it easily. I want you to read 
it silently to yourself; there is no need to read it 
aloud. It will then disappear and soon afterwards 
reappear again on the signal “Ready.” This time, 
as soon as it disappears, another word will flash 
on here (indicating the fixation point) and remain 
only for a very brief second. I want you to try 
and identify this second word and tell me what 
you see. Is that clear? Have you any questions? 
We will begin with one or two practice trials to 
show you what happens. 


. 


Results 
Table 1 gives the mean number of exposures 


, required to report correctly the task words. It 


ĉan be seen that more trials were required to 
report words that followed taboo pretask words. 
A t test of the difference gives a £ value of 2.99, 
p< .01. This statistical procedure is similar to 
that employed by McGinnies and Sherman, who 
also used 20 subjects, and in spite of the differ- 
ences between our procedure and theirs the main 
finding is the same. Thus, the effect appears to 
be a robust one, not unduly dependent on pro- 
cedural niceties. 

Having demonstrated the phenomenon with 
our technique, we proceeded in a second experi- 
ment to explore the effects produced by the use 
of nonsense instead of taboo pretask material. 


EXPERIMENT II 
Method 


Subjects, Eighteen women drawn from the same 
population served as subjects. 

Apparatus. The material used was similar to that 
of Experiment I except that the four taboo words 
were replaced by four 5-letter nonsense words. These 
were obtained by choosing at random eight nonsense 
syllables with an associative value of less than 15% 
from the Trapp and Kausler (1959) lists. The non- 
sense syllables were then combined in pairs by 
dropping the first letter of one syllable and adding 
the remainder as a suffix to another. By this pro- 
cedure the following four words were obtained: 
BIWUJ, MEFIV, WOJUB, ZAHIV. 

Procedure. Approximate thresholds were deter- 
mined in a manner exactly similar to that of Ex- 
periment I. In the presentation proper the pretask 
items consisted of four neutral words and four non- 
sense words, the task words all being neutral. The 
pairing was done at random as before. Instructions 
were identical. 


Results and Discussion 


Table 1 shows the mean number of trials re- 
Soe to identify the task words in Experiment 
a More trials were needed to recognize words 

t followed nonsense words than words follow- 
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TABLE 1 
Exposures REQUIRED TO REcoGNize Task Worps 
IN THE THREE EXPERIMENTS 
i Following 
Following 
fides taboo 
pretask word ee SEa | ¢ 
M 
Experiment I 19.55 23.55 1.34 | 2.99%* 
Experiment IL 18.83 21.39 0.95 | 2.68* 
Experiment I| 19.4 19.3 0.78 | 0.19 
* p <.05. 
** p <.01 


ing neutral words. A ¢ test of the difference gives 
a t value of 2.68 and a p < .05. 

The results of this experiment are very similar 
to those of the previous experiment in which 
taboo material was used. Yet in the present case 
the difficulty in recognition was consequent on 
the pretrial exposure of a nonsense word. It does 
not seem feasible to ascribe this raised threshold 
to the generalization of an avoidance response. 
On the other hand, it is not at all clear that it 
can be ascribed to an interruption of the ob- 
server’s set as intimated earlier. If a set to see 
neutral words was disrupted by the occurrence 
of a nonneutral word, whether of the taboo or 
nonsense variety, it might be expected that the 
first example of the contraset material would 
have a greater disrupting effect than later occur- 
rences (Bitterman & Kniffin, 1953; Postman & 
Leytham, 1951). Accordingly we examined the 
number of exposures required to recognize each 
of the critical words (ie., those following a non- 
sense pretask word) in their order of presenta- 
tion, but found no significant differences and no 
evidence of a trend. It might be argued that as 
only 4 of the 22 words were contraset the subject 
had not time to become accustomed to them, but 
it cannot be confidently maintained that an ex- 
planation in such terms is entirely satisfactory. 
A more direct attack on the set hypothesis was 
provided by the third experiment in which the 
subjects were warned beforehand and given prac- 
tice with the contraset material. 


EXPERIMENT IIT 
Method 


Subjects. Twenty subjects were employed; they 
were undergraduate women taken from the same 
population. 

Apparatus. The same material used in Experi- 
ment I was utilized. Six additional words were used 
in the practice trials when the subject’s approximate 
threshold was determined. Three of these words 
were neutral—BASIN, DWARF, LEMON; three were 
taboo—BELLY, PUBIC, SEMEN. The neutral words all 
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occur in the Thorndike-Lorge word count between 
25 and 27 per million, while of the taboo words 
belly only is included, at 10 per million. 

Procedure. In two respects the procedure differed 
from that of the earlier experiments. The practice 
trials now consisted of six pairs of words. Three 
of the pretask words in these trials were neutral 
and three taboo, while in all cases the task words 
were neutral. Each subject thus had experience of 
taboo words as pretask material. 

Second, additional instructions were given to the 
subject after the practice trials. She was told to 
expect during the remainder of the experiment words 
of the same kind as those she had just seen. 


Results 


From Table 1 it can be seen that words follow- 
ing taboo pretask words were not appreciably 
more difficult to recognize than those following 
neutral pretask words; a ¢ test of the difference 
gives an insignificant value of .19. Repetition of 
Experiment I with the addition of three taboo 
words in the practice trials and with additional 
instructions has produced no evidence for the 
generalization of defense. 

It can also be seen from Table 1 that the 
thresholds following the exposure of neutral pre- 
task words are very similar in the three experi- 
ments. This finding indicates that the samples 
of subjects did not differ in their perceptual 
ability and that our assumption of their 
comparability is justified. 


Discussion 


There appear to be two possible explanations 
of the generalization effect reported by McGin- 
nies and Sherman and confirmed here in Experi- 
ment I: either it was due to the persistence 
(generalization in time) of an autonomic avoid- 
ance reaction to threatening stimuli, or it repre- 
sented the disruption of a set to perceive neutral 
rather than taboo material. In Experiment II 
we demonstrated that the phenomenon still 
occurs, and with equal magnitude, in a situation 
where the explanation in terms of an avoidance 
reaction is inappropriate and the alternative 
explanation applies, Finally, Experiment III 
showed that under conditions to which the for- 
mer explanation would be appropriate and an 
explanation in terms of set inappropriate, the 
phenomenon does not occur. Thus, our data 
indicate that an adequate explanation of the 
generalization effect can be made in terms of set. 
The findings of Walters et al. (1959) present no 
additional difficulties for this explanation since 
there‘ is no reason to suppose, and indeed good 
reason to deny, that a set must be consciously 
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established in order to be effective (Luchins, 
1942). 

The exact nature of the set which is disrupted” 
remains unclear. It may be, as Freeman (1954) . 
and others have claimed, that in the experimental ` 
situation naive subjects are generally set to per- | 
ceive neutral rather than taboo words. However, 
the sets commonly demonstrated in the labora- 
tory are highly specific and there are difficulties 
in assuming that a class of stimuli as broad as 
that comprising neutral words could be thus 
benefited. Alternatively it might be argued that ~ 
subjects in this situation are set to perceive a 
task word which is similar in kind to the pre- 
task word: thus, if the pretask word is neutral, — 
or meaningful, subjects expect a neutral, or ° 
meaningful, task word. Such an effect would 
perhaps be analogous to the finding of Cofer 
and Shepp (1957) that visual recognition of a 
stimulus word is facilitated by prior, supra- 
threshold, presentation of a synonymous word. 
A set of this type may be either a relatively 
stable characteristic of the subject’s expectation — 
in an appropriate situation, or, more simply, it 
may have been induced by the uniform pairing 
of the practice words. The results of Experiment 
III do not invalidate either of these possibilities 
since the practice trials included taboo pretask 
words paired with neutral task words. Thus, the 
set to perceive a task word similar in nature to 
its pretask companion would not arise, or if such 
a set were held to be already present it would 
immediately be invalidated. Any decisive state- 
ment of the nature of the set involved must 
therefore await further evidence. 

It is possible that the effect observed in 
Experiment II, although identical with that ob- 
served in Experiment I, has a different cause: 
the occurrence of raised thresholds following 
nonsense words might represent the disruption 
of set, while the raised thresholds following 
taboo words might represent the operation of 
perceptual defense. Such a viewpoint is tenable 
but lacks the advantage of parsimony. It can 
also be argued that perceptual defense cannot 
be expected to occur in Experiment III, as when 
taboo words are made socially acceptable through 
their presentation in practice trials with encour- 
agement to report them they become no longer 
taboo and a defensive reaction becomes no longer 
necessary. 

Thus, we do not claim from the present series 
of experiments to have demonstrated beyond 
controversy that the apparent generalization of 
perceptual defense is a phenomenon of set. But 
it is now apparent that the evidence supplied by 
McGinnies and Sherman is capable of explana- 


tion in other terms and therefore does not, 
through lack of alternatives, enforce acceptance 
ef the hypothesis of a conditioned avoidance 
t Jesponse which through its persistence in time 
_, renders more difficult the subsequent recognition 
t of neutral material. 
oo 
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work relevant to his interests. Accordingly, the Project is sponsoring a series of changes during į 
the next 12-15 months which are designed to enhance scientific communication. These changes — 
are planned as limited trials. Moreover, they are wholly supported by the Project and hence do 
not cut into our page allotment. 

The Editor has agreed to make possible a test of one of the changes, that is, the titles and ` 
authors of accepted papers will be listed following the last articles in the journal. Such listing will | 
make it possible for our readers to become aware of research many months in the advance of its , 
journal publication. At the end of the year’s trial the results of the trial will be evaluated and 
further consideration will be given at that time as to whether the service is worth continuing. 

This listing plus those published in the preceding 1965 issues are the entire backlog of manu- ` 
scripts accepted by this journal through January 1965. 
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s ' A cross-cultural experiment is reported in which the structures of affective mean- 
j ing systems were tested for 24 perceptual signs judged on 10 semantic differ- 


ential scales by 3 groups of Ss with contrasting linguistic and cultural back- 
i grounds. The results of a pancultural factor analysis indicated that 4 semantic 


language/culture groups, 


$ It was also noted that the structure o0 


factors are most salient for these perceptual signs and similarly so across the 3 
namely, dynamism, evaluation, warmth, and weight. 
f meaning spaces for perceptual signs 


differs somewhat from the structure of those for linguistic signs. Scale intercor- 
p relation matrices for the perceptual signs indicated that scale relations are highly 


stable across them within each S group. 


Scale intercorrelation matrices for se- 


lected individual Ss drawn from each of the 3 culture groups also demonstrated 


Using the factor-analytic method, Osgood, 
Suci, and Tannenbaum (1957) were able to 
demonstrate that three factors, which they 
"call evaluation, potency, and activity, are the 
Most salient factors in the meaningful judg- 
ments of humans, In more recent cross-cul- 
tural research, Osgood, Archer, and Miron 
, (1963) have been able to show that people in 
Over 10 different language/culture communi- 
_ ties use quite similar semantic factors. Again, 
the three most salient factors identifiable are 
evaluation, potency, and activity, regardless 
of differences in both language and culture. 
_ Tanaka, Oyama, and Osgood (1963), on the 
other hand, present evidence of what they 
| term j “concept/scale interaction,” in that 
, affective meaning systems are somewhat 


Pans study was planned and carried out as a part 
of Me research project on “Cross-Cultural Generality 
ogy oe Systems” supported by the Human Ecol- 
if) Fund, New York City, for which C. E. Osgood 
aPtinepal investigator. The authors wish to ac- 
 Maedae the contributions to this study made by 
ei S. Miron, Center for Comparative Psycho- 
University Institute of Communications Research, 
» ment ae of Illinois, and Edward E. Ware, Depart- 
© their gy, Psychology, University of Rochester, for 
PP Talay eons on the statistical design; Martti 
ta in aR University of Jyväskylä for collecting 
* for their and; and William May and Sharon Wolfe 
Preciation assistance in computer programing. Ap- 
» nois Statisti also expressed to the University of Illi- 
_ Laborato ical Service Unit and Digital Computer 
i ry for the use of their facilities, without 


which a 
imposi ™mPletion of the study would have been 


h that scale relations are reasonably consistent across Ss, but the between-S con- 
sistency was even higher within language/culture boundaries than across them. 


changed as concept classes are changed. Using 
three different classes of concepts—that is, 
colors, line forms, and abstract words—they 
found that several scales lawfully change their 
affective meaning from concept class to con- 
cept class, causing a change in the structure 
of various semantic spaces. They were also 
able to show that such concept/scale inter- 
action is quite consistent across two subject 
groups differing in both language and culture 
— American and Japanese. In still another 
cross-cultural study, Tanaka (1961) found 
remarkable cross-subject stability of affective 
meaning systems among both American and 
Japanese subjects. Cross-subject stability, 
however, was found to be slightly higher 
within each of the two language/culture 
groups than in a cross-cultural comparison. 
The present research is an extension of 
these previous studies on affective meaning 
systems. In the present investigation, how- 
ever, a somewhat different set of stimulus 
materials is used as concepts to be judged— 
that is, colors, line forms, and the color/form 
combinations, which we generally term per- 
ceptual signs in contrast to linguistic signs. 
Using these perceptual signs, the generality of 
affective meaning systems will be tested across 
three subject groups having a different 
linguistic and cultural base—Americans, 
Finns, and Japanese. i 
Our first interest in the present work lies in 
determining if the same three, most salient 
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semantic factors—that is, evaluation, potency, 
and activity, as have been found in previous 
studies—also appear in the meaningful judg- 
ments of a set of perceptual signs consisting 
of colors, line forms, and their combinations; 
and if these factors appear consistently across 
the three subject groups. Second, we are 
interested in ascertaining if similar affective 
meaning systems are operating in the judging 
of different perceptual signs: that is, would 
the semantic space constructed for the color 
red, for example, be similar to that of the line 
form star or the color/form combination, red 
star? Third, we shall investigate whether indi- 
vidual subjects in different language/culture 
groups use a similar semantic space. If the 
affective meaning systems of humans operate 
lawfully despite the subjects’ different lan- 
guage/culture origin, this would be evidence 
that each subject in each subject group uses 
a semantic space similar to every other sub- 
ject in every subject group. Thus, in sum- 
mary, the purpose of this study is to investi- 
gate the cross-culture, cross-concept, and 
cross-subject generality of affective meaning 
systems among Americans, Finns, and Japa- 
nese in their judging of colors, line forms, 
and the color/form combinations, 


METHOD 
Subjects 


The experiment was run on three different groups 
of subjects—American, Finnish, and Japanese—each 
representing a different language/culture base. The 
American subjects were 47 undergraduate and grad- 
uate students, both males and females, attending the 
University of Illinois, The Finnish subjects were 50 
male and female undergraduates attending the Uni- 
versity of Jyväskylä in Jyväskylä. The Japanese 
subjects were 53 male and female undergraduates at 
the Kuwazawa College of Arts and Design in Tokyo. 
To avoid possible bias resulting from excessive ex- 
posure to, foreign culture, it was desired that the 
subjects be monolinguals, at least with respect to 
English, and have no experience with foreign lan- 
guage other than ordinary training in school. Those 
who were born and brought up in a foreign country 
were therefore not included in the sample. Each 
experimental group is assumed to be reasonably 
representative of each of the three sample cultures, 
apart from the relatively high education level which 
was common to all groups. 


Stimulus Materials 


The perceptual signs used as concepts in the 
present work consisted of 4 colors, 4 line forms, and 
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the “component” color and form concepts--as" 
dictated by consideration of codability, an attempt 
being made to sample the range of high-codable ani 
low-codable, color and form alternatives. The notion , 
of codability, which was originally developed from, 
the work of Brown and Lenneberg (1954) on colors | 
for English speakers, was extended in the present + 
work to line forms and to different languages (ie, a 
Finnish and Japanese) as well. The component per- _ 
ceptual signs selected for the experiment included » 
two colors with high codability (red and green) and 
two colors with low codability (yellow green and — 
red purple), two high-codable forms (star and ui 
circle), and two low-codable forms which are non- 
sense figures generated by the Attneave and Ar, 
noult (1956) method. The resultant color icin 7 
combinations can be classified into the following’. 
four categories in terms of component codabilty: 4 
high-codable color combined with high-codable form, 
high-codable color combined with low-codable form, 1 
low-codable color combined with high-codable form, 
and low-codable color combined with low-codable 
form. 

For the experiment, the color materials were ob. 
tained from the All-Color Company, Limited, i 
New York. The colors were represented by four 
color sheets, approximately 14 X 20 centimeters 1M. 
size. The forms were drawn in black ink on thick 
white construction paper, approximately 20 X 22) 
centimeters in size, For the combinations color and 
form components were combined according to 4 1, 
Latin square design: the appropriate color sheet 
were cut into the appropriate forms, the size of each 
form being identical to the original component. For 
the individual colors and the color/form combina- 
tions, a neutral grey background paper (approx 
mately 6.0 in Munsell value) was used. For protec 
tion, all the stimulus items were covered with a a 
transparent polyethylene sheet. Finally, Munsell © 
notations of color materials are 5.0 G 5/8 for gree 
5.0 R 4/14 for red, 5.0 YG 8/2 for yellow gree | 
and 5.0 RP 6/2 for red purple. The two monsenst M 
figures generated by the Attneave-Arnoult method”) 
are shown in Figure 1. 


Measuring Instrument : 
In order to index the affective meaning of wa) 
ceptual signs, a 10-scale semantic differential } in 
constructed. Three scales representing each of | 
three most salient affective factors, evaluatio™ 


2 The same data obtained from the American, ry f 
nish, and Japanese groups and used for the analys” 
reported in the present study were also used, ba } 
separate analysis, for testing the congruity hypothe gi 
developed by Osgood et al. (1957). In the oA 
gruity tests, the meaning of a combination con’? 
was predicted from the meaning of a color an u 
form component concept. The sampling of stimu of 
concepts was therefore dictated by consideration k l 
the “congruity” design. 


ai 


cy, and activity, were included, along with 


Me familiar-unfamiliar scale. The nine factorial 
scales were selected from materials being obtained 
and used in a larger project (cf. Osgood et al., 1963). 
This Project is being carried out in 16 Janguage/ 
Haute communities, which includes the United 
States, Finland, and Japan. In the larger study, 
qualifiers were elicited within each community as 
‘Tesponse to 100 common, translation-equivalent 
4 bstantives and transformed into scales by the 
tation of qualifier-opposites. 

~ For present purposes, three typical scales for each 
‘Of the three factors were selected on the basis of the 
1 ee ic results obtained in the larger project 
: i American, Finnish, and Japanese groups. 
% pring desiderata were established for scale 
ion: the scale should display reasonably high 

Ris a factor loadings on the relevant factors; and 
vee poe be available as translation equivalents 
. ip three language/culture groups. For exam- 
yá ‘UTIFUL-UcLY for English, KAUNIS-RUMA for 
eo KIREINA-KITANAT for the Japanese are 
Bec equivalent and load highly on the evalua- 
ict th in all the three groups. In other words, 
e scales is assumed to be approximately 
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Fic. 1. Examples of the two nonsense figures used in the experiment. 


similar across the three groups, not only in terms of 
lexical meaning but also with regard to affective 
meaning, The 10 scales, with FAMILIAR-UNFAMILIAR 
included, were then randomized once as to order 
and direction, as displayed in Table 1. The same 
order and direction were used in all language/ 
culture groups. Booklets of 12 pages were assembled, 
each page consisting of two sets of the 10 scales in 
the language appropriate to each group to be 
studied, thus providing semantic differentials for 24 
concepts—8 component and 16 combination con- 


cepts. 


Procedure 


The experiments were performed in Urbana, Mi- 
nois, in Jyväskylä, Finland, and in Tokyo, Japan. 
Instructions for the experiment in the subjects’ own 
language were both attached to the semantic- 
differential booklets and read to the subjects in the 
American and Japanese groups; in Finland, the sub- 
jects were given oral instructions only. The English 
instructions paralleled those suggested by Osgood et 
al. (1957, pp. 82 ff.) and these were freely translated 


into Finnish and Japanese. 


TABLE 1 


FINNISH, AND JAPANESE SCALES 


American 


Finnish 


Japanese 


UGLY- 
BEAUTIFUL RUMA-KAUNIS 


KITANAI-KIREINA 


_ STRONG-WEAK 

_ SLOW-FAST 
PLEASANT-UNPLEASANT 
LIGHT-HEAVY 
HOT-COLD 
SAD-HAPPY 
~ HARD-SOFT 
T PAL EXCITABLE 
MILIAR-UNFAMILIAR 


VAHVA-HEIKKO 

HIDAS-NOPEA Ass 
MIELLYTTAVA-EPAMIELLYTIAVA 
KEVYT-RASKAS 

KUUMA-KYLMA 
MURHEELLINEN-ONNELLINEN 
KOVA-PEHMEA 
RAUBALLINEN-KIIHTYNYT 
UTTU-TUNTEMATON 


TSUYOI-YOWAI 
OSOI-HAYAI 
KIMOCHIYOI-KIMOCHIWARUI 
KARUI-OMOI 
ATSUI-TSUMETAI 
KANASHII-URESHIL 
GANJONA-MOROI 
ODAYAKANA-HAGESHIL 
NAJIMIARU-NAJIMINAT 
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semantic factors—that is, evaluation, potency, 
and activity, as have been found in previous 
studies—also appear in the meaningful judg- 
ments of a set of perceptual signs consisting 
of colors, line forms, and their combinations; 
and if these factors appear consistently across 
the three subject groups. Second, we are 
interested in ascertaining if similar affective 
meaning systems are operating in the judging 
of different perceptual signs: that is, would 
the semantic space constructed for the color 
red, for example, be similar to that of the line 
form star or the color/form combination, red 
star? Third, we shall investigate whether indi- 
vidual subjects in different language/culture 
groups use a similar semantic space. If the 
affective meaning systems of humans operate 
lawfully despite the subjects’ different lan- 
guage/culture origin, this would be evidence 
that each subject in each subject group uses 
a semantic space similar to every other sub- 
ject in every subject group. Thus, in sum- 
mary, the purpose of this study is to investi- 
gate the cross-culture, cross-concept, and 
cross-subject generality of affective meaning 
systems among Americans, Finns, and Japa- 
nese in their judging of colors, line forms, 
and the color/form combinations. 


METHOD 
Subjects 


The experiment was run on three different groups 
of subjects—American, Finnish, and Japanese—each 
representing a different language/culture base, The 
American subjects were 47 undergraduate and grad- 
uate students, both males and females, attending the 
University of Illinois. The Finnish subjects were 50 
male and female undergraduates attending the Uni- 
versity of Jyväskylä in Jyväskylä. The Japanese 
subjects were 53 male and female undergraduates at 
the Kuwazawa College of Arts and Design in Tokyo. 
To avoid possible bias resulting from excessive ex- 
posure to foreign culture, it was desired that the 
subjects be monolinguals, at least with respect to 
English, and have no experience with foreign lan- 
guage other than ordinary training in school. Those 
who were born and brought up in a foreign country 
were therefore not included in the sample. Each 
experimental group is assumed to be reasonably 
representative of each of the three sample cultures, 


apart from the relatively high education level which 
was common to all groups. 


Stimulus Materials 


The perceptual signs used as concepts in the 
present work consisted of 4 colors, 4 line forms, and 
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the 16 color/form combinations, resulting from com- 
bining each color with each form.? The selection of)’ 
the “component” color and form concepts,.7ms! 
dictated by consideration of codability, an attemp | 
being made to sample the range of high-codable and 
low-codable, color and form alternatives. The notion’ 
of codability, which was originally developed from, 
the work of Brown and Lenneberg (1954) on colon 
for English speakers, was extended in the present 
work to line forms and to different languages (ie, 
Finnish and Japanese) as well. The component per-), 
ceptual signs selected for the experiment included 
two colors with high codability (red and green) and 
two colors with low codability (yellow green and 
red purple), two high-codable forms (star and 
circle), and two low-codable forms which are non- 
sense figures generated by the Attneave and Ar 
noult (1956) method. The resultant color/form 
combinations can be classified into the following 
four categories in terms of component codability: 
high-codable color combined with high-codable form, f 
high-codable color combined with low-codable form, 
low-codable color combined with high-codable form, 
and low-codable color combined with low-codabl 
form. l 

For the experiment, the color materials were ob 
tained from the All-Color Company, Limited, iN 
New York. The colors were represented by fou 
color sheets, approximately 14 X 20 centimeters i 
size. The forms were drawn in black ink on thick 
white construction paper, approximately 20x tt 
centimeters in size. For the combinations color anl 
form components were combined according t0 4 
Latin square design: the appropriate color sheet 
were cut into the appropriate forms, the size of ad 
form being identical to the original component. Foti 
the individual colors and the color/form combine) 
tions, a neutral grey background paper (appros 
mately 6.0 in Munsell value) was used. For prote 
tion, all the stimulus items were covered with at 
transparent polyethylene sheet. Finally, 
notations of color materials are 5.0 G 5/8 for gt 
5.0 R 4/14 for red, 5.0 YG 8/2 for yellow green 
and 5.0 RP 6/2 for red purple. The two nonsens 
figures generated by the Attneave-Arnoult metho 
are shown in Figure 1. 


Measuring Instrument 


In order to index the affective meaning © He 
ceptual signs, a 10-scale semantic differential tt 
constructed. Three scales representing each of ion 
three most salient affective factors, evaluatiO® 


ey) 
2 The same data obtained from the American, a 
nish, and Japanese groups and used for the Bee a 
reported in the present study were also used: je 
separate analysis, for testing the congruity hyp! ot 
developed by Osgood et al. (1957). In the ~d 
gruity tests, the meaning of a combination con! 
was predicted from the meaning of a color am 
form component concept. The sampling of sti” 
concepts was therefore dictated by consideration © 
the “congruity” design. 


= 
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Fic, 1. Examples of the two nonsense figures used in the experiment. 


„potency, and activity, were included, along with 
the familiar-unfamiliar scale. The nine factorial 
_ Seales were selected from materials being obtained 
‘and used ina larger project (cf. Osgood et al., 1963). 
This project is being carried out in 16 language/ 
kupie communities, which includes the United 
tates, Finland, and Japan. In the larger study, 
ees were elicited within each community as 
e to 100 common, translation-equivalent 
lbstantives and transformed into scales by the 

_ elicitation of qualifier-opposites. 
a eet purposes, three typical scales for each 
z E X factors were selected on the basis of the 
(Gi ave ytic results obtained in the larger project 
» The f a merican, Finnish, and Japanese groups. 
eae desiderata were established for scale 
ne ay pes scale should display reasonably high 
Mischa ae loadings on the relevant factors; and 
aan he i be available as translation equivalents 
Rie ree language/culture groups. For exam- 
piia TIFUL-UGLY for English, KAUNIS-RUMA for 
a EEA KIREINA-KITANAT for the Japanese are 
five factas equivalent and load highly on the evalua- 
Er ik in all the three groups. In other words, 
the scales is assumed to be approximately 


similar across the three groups, not only in terms of 
lexical meaning but also with regard to affective 
meaning. The 10 scales, with FAMILIAR-UNFAMILIAR 
included, were then randomized once as to order 
and direction, as displayed in Table 1, The same 
order and direction were used in all language/ 
culture groups. Booklets of 12 pages were assembled, 
each page consisting of two sets of the 10 scales in 
the language appropriate to each group to be 
studied, thus providing semantic differentials for 24 
concepts—8 component and 16 combination con- 


cepts. 


Procedure 


The experiments were performed in Urbana, Tli- 
nois, in Jyväskylä, Finland, and in Tokyo, Japan. 
Instructions for the experiment in the subjects’ own 
language were both attached to the semantic- 
differential booklets and read to the subjects in the 
American and Japanese groups; in Finland, the sub- 
jects were given oral instructions only. The English 
instructions paralleled those suggested by Osgood et 
al. (1957, pp. 82 ff.) and these were freely translated 
into Finnish and Japanese. 


e TABLE 1 
$ TRANSLATION-EQUIVALENT AMERICAN, FINNISH, AND JAPANESE SCALES 
Saeed USED IN THE EXPERIMENT 
American Finnish Japanese 


UGLY-BEAUTIFUL 

STRONG-WEAK 

SLOW-FAsST 

P 

Te ASANT-UNPLEASANT 
IGHT-HEAVY 

HOT-coLp 

SAD-HAPPY 

HARD-SOFT 

SAUA EXCITABLE 

TLIAR-UNFAMILIAR 


RUMA-KAUNIS 
VAHVA-HEIKKO 
HIDAS-NOPEA 


KEVYT-RASKAS 
KUUMA-KYLMA 


KOVA-PEHMEA 


MIELLYTTAVA-EPAMIELLYTIA) 


MURHEELLINEN-ONNELLINEN 


RAUHALLINEN-KIHTYNYT 
PUTTU-TUNTEMATON 


KITANAI-KIREINA 
TSUYOI-YOWAI 


. OSOI-HAYAT 
AVA KIMOCHIYOI-KIMOCHIWARUT 
KARUI-OMOI 
ATSUI-TSUMETAI 


KANASHII-URESHII 
GANJONA-MOROI 
ODAYAKANA-HAGESHIL 
NAJIMIARU-NAJIMINAT 
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The 24 perceptual signs were shown to the sub-* were factored, since interest centered 


jects one by one, the duration of exposure for each 
being approximately 1.5 minutes—or until all the 
subjects had completed checking all 10 scales for 
that item. All the testings were performed in 
groups, The subjects in each subject group first 
rated the four colors and then the four forms; these 
were followed by the 16 color/form combinations, 
The order of the 24 perceptual signs was constant 
for all the subject groups. 


RESULTS 
Scale Intercorrelations and Factor Analysis 


Factor analyses in the present work were 
carried out in order to determine the structure 
of the judgmental universe and in order to 
obtain evidence as to whether the same judg- 
mental criteria are used across the three 
language/culture groups, across the 24 con- 
cepts differing in psychophysical attributes 
(i.e., colors, forms, and color/form combina- 
tions, varying in their respective codability 
factors), and across individual subjects se- 
lected from the three language/culture groups. 
Only the intercorrelation matrices for scales 


TABLE 2 


Frest THREE VARIMAX ROTATED PRINCIPAL COMPONENT FACTORS, 
BY LANGUAGE/CULTURE GROUPS 
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ing the extent of stability with wh 
scales are used in various semantic 
Principal component solutions and 
rotations were used throughout. 


Factor Analysis of the Cultural Space 


This analysis basically took the 
three sets of factor analyses and 
intercorrelations of the scale means i 


correlations is based upon an N of 24 € 
for which the basic entry was the mei 
value of each of the three subject g 
Since the present analysis thus deals 
semantic space which is considered for 
of the language/culture groups as a who 
will be called a “cultural” semantic 
be compared with other spaces w 
analyzed in subsequent sections. 
These correlations were then fac’ 
lyzed using the principal components 
The principal components solution i; 


Factor I Factor II Factor III 
American 
PLEASANT-UNPLEASANT 97 HARD-SOFT 95 HOT-COLD 
BEAUTIFUL-UGLY 95 HEAVY-LIGHT 92 
FAMILIAR-UNFAMILIAR 93 STRONG-WEAK 87 
HAPPY-SAD 73 FAST-SLOW 86 
EXCITABLE-CALM 97 
37% 47% 
Finnish 
BEAUTIFUL-UGLY 97 HEAVY-LIGHT 96 
5 EXCITABLE-CALM 

PLEASANT-UNPLEASANT 96 STRONG-WEAK -88 HARD-SOFT 
HAPPY-SAD 94 FAST-SLOW 
FAMILIAR-UNFAMILIAR 84 
HOT-COLD 74 

46% 19% 

Japanese 
FAMILIAR-UNFAMILIAR -96 HOT-COLD 76 EX 
$ CITABLE-CALM 

PLEASANT-UNPLEASANT 94 HAPPY-SAD 76 Shera 
BEAUTIFUL-UGLY -93 HEAVY-LIGHT 72 STRONG-WEAK 
HARD-SOFT -10 

41% 24% 


a Percentage of total variance, 
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TABLE 3 
First Four UNROTATED PRINCIPAL COMPONENT FACTORS OF COMBINED 


AMERICAN, FINNISH, 


AND JAPANESE SCALES 


Factor I (55%)* Factor II (20%). Factor III (12%) Factor IV (7%) 
American 

STRONG-WEAK .95 | FAMILIAR-UNFAMILIAR | .61 | COLD-HOT 51 
HAPPY-SAD .95 | SOFT-HARD .54 | PLEASANT-UNPLEASANT] .45 
PAST-SLOW .94 | PLEASANT-UNPLEASANT] .44 
HEAVY-LIGHT .87 | BEAUTIFUL-UGLY 43 
EXCITABLE-CALM 84 
HOT-COLD 84 £ 

Finnish 
STRONG-WEAK .89 | FAMILIAR-UNFAMILIAR | .81 | COLD-HOT .74 | HEAVY- ht) 
FAST-SLOW ‘85. | PLEASANT-UNPLEASANT| .78 | SAD-HAPPY 50 LIGHT 
EXCITABLE-CALM .66 | SOFT-HARD 15 SLOW- Al 
HAPPY-SAD .64 | BEAUTIFUL-UGLY 12 FAST 
HOT-COLD 63 SOFT- AL 
HEAVY-LIGHT 54 HARD 

Japanese 
STRONG-WEAK (95 | FAMILIAR-UNFAMILIAR | .57 | COLD-HOT .69 | HEAVY- .60 
HARD-SOFT .92 | CALM-EXCITABLE "36 | PLEASANT-UNPLEASANT| .48 LIGHT 
HAPPY-SAD .83 | LIGHT-HEAVY 39 SLOW- 42 
BEAUTIFUL-UGLY 82 FAST 
PLEASANT-UNPLEASANT | .78 
PAST-SLOW 74 


^ Percentage of total variance. 


matically unique, in that it represents a de- 
scription of the maximum variance that can 
be extracted. A total of five factors were 
extracted from each of the language/culture 


` group scale intercorrelation matrices. Three 


of these factors * were then chosen for subse- 
quent, orthogonal rotation using the varimax 
criterion devised by Kaiser (1958). The ro- 
tated factor structures for the three “cultural” 
Spaces are displayed in Table 2. 

In the rotated data, despite some shuffled 
Scales in the separate analyses, the three fac- 
tors are generally identifiable as evaluation, 
Potency, and activity (or dynamism, a com- 


‘ nation of potency and activity), which are 
“he well-established, most salient factors 


t praet be pointed out, however, that all these 
to TA 1, ors in the Japanese group alone were found 
consider rao roots greater than unity and were thus 
Orato as the most salient factors present in the 
only t on data. In the American and Finnish groups, 

Wo factors were found to have latent roots 


greater thay 3 7 e z 
sideratum, n unity and meet with this unique de- 


regularly obtained from analyses of meaning- 
ful verbal stimuli by Osgood et al. (1963, 
1957). Although the choice of scales in the 
present investigation was dictated by the 
expectation that the same basic factors would 
appear in the present situation, and actually 
was based upon the American, Finnish, and 
Japanese factor-analytic results accruing from 
the verbal stimuli used in cross-cultural studies 
(cf. Osgood et al., 1963), the fact that they 
tend to maintain this structure when colors, 
forms, and their combinations are judged, sug- 
gests that a similar, if not identical, system of 
affective meaning can also be extended to the 
new types of stimulus concepts used in the 
present study. 

On the other hand, since the choice of fac- 
tors to be rotated for more meaningful in- 
terpretation is still largely a rule-of-thumb 
procedure, and since the problem is here con- 
founded by the necessity of finding an analytic 
procedure that can defensibly be applied to 
all of the “cultural” spaces simultaneously, a 
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pancultural factorization of all scales was car- 
ried out, following Osgood et al. (1963). 

This analysis is pancultural in that each 
scale is correlated with each other scale over 
the 24 common concepts regardless of the 
scales’ language of origin. A 30 X 30 inter- 
correlation matrix (i.e., 10 scales X 3 groups) 
is thus generated for the scales, putting the 
data from all three language/culture com- 
munities into the same mathematical space. 
The factorization of the scale intercorrelation 
matrix was carried out, again using the prin- 
cipal component solution, and five factors 
were extracted. Four of these factors were 
found to account for 94% of the total vari- 
ance, and to have latent roots greater than 
1.00. They were thus considered to be psycho- 
logically meaningful. These four factors were 
rotated, again using the varimax method. The 
results of this pancultural factorization are 
presented in Table 3 for the unrotated data 
and in Table 4 for the rotated data, with 
translations given for convenience in making 
comparisons, 

The four factors in Table 4 can be readily 
identified as dynamism, evaluation, and what 
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may be called “warmth” and “weight” fac. 
tors. In all subject groups, Factor I is char; | 
acterized by the scales STRONG-WEAK, HARD. | 
SOFT, EXCITABLE-CALM, and FAST-SLOW; Fac. 
tor II, by FAMILIAR-UNFAMILIAR, BEAUTIFUL: ¢ 
UGLY, and PLEASANT-UNPLEASANT; Factor | 
III, by Hot-corp; and Factor IV, by HeAvY- | 
LIGHT. ‘a 
Although these results display some differ- | 
ences from most previous verbal studies (Os 
good, 1962; Osgood et al., 1963; Tanaka et 
al., 1963), presumably because of the re, 
stricted concept classes, they nonetheless dis- 
play remarkable consistency across the three. 
subject groups which differ markedly in both 
language and culture. It is also evident in 
Table 4 that coefficients of these pancultural » 
factors overlap in magnitudes to such an ex- 
tent that each of the language/culture groups j 
contributes to each of the factors with striking | 
similarity. For example, the first pancultural ’ 
factor is composed of American, Finnish, and 
Japanese scales having identical affective 
meanings and nearly identical loadings, despite i 
different origins. Dimensions, in other words, | 
cut across all three languages and cultures and 


TABLE 4 


First Four Variax ROTATED PRINCIPAL COMPONENT Factors or COMBINED 
AMERICAN, FINNISH, AND JAPANESE SCALES 


Factor I (29%)s Factor II (29%) Factor III (21%) Factor IV (14%) 
American 
HARD-SOFT -91 | FAMILIAR-UNFAMILIAR | .93 | HoT-coLD 78 | HEAVY- 65 
EXCITABLE-CALM -80 | PLEASANT-UNPLEASANT]| .93 EXCITABLE-CALM A8 LIGHT | 
FAST-SLOW -78 | BEAUTIFUL-UGLY 91 STRONG- 4 
STRONG-WEAK .71 | HAPPY-SAD -74 WEAK h | 
4 
Finnish 4 
EXCITABLE-CALM -91 | FAMILIAR-UNFAMILIAR | .79 | noT-coLp 93 | HEAVY- 8 \ 
HARD-SOFT -88 | BEAUTIFUL-UGLY -58 | HAPPY-sAD 89 LIGHT | 
FAST-SLOW -70 | PLEASANT-UNPLEASANT| .48 STRONG- oy 
STRONG-WEAK 25 WEAK 
Japanese 
EXCITABLE-CALM -89 | FAMILIAR-UNFAMILIAR | .93 | nor-cop .90 | mEavy- | 
FAST-SLOW -88 | PLEASANT-UNPLEASANT) .87 | HAPPY-SAD .67 LIGHT 
STRONG-WEAK .71 | BEAUTIFUL—-UGLY 83 HARD- Al 
SOFT f 
STRONG- 5 r 
WEAK 1] 
i] 
i 


a Percentage of total variance, 
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are almost equally defined by all three; sepa- 


ate factors for Americans, Finns, and Japa- 
nese are not found, as might have been ex- 
„ pected if there had been great intracultural 
’ consistency but cross-cultural uniqueness 
present in the affective use of scales. 


On the other hand, however, if the factors 


; 


-` fouid in this study are compared with those 
obtained in previous studies (Osgood, 1962; 
Osgood et al., 1963), which were concerned 
with diverse concepts in the verbal universe, 

, one finds noticeable differences. In studies 
employing verbal concepts, evaluation is 

_ ,characteristically the dominant factor. How- 

ever, in this study dynamism and evaluative 
factors account for exactly equal amounts of 
the total variance, thus clearly demonstrating 
a relative dominance of dynamism in this 
color/form space. To the best of our knowl- 
_ edge, no previous factor analysis carried out 
` in the verbal domain has shown such dyna- 
mism dominance. Such shifts in factor domi- 
hance are not entirely novel, however. Ta- 
naka et al, (1963) report relative dominance 
of the evaluative factor in the semantic space 
for abstract words, but dominance of a po- 
tency factor for line forms and an activity 
factor for colors, these shifts holding for both 

American and Japanese subject groups. The 

finding of the present work is thus quite 

Consistent with the previous investigation, in 

that perceptual signs produce a somewhat 

different ordering of factor salience than 
linguistic signs. The present investigation also 
demonstrates similarities across the American, 

ı Finish, and Japanese language/culture 
_ Broups, despite this shift in factor salience. 


Cross-Concept Stability Examined by Scale 
Intercorrelations Jor Single Concepts 


In order to compare the affective meaning 
of various concepts, it is necessary to assume 
»» that these concepts have a similar factor struc- 

x WS and the scales have the same, consistent 
affective meaning. If there were no concept/ 
a Interaction whatsoever, that is, if scales 
fih not change their intercorrelations as a 
ction of the concepts being judged, then 

a correlation across corresponding cells of 
E Intercorrelation matrix for one con- 

e8., red) with that of another (eg., 
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star) would be unity. To the extent that there 
is concept/scale interaction, however, scale 
intercorrelations will vary from concept to 
concept, and hence overall correlations be- 
tween the scale intercorrelation matrices will 
decrease. 

To get at this problem, the scale intercor- 
relation matrix was computed over subjects 
for each of the 24 concepts used in this study 
and for each language/culture group. Each of 
the correlations was obtained based upon an 
N of 47 subjects in the American group, 50 
in the Finnish, and 53 in the Japanese, for 
which the basic entries were the obtained 
scale values of individual subjects for each 
concept. A total of 24 scale intercorrelation 
matrices were derived in each subject group. 
Within each subject group, intercorrelations 
among these 24 “concept” correlation matrices 
were computed across corresponding cells. 
Thus, for each language/culture group, a 
24 X 24 concept intercorrelation matrix was 
generated, each cell representing the correla- 
tion of one concept with another. 

The scale intercorrelation matrices for each 
separate color, form, and combination concept 
were examined and the results compared 
across concepts, using these concept-by-con- 
cept correlation matrices. For example, if the 
scale intercorrelation matrix for a color is 
exactly the same as that of a form, the corre- 
lation between the correlation matrices should 
be unity. Similarity between the factor struc- 
tures of two concepts, or cross-concept sta- 
bility, is thus represented by a correlation in 
the 24 x 24 concept correlation matrix. Every 
set of cross-concept correlations—first be- 
tween a color and a form (reyr), second be- 
tween the compound and the color (Yeompyc), 
and third between the compound and the form 
(Yeomp/t) Should be near unity. 

The results clearly support the foregoing 
proposition for all subject groups. From in- 
spection, it is evident: that the scale correla- 
tions are generally high between component 
concepts and their combinations, and equally 
for all three subject groups; that combination 
concepts are slightly more closely related to 
their components than the components to each 
other; and that there appears to be a slight 
but consistent dominance of color over form 
in determining the meaning of combination 
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concepts, as shown by higher r’s between 
colors and combination concepts. 

If we factor the 24 X 24 matrix for each 
culture, we can test the hypothesis of cross- 
concept stability in scale meaning by deter- 
mining the proportion of the variance ex- 
tracted by the first factor. In the ideal case 
of zero concept/scale interaction, all cells in 
the 24 24 matrix would have values of 1.00 
and Factor I would extract 100% of the vari- 
ance, Although we cannot expect this ideal 
state of affairs to hold for empirical data, we 
can determine the closeness of approximation. 
The three 24 x 24 concept-by-concept corre- 
lation matrices were thus factored, using the 
principal component solution. The factor- 
analytic results, obtained for each of the three 
subject groups, clearly confirm the hypothesis. 


TABLE 5 


First THREE PRINCIPAL COMPONENT FACTORS oF 
COMBINED AMERICAN, FINNISH, AND JAPANESE 
SELECTED SUBJECTS 


Subject Factor I | Factor II | Factor III 
American 
F 85 .06 —.45 
F 94 —.08 = 15 
F .85 13 — 49 
F .93 —.06 =.25 
F .92 —.16 —.01 
M 92 16 —.15 
M 13 — AT —.10 
M 86 03 —.46 
ni ree —.22 =19 
—.16 08 
Finnish 
F -80 —.18 —.03 
F 92 —.12 -20 
F 92 =ô =12 
F 87 =.35 24 
F 85 Gikk] 15 
M 84 —.37 22 
M 85 =,26 23 
M 90 —.01 05 
M 92 —.09 20 
M 82 —.01 31 
Japanese 
E 81 50 17 
F .85 38 13 
F 85 42 .08 
F 82 SI ll 
F 86 21 19 
M 91 —.03 —.03 
M 89 14 09 
M 88 34 el 
M 91 :08 .06 
M 81 30 18 
Percentage of 
variance 76 J 5 


Note, —F = female, M = male, 
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In every group, Factor I accounts for more 
than 90% of the total variance with all the 
concepts in the matrix loading very highly ön 
it, the next factor in order of magnitude ac- 
counting for only 2% of the variance. No ro- 
tation of these factors was necessary because — 
only Factor I, and it alone, was found to 

have a latent root greater than 1.00. It is thus © 
clear that scales’ stability is extremely high 

across the concepts—colors, forms, and their 

combinations, differing in the physical at- | 
tributes and cognitive familiarity—and for 
every language/culture group. 


Cross-Subject Stability Examined by Scale 
Intercorrelations for Each Subject 


An additional examination was made to de- 
termine the stability of scale usage across in- 
dividual subjects. The question here is whether | 
or not individual subjects differ significantly 
in their meaning spaces. Previous to the pres- 
ent study, Tanaka (1961) used an analytic 
technique developed by Tucker and Messick 
(1960) for use with scaling data and general- 
ized for use with semantic differential data by 
Ware,‘ and was able to find remarkable cross- 
subject stability among both American and 
Japanese subjects. In this earlier study, scale 
intercorrelations were computed over 29 na- 
tional stereotypic concepts for each of 32 
American and 36 Japanese subjects, the basic 
entry to the original data being 12 scale values 
for the 29 concepts,’ If there were no subject/ 
scale interaction whatsoever—that is, if scales 
did not change their intercorrelations as 4 
function of the subjects doing the judging— 
then, the correlation of the scale correlation 
matrix of one subject with that of another” 
would be 1.00. To the extent that there iS 
subject/scale interaction, however, scale in- 
tercorrelations will vary from subject to sub- 
ject, and hence correlations between the scale 
intercorrelation matrices of subject will de- 
crease. A total of 68 (32 American and 36 
Japanese) scale intercorrelation matrices were 
thus correlated across corresponding cells, and 
the resultant 68 x 68 subject intercorrelation 


4 Personal communication, 1961. 

5 For the concepts and the scales used, and the 167 
sults of the separate “cultural” space factorization for 
the American and Japanese groups as a whole, S¢ 
Tanaka (1962). 
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matrix was then factored, using the principal 
_ component method. The first principal com- 
ponent factor was found to account for 77% 
of the total variance, all American and Japa- 
nese subjects loading high on it. On the other 
hand, inspection of rotated data demonstrated 
that Factor I (23% total variance) can be 
termed a Japanese factor, a majority of the 
Japanese and a few Americans loading high on 
it, while Factor II (21% total variance) can 
be called an American factor, a majority of 
American and a few Japanese subjects loading 
high on this factor. These results clearly in- 
dicated that the consistency of subjects’ mean- 
sing systems is reasonably high across different 
language/culture groups, but is still higher 
within than between the subject groups. 
Turning now to the present investigation, 
10 subjects—5 male and 5 female—were ran- 
domly selected from each of the American, 
Finnish, and Japanese groups, to bring this 
analysis within feasible bounds. For each of 
these 30 subjects, a 10 X 10 scale intercor- 
relation matrix was computed over the 24 con- 
cepts. Then, in the same manner as described 
above, each subject’s scale intercorrelation 
Matrix was correlated with every other sub- 
_ Ject’s scale intercorrelation matrix across cor- 
responding cells, and the resultant 30 X 30 
Subject intercorrelation matrix was then fac- 
tored using the principal components method. 
The results displayed in Table 5 are im- 
Pressive in two ways, as were the results of 
the previous study. First, there is one large 
factor, accounting for 76% of the total vari- 
ance, on which every subject loads most 
highly. This again is a clear indication that 
the 30 subjects used the 10 scales very simi- 
larly. It should be noted that this is true de- 
Spite the fact that differences in culture, lan- 
» guage, and sex are included. Second, however, 
in spite of this general similarity in the sub- 
Ject spaces, some cross-cultural differences do 
appear. Although Factors II and III account 
ay for 7% and 5%, respectively, of the 
otal variance—indeed trivial in comparison 
With Factor I—there are still certain con- 
poe in the subjects’ loading on these less 
AY aoe It must be pointed out that, 
tie ugh the amounts of the extracted vari- 
Ba are small, latent roots for these two less 
Portant factors are still greater than unity 
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TABLE 6 


FIRST THREE VARIMAX ROTATED, PRINCIPAL COMPO 
NENT FACTORS OF COMBINED AMERICAN, FINNISH, 
AND JAPANESE SELECTED SUBJECTS 


Subject Factor I | Factor II | Factor III 

American 
F 35 39 —.81 
F 61 44 —.60 
F .28 43 —.85 
F .55 42 —.68 
F -70 41 —.46 
M .43 6l —.58 
M 15 05 —.44 
M 36 38 —.82 
M 63 27 —.59 
M -14 45 —.40 

Finnish 
F 63 32 — AL 
F 15 52 —.28 
F <5 27 —.56 
F .88 32 —.22 
E .83 30 —,29 
M 87 28 —.22 
M 81 38 —.22 
M -61 53 —.40 
M 73 52 —.28 
M 65 57 —.14 

apanese 

:! F 25 90 —.25 
F 35 82 —.31 
F .30 .83 —.34 
F 23 89 —.31 
F A9 19 —.25 
M -60 50 —.47 
M 1 65 —.35 
M .26 70 —.61 
M 56 ól —.40 
M -39 75 —.24 

Percentage of 
total variance 35 30 22 


Note.—Factor coefficients > |.55| are boldface. F = female 
M = male. 


and they are therefore considered as mean- 
ingful factors in the correlation data. 

From inspection, it is clear that all the 
Finnish and many American subjects load 
negatively, whereas the Japanese subjects load 
positively, on Factor IT. On the other hand, 
the Americans tend to load negatively on Fac- 
tor III, whereas both the Finns and the Japa- 
nese tend to load positively on it. In other 
words, the Americans, Finns, and Japanese 
can be allocated to somewhat different posi- 
tions in a plane defined by Factors II and IIT. 
Are these differences in the locations of the 
subjects in the hyperplane defined by Factors 
JI and III an indication that there are some 
significant differences in the structures of sub- 
ject spaces? 

In an attempt to answer this question, the 
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three principal component factors, which were 
found to have latent roots greater than 1.00, 
were rotated by the varimax method. The re- 
sult, given in Table 6, clearly demonstrates 
cross-cultural distinctions. Now, Factor I is 
predominantly Finnish, every Finn loading 
relatively high on it, Factor II is predomi- 
nantly Japanese, and Factor III is predomi- 
nantly American. A few subjects in each cul- 
tural group are less definable because of rela- 
tively high loadings on factors other than their 
own.® 

In order to ascertain the extent to which 
each factor is unique to one of the three lan- 
guage/culture groups, Mann-Whitney U tests 
were carried out on the rotated data. Our in- 
terest here lies in testing whether the loadings 
of the subjects in one cultural group on one 
factor is significantly greater than those of 
the subjects in other groups on the same fac- 
tor. On each factor, therefore, the 30 subjects 
were ranked according to their factor load- 
ings. Then, a U was computed, for example, 
between the Finnish subjects who load high 
on the Factor I and the Japanese and the 
American subjects combined who load rela- 
tively low on the same factor. A total of three 
U’s were thus obtained. These tests clearly 
demonstrated that the Finnish subjects on 
Factor I, the Japanese on Factor TI, and the 
Americans on Factor III do load significantly 
(p < .001) higher than the subjects in other 
language/culture groups compared. These U 
test results thus confirm our previous assump- 
tion that—despite the great extent of overall 
similarity demonstrated by the unrotated Fac- 
tor I, with the large amount of total variance 
it removed—there are still significant differ- 
ences cross-culturally in the structures of in- 
dividual subject spaces. 

One determinant of such cross-cultural dif- 
ferences may be the different affective mean- 
ings of a few scales, For example, in the fore- 
going analyses of the pancultural spaces, the 
HAPPY-SAD scale was found to interact with 
language/culture groups: it is clearly an 
evaluative scale for the American group as a 
whole, but it is associated to the warmth fac- 
tor for both the Finnish and the Japanese 
groups as a whole. Such differences should be 


® High loadings as used here mean high in absolute 
value, 
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reflected in the factor structures of individual 
subjects’ semantic spaces, when the 10 x 10 


scale intercorrelation matrix for each subject” 


is factored. To test this hypothesis, the scale- 
by-scale intercorrelation matrix for each of 
the 30 subjects was factored, using the prin- 
cipal components method, and the extracted 
factors varimax rotated, only those factors 
having latent roots greater than 1.00 being 
rotated. 

In the rotated data, it was first noted that 
every subject used certain pairs of scales quite 
consistently: for example, for almost every 
subject, BEAUTIFUL-UGLY and PLEASANT-UN- 
PLEASANT, HARD-SOFT and STRONG-WEAK, HOT- 
COLD and EXCITABLE-CALM, tend to cling to- 
gether firmly despite the subjects’ language/ 
culture distinctions and differences in sexes. 
This certainly indicates that general stability 
is working across the subjects’ meaning spaces. 
Second, however, from inspecting the factor 
loadings of the Happy-sap scale in question 
for individual subjects, seven American sub- 
jects were found to use this scale in associa- 
tion with the evaluative factor, and two 
Americans in association with the warmth 
factor. Three Finns used the same scale as an 
evaluative scale and two Finns as a potency 


scale, HAPPy-sap being highly correlated with ` 


STRONG-WEAK in the latter case. For the three 
remaining Finns, Happy-sap loaded equally 
high on both the evaluative and the warmth 
factors. On the other hand, for seven Japa- 
nese subjects, HAPPy-saD was associated with 
the warmth factor, while three Japanese used 
the same scale as an evaluative scale. It is 
reasonably clear, therefore, that the use of 
HAPPY-SAD is somewhat unique cross-cul- 
turally, greater consistencies appearing within 
than between the cultural groups. 


Yet another factor seems to contribute to. 


these cross-cultural differences. Inspecting the 
results of the principal component factor 
analyses for individual subject, it was noted 
that the number of salient factors used does 
change from one subject to another. For ex- 
ample, 7 out of the 10 American subjects were 
found to use only two major factors in their 
judgments of the 24 concepts; that is, for 
these 7 Americans, only two factors displayed 
latent roots greater than 1.00. Three Ameri- 
cans were found to use three meaningful fac- 
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Cross-CULTURE GENERALITY 


tors. On the other hand, both Finnish and 


. Japanese subjects appear to utilize more fac- 


tors, as compared with the Americans. For 
example, only 1 Finn and 2 Japanese were 
found to use two meaningful factors, while 8 
Finns and 6 Japanese were found to use three 
salient factors. One Finn and 2 Japanese were 
found to utilize four factors with latent roots 
greater than 1.00 in their semantic judgments. 

By inspecting several cross-cultural data 
available to the present investigators, Ameri- 
cans appear to utilize consistently a fewer 
number of meaningful factors in their judg- 
ing. For example, a similar result was ob- 


‘tained in the Tanaka (1961) experiment, in 


which 18 (or 55%) out of 32 Americans, as 
compared with 22 (or 61%) out of 36 Japa- 
nese, were found to use more than four mean- 
ingful factors in their judging of 29 national 
stereotypic concepts. In another cross-cultural 
and cross-concept study of semantic spaces, 
Tanaka et al. (1963) noted that the Ameri- 
cans as a group utilized five meaningful fac- 
tors in their judging of colors, four factors for 
abstract words, and six factors for line forms, 
as compared with the Japanese as a group 
utilizing six meaningful factors for all the 
concept classes. In yet another cross-cultural 
research presently being carried out by the In- 
stitute of Communications Research, Univer- 
sity of Illinois (cf. Osgood et al., 1963) slight 
but real differences have been noted among 
the Americans, Finns, and Japanese as groups, 
who judged 100 verbal concepts: out of 10 
Principal axis factors, separately extracted for 
each subject group, 6 factors in the American, 
7 in the Finnish, and 8 in the Japanese were 
found to have latent roots greater than 1.00, 
and considered as meaningful factors in the 
Judging of the 100 concepts. All these separate 
research results are quite consistent with the 
trend described above. Although no immedi- 
ate, diachronic explanation can be given at 
E point, the foregoing results may be an 
aon that the American tends to use 

er criteria of affective meaning than both 

Finnish and the Japanese. 
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In summary, evidence is presented that 
there is high consistency across the subjects’ 
meaning systems, and yet that the consistency 
is even higher within each subject group. 
Since these perceptual sign data demonstrate 
general cross-subject stability across all the 
three language/culture groups, the intracul- 
tural consistency may be due to: subject/ 
scale interactions for a few scales, such as 
HAPPy-sAD—interaction which is greater 
across language/culture boundaries than 
within; and culturally unique, but presum- 
ably less salient, semantic factors present in 
the meaning space for subjects in different 
language/culture communities. 
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INDIVIDUAL DIFFERENCES IN THE RESOLUTION 
OF RESPONSE UNCERTAINTY * 


JEROME KAGAN* 
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This study assessed the generality of a tendency toward fast vs. slow response 
times in 2 different situations: (a) perceptual matching tests in which the 
number of response alternatives was fixed and objectively present for all Ss, 
and (b) a tachistoscopic recognition task in which the response alternatives 
had to be generated mentally. 60 boys and 53 girls in Grades 2 and 3 were 
shown 6 scenes tachistoscopically and response times were recorded. Several 
months later Ss were given 2 perceptual matching tests in which they had to 
select from a large array the 1 stimulus that matched a standard, and response 
times were recorded. Response times to descriptions of the tachistoscopic 
scenes were positively associated with response times on the matching tests. 
Moreover, long response times were positively correlated with the complete- 
ness of S’s drawing of the tachistoscopically presented scenes. It was suggested 
that individual differences in speed of decision making is a pervasive char- 


acteristic that can influence performance in a variety of test situations. 


The influence of information theory upon 
psychological research has taken many forms, 
one of which has been a study of the relation 
between reaction time and degree of stimulus 
uncertainty. Hick (1952) and Hyman (1953), 
for example, found a linear relationship be- 
tween reaction time and information per 
stimulus presentation. But the postulation of 
a simple, functional relation between these 
two variables has been challenged by several 
investigators (Morin & Forrin, 1963; Mow- 
bray & Rhoades, 1959; Seibel, 1962, 1963) 
who have suggested that response uncer- 
tainty, as well as stimulus uncertainty, con- 
tributes variance to response time scores. 
That is, the process of selecting one of many 
response alternatives affects response time 
when the amount of stimulus uncertainty is 
controlled. Existing data indicate dramatic 
individual differences in the time a subject 
takes to select an alternative in problem situ- 
ations with high response uncertainty (Ka- 
gan, in press). Moreover, theorists have con- 
tinually postulated a trait dimension dealing 
with the speed with which individuals react 
in problem situations, a characteristic that 
might be called “behavioral tempo” (Cattell, 
1946; McClelland, 1951). 


1 This research was supported, in part, by Grant 
M-4464 from the National Institute of Mental 
Health, United States Public Health Service. 

2Now at Harvard University. 


This paper summarizes one aspect of an 
extensive investigation of the significance 
and stability of preferred speed of decision 
making among school-age children. Previous 
research (Kagan, in press; Kagan, Rosman, 
Day, Albert, & Phillips, 1964) has studied 
decision time in problem situations in which 


the child is presented with a standard and an , A 


array containing 5-10 similar variants and 
the subject is required to select the one stimu- 
lus that is identical to the standard. These 
studies have revealed remarkable intraindivid- 
ual consistency in speed of decision time 
across varied tasks and stability of decision 
time over short and long periods. As a result, 
a cognitive disposition called reflection-im- 
pulsivity was postulated. It was assumed that 
the stable individual differences in decision 
time to problems with a fixed number of 
alternatives would be manifest in tasks for 
which the subject had to generate the response. 
alternatives mentally. The procedure chosen 
to test this proposition was a tachistoscopic 
recognition task in which the subject was 
shown pictures at exposures ranging from 18 


milliseconds to 3 seconds. In order to maxi- 


mize the effect of differences in speed of de- 
cision making the pictures were constructed 
to contain incongruous elements. The reasons 
for this machination follow. At the initially 
fast exposures no child would be able to dis- 
cern the exact nature of the stimulus or the 
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incongruities within the stimulus. The child’s 
-initial interpretation would be governed by 
, the salient aspects of the external stimulus. 
With increasing exposure, however, the child 
would be forced to acknowledge the incon- 
gruous information. He would have to recon- 
sider his earlier hypotheses and accommodate 


` to the incongruous aspects of the stimulus. It 


was anticipated that some children would re- 
flect over the validity of their interpretation 
when they recognized the incongruity. Others 
would offer descriptions impulsively with 
minimal consideration of the logical relation 
of their current response to earlier interpre- 
"tations or the credibility of the interpreta- 
tion about to be reported. It was expected 
that subjects who normally displayed longer 
response times to the visual recognition tasks 
in which all the alternatives were present 
(called reflective subjects) would also mani- 
fest longer response times to the tachisto- 
scopically presented pictures for which the 
response alternatives had to be generated by 
the child. 


METHOD 


Subjects and Apparatus 


. The subjects were 60 boys and 53 girls from 
Grades 2 and 3 in a public school. Each subject was 
seen for two, 1-hour sessions separated by 1 day. 
The apparatus was a three-field electronic tachisto- 
Scope using mercury argon lamps? 


Tachistoscopic Session 


Each subject was brought into a lightproof room 
at the Institute and dark-adapted for 8 minutes. 
The experimenter indicated that he wanted to see 
how good the subject’s eyes were and a set for 
accuracy of recognition was communicated. Initially, 

* the experimenter exposed three simple stimuli (+, 
4, E) in an ascending exposure series to determine 
the subject’s visual acuity and to acclimate him to 
the apparatus. The first exposure for each practice 
stimulus was 8 milliseconds with 2 footcandles inci- 

* dent on each stimulus. Each succeeding exposure 
was approximately 2 milliseconds longer until the 
subject described the stimulus correctly three suc- 


"s cessive times. Any child who did not recognize all 
`. three stimuli at exposures of 175 milliseconds was 


discarded from the study. Three children were elim- 
inated for this reason, leaving 113 children. Follow- 
ing the three practice items, the experimenter said, 


I am going to show you some pictures, not designs, 
_Tell me everything you see in each picture. These 


® The apparatus was designed by Charles W, Sny- 
der of the Menninger Foundation. 


Fic. 1. Six scenes used in tachistoscopic recognition 
task. 


are pictures that may seem silly to you; for ex- 
ample, here is a picture. Look at it. 


The subject was then shown two incongruous 
pictures (a girl with rabbit ears, a boy in a football 
uniform swinging at a football with a baseball bat) 
to insure that each subject had a set to expect pic- 
tures with incongruous elements. Three test items 
were then administered with a 5-minute recess after 
each item. When the subject returned the next day 
he was shown three new pictures. The six ink-line 
drawings used are illustrated in Figure 1. 

The three pictures in the first session were the 
bird with plane’s fuselage, living room, and the 
chicken with pig’s head. The three pictures pre- 
sented on the second day were the sweater, the dog 
jumping on a boy, and the person in a woman’s 
dress with a man’s head. 

The initial exposure for each picture was 18 milli- 
seconds, and each stimulus was presented for a 
minimum of 18 and a maximum of 30 exposures. 
The exposure times for the first 18 trials increased 
according to the following schedule (in milliseconds) : 
18, 20, 23, 25, 28, 30, 31, 38, 59, 85, 122, 162, 194, 
215, 240, 265, 295, and 320. For Trials 19 through 
30, the average increase in exposure time was .25 
second. 

If the subject described the stimulus accurately 
three times in succession between Trials 18 and 30, 
the series was terminated. The subject was asked to 
describe the picture in detail for each exposure, The 
experimenter made no reference to the speed with 
which the subject was to respond, but periodically 
reminded the subject that his verbal descriptions 
were to be as complete as possible. Thus, a set for 
accuracy and completeness, rather than speed, was 


156 


communicated to the child, After the subject re- 
ported three successive correct descriptions (or 
after 30 trials) he was asked to look at the picture 
once more and then to draw it. The picture was 
exposed at 3 seconds, after which the subject drew 
what he saw on an 8 X 11 inch piece of white paper. 
The subject was given unlimited time to draw the 
picture. 

Variables scored. The variable of major interest 
was the average response time for Trials 1-18 across 
all six pictures, Response time was defined as the 
latency from exposure of the stimulus to the sub- 
ject’s first significant verbalization, and was recorded 
to the nearest half-second by an observer. The reli- 
ability of this time score was obtained in sessions 
during which two observers independently recorded 
response times. The reliability coefficient was .93. 

Two additional variables studied were the com- 
pleteness of the subject’s drawings and his recogni- 
tion threshold for each scene, In order to assess the 
former variable each of the six drawings was given 
a maximum score of 10 points, with specific ele- 
ments receiving scores of 7, 4, ł, or 1 point, de- 
pending upon their salience in the original stimulus. 
Salience was positively correlated with probability 
of occurrence in the drawing. The most salient 
and frequently occurring components received low 
scores; the less obvious and less frequently oc- 
curring components received higher scores. A mean 
score across all six pictures was computed for each 


Fic. 2. Sample items from match- 
ing familiar figures task. 
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subject and this score was correlated with the re- 


sponse time variables. 

In order to score recognition threshold, a set of 
essential attributes for recognition was determined 
for each picture prior to the scoring of any protocol, 
The verbatim protocols were then scored for the 
trial at which the subject first described all the es- 
sential components of the picture and this trial was 
his recognition threshold. The interscorer reliability 
for this variable was .97. 


Perceptual Recognition Tasks 


Each subject was seen subsequently for two addi- 
tional sessions during which two perceptual recog- 
nition tasks were administered. Session 1 occurred 
approximately 3 months after the tachistoscopic ses- 


sion; Session 2, 9 months after the tachistoscopic , ' 


session. 

Session 1: Matching familiar figures. The subject 
was shown a picture of a familiar object (cowboy, 
tree, telephone) and six similar variants, only one 
of which was identical to the standard. The subject 
was asked to select a stimulus that was identical to 
the standard. The variable of interest was the average 
Tesponse time to the first selection across the 12 
test items. Figure 2 illustrates two of the test items. 

Session 2: Haptic visual matching. The subject 
first explored tactually a three-dimensional wooden 
form (approximately 3 inches square) to which he 
had no visual access. He was allowed an unlimited 
time to explore the form. When he withdrew his 
hands he was presented immediately with a visual 
array of five drawings, only one of which repre- 
sented the outline of the form he had explored 
haptically. The 20-item test contained both familiar 
objects (cat, hand, boy, plane) and unfamiliar geo- 
metric forms. The two variables of interest were 
exploration time (time taken to explore the wooden 
form) and response time (time taken to select first 
solution hypothesis to the visual array). 

Verbal skills. Each subject had been administered 
the Vocabulary, Information, and Similarities sub- 
tests from the WISC in a separate session which 
occurred prior to the administration of the haptic 
visual matching task. Each subject’s average scale 
score for the three tests was used in the statistical 
analysis, 


RESULTS 


There were no significant sex or grade 
differences in response time over Trials 1-18 
to any of the six scenes. Furthermore, there 
was a consistent tendency for response times 
to decrease with succeeding pictures, due to 
increased boredom or familiarity with the 
task. There was a positive correlation among 
the average response times to the six pictures. 
The average of 15 correlations for response 
time to each of the six pictures ranged from 
-52 to .64 for the four grade-sex groups, with 
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e TABLE 1 
lod Pu. AVERAGE CORRELATIONS AMONG DECISION TIME SCORES 
Ba z 1 2 3 4 5 
f Response time 
x Tachistoscopic scenes — .40** 30* 20 MRS 
HE Matching familiar .40** — SAS 40** 12 
a! . _ Haptic visual matching Ts .64*** — 69*** 20 
Bee Exploration time 
Haptic visual matching 38** Oot Ki peace’ = 24 
Verbal skills on WISC .20 10 .07 13 = 


: *p <.05, two-tailed. 
F ** p <.01, two-tailed. 
+*+ p <.001, two-tailed. 


‘dn. average correlation of .56. There was, 
7 therefore, a generalized tendency for subjects 
% «= to display short or long latencies across all 
< six scenes, An average response time score for 
- all six scenes was computed for each subject, 
and this variable was put in McCall T score 

i, form. 

The intercorrelations among the five ma- 
jor variables were computed for the four 
grade-sex groups separately (i.e. the three 

| response time scores, exploration time on the 
haptic visual test, and verbal skills). In gen- 
», eral, the direction and magnitude of these 
coefficients were similar for all four groups. 
+ Average correlations (z’ transformations) 
were computed for the two grades, keeping 
the sexes separate. These coefficients appear 
E in Table 1. 

; There was moderately high intertask con- 
‘| “sistency in response times across the two rec- 
== ognition tasks that contained a fixed number 
+, of response alternatives (r =.57, .64). The 
critical set of correlations, however, related 
response time on each of the fixed alterna- 
. tive tasks with response time to the tachisto- 
« Scopic scenes in which the alternatives were 
generated mentally. All four correlations were 
`e Significant at p < .05 or better for two-tailed 
tests (r = 40, .30, .40, and .32). Moreover, 
, . even exploration time on the haptic visual 
a`. task was positively related to decision time 
. to the tachistoscopic scenes (r = .20 for boys; 

. +38 for girls). 
Verbal skills were independent of response 
- time on the two fixed alternative tasks (r 
|:  tanged from .07 to .20), but displayed a mod- 
43 erately positive relation with response time 
to the tachistoscopic scenes (r = .33, p < .01 


Note.—Boys to right and above diagonal, girls to left and below diagonal. 


for boys; r = .20 for girls). The direction of 
this relationship may come as a surprise to 
some readers. For rich verbal resources might 
be expected to facilitate rapid conceptualiza- 
tions of the scenes; whereas, an inadequate 
vocabulary might be expected to result in a 
more halting performance. These data indi- 
cate that the verbally precocious children 
delayed longer before describing the scenes 
than the children of average or below average 
verbal ability. Since response latency in this 
situation is regarded as a partial index of de- 
gree of response uncertainty, it is suggested 
that the verbally proficient children generated 
more response possibilities and, consequently, 
required more time to select a final interpre- 
tation. The number of alternatives were fixed 
in the matching familiar figures and haptic 
visual tasks and, as a result, hypothesis gener- 
ation and verbal skills should not be factors 
in decision time.* 


Relation of Drawing Scores to Decision Time 


The correlations between drawing score and 
each of the response time variables were simi- 
lar in direction and magnitude for the sexes 


4 Additional data on 40 subjects in Grade 8 (age 
13-14) support the notion that increased verbal 
skills lead to longer decision times in describing the 
tachistoscopic scenes. Twenty boys and 20 girls 
were tested individually with the same stimuli used 
with the younger group. The average response time 
for the older subjects averaged .5 second longer 
than the younger children. The differences between 
the older and younger subjects were significant for 
Pictures 2, 3, 5, and 6 (p< 02 for each), but just 
short of significance for Pictures 1 and 4. When the 
mean response time across all six pictures was com- 
puted, the difference between the older and younger 
subjects bordered on statistical significance (p = .08). 
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and grades and average 7’s were computed. 
The drawing score was significantly related to 
response time on matching familiar figures 
(average r = .31, p < .001); and the tachisto- 
scopic scenes (average r= .22, p < .05), as 
well as exploration time on the haptic visual 
task (r = .21, p < .05), The correlation with 
response time on the haptic visual task was 
only .08. Drawing score was unrelated to 
verbal skills for boys (r= .13), but posi- 
tively associated for girls (r = .56). There 
was, therefore, a moderately positive relation 
between completeness of the child’s drawings 
and his tendency to display long decision 
times. This relation was strongest for the 
matching familiar figures task which was 
more difficult than the haptic visual test. 

It is assumed that children who manifested 
long decision times on matching familiar fig- 
ures were prone to reflect upon the relative 
validity of their cognitive products; the longer 
response times were the result of considera- 
tion of the differential accuracy of the re- 
sponse alternatives available. It is believed 
that the completeness of a subject’s drawings 
was facilitated by the tendency to conduct an 
internal dialogue while executing the drawing. 
It is presumed that the reflective child con- 
tinued to ask himself if all the components 
remembered from the tachistoscopic exposure 
were being included in his drawing. More- 
over, the reflective child would be likely to 
consider his drawing after its completion, 
matching his memory of the scene to the 
product just completed. It is suggested that 
reflective children matched continually their 
image of the input to the transformations gen- 
erated from their original perceptions. This 
interpretation is, of course, conjectural, and 
the argument would be strengthened if the 
reflective children reported that they were 
performing the mental matching attributed to 
them. Unfortunately, young children have 
great difficulty verbalizing the cognitive com- 
ponents that accompany their actions. Some 
of the children were asked after completion 
of a drawing what they were thinking while 
they were reproducing the stimulus. The 
typical answers were ambiguous, “I don’t 
know; I was trying to remember the picture; 
I tried to do my best.” Short of putting words 
in their mouths, it was not possible to elicit 


a helpful description of their thoughts during 
either the recognition or drawing tasks. , 


Relationship of Recognition Thresholds to 
Drawing Score and Decision Time 


The recognition threshold distributions 


were skewed, and, as a result, were split at 


the median, and phi coefficients computed 
to assess the relationship among recognition 
threshold across the six pictures. These co- 
efficients were surprisingly low, for only 6 of 
the 60 coefficients were significant, and two- 


thirds of the coefficients were between — .15 __ 


and + .15. There was, therefore, no generai 
intraindividual disposition to recognize most 


of the pictures either early or late. For this , 


reason the average recognition score across 
the six scenes was not computed. Point bi- 
serial correlations were computed relating rec- 
ognition threshold for each of the six scenes 
(the dichotomous variable) with drawing 
score and decision times (continuous vari- 
ables). 

There was no systematic relation between 
recognition thresholds for Scenes 1 through 5 
and drawing scores (point biserial 7 ranged 
from — .42 to + .11 with 60% of the coeffi- 
cients falling between — .12 and + .12). Pic- 
ture 6 yielded a significant negative relation 


between drawing score and recognition thresh- ~ 


old for the younger boys (r = —.35) and the 
older girls (r = — .51), but a negligible re- 
lation between these two variables for the 
other two groups (r = .01, — .24). 

The statistical independence of recognition 
thresholds and drawing score corroborates a 
frequent observation made during the testing 
sessions. Some children described the stimu- 
lus perfectly on the last exposure, but omitted 
from their drawing components they had de- 
scribed verbally a moment earlier. These 
omissions appear to be the product of a non- 
reflective or impulsive orientation, a reluc- 
tance to match the image of an event to th2 
response intended to describe it, or to consider 
the validity and completeness of one’s re- 
sponse, be it a verbal classification or 4 
graphic illustration. 

The correlations between recognition thresh- 
olds and the decision-time variables were 
neither high nor consistent, and indicated no 


Tae 


= 


i 


"i 


pi 
» | strong relation between ease of recognition 
pal 


and the reflection-impulsivity dimension. 


DISCUSSION AND SUMMARY 


* The major finding of this investigation was 
“that the tendency toward short or long re- 
«sponse times in reporting descriptions of 
* tachistoscopically presented scenes was posi- 
tively associated with response times in situ- 
ations in which all the response alternatives 
Were objectively present. It is assumed that 
* during the delay in the tachistoscopic situa- 
tion the child was considering the differential 
‘validity of alternative descriptions. Thus, the 
. disposition to be reflective versus impulsive 
generalized across the two situations. This 
“finding supports the postulation of a per- 
vasive individual difference variable involv- 
‘ing speed of decision in situations with high 
_Tesponse uncertainty. It is to be noted that 
“the assumption of a generalized tendency 
toward reflection or impulsivity holds only for 
situations in which several alternatives are 
generated (or present) simultaneously, and it 
is not immediately obvious which alternative 
s is correct. Response time to this class of 
problems is not necessarily related to response 
time in situations in which the solution hy- 
“pothesis is not generated (or present) imme- 
diately, or problems for which only one alter- 
native presents itself (e.g., What is the cube 
root of 1331? Who discovered the North 

- Pole?), 
A basic assumption in this research is that 
response times to the specific tasks described 
‘sare a faithful reflection of decision times. 
That is, the dimension reflection-impulsivity 
réfers to the degree to which a subject con- 
Siders alternative hypotheses, in contrast to 
reporting hypotheses with minimal considera- 
_ tion of their probable validity. Since long 
"response times to problem tasks can be a prod- 
uct of other psychological processes (e.g., 
‘fear of reporting any hypothesis, apathy, 


` Or an inadequate vocabulary), it is neces- 


4 
2] 


` sary both to control for these other factors 
* and to accumulate evidence that corroborates 
the link between the construct of decision 
‘time and the operation of response latency. 
Control of factors such as fear of answering 
and vocabulary resources has been accom- 
plished through experimental procedures (Ka- 
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gan et al., 1964), and earnest attempts to 
establish the best possible rapport with the 
children. Moreover, the independence of re- 
sponse time and vocabulary level has been 
validated in many independent studies (Ka- 
gan, in press; Kagan et al., 1964). The more 
difficult task of providing objective and un- 
ambiguous support for the hypothesis that 
long response times to a task like matching 
familiar figures are an index of long decision 
times is in progress. A recently completed 
study of fourth- and fifth-grade children fur- 
nishes the best evidence we have been able to 
gather up to now. A group of 52 girls and 56 
boys were given a slightly more difficult ver- 
sion of matching familiar figures, but the 
standard and variants were arranged in a 
manner that allowed an observer to code re- 
liably the number of times the subject’s eyes 
moved back and forth between the standard 
and the variants. A high positive correlation 
between number of distinct eye-scanning 
movements and response time to first selec- 
tion would suggest that the subject was ac- 
tively considering alternative answers during 
long delays. The correlation between these 
two variables was remarkably high (r = .97 
for boys; .92 for girls), furnishing strong 
support for this hypothesis. 

There are several implications of this work. 
These data suggest, first, that investigations 
that use response time as the major dependent 
variable should control for the subject’s 
generalized tendency to respond slowly or 
quickly, much as one controls for intelligence 
or social class. A second implication claims 
more theoretical substance. Speed of deci- 
sion, whether the decision be a classification 
of a stimulus or the selection of a solution 
sequence to a difficult problem, influences the 
probability of producing a successful solution. 
The child who impulsively selects the first so- 
lution hypothesis that occurs to him is more 
likely to produce an incorrect answer than 
one who reflects on the differential validity 
of various possibilities (Kagan, in press). It 
is reasonable to argue that training a child to 
reflect in situations with multiple solution 
possibilities will eventually affect the quality 
of his performance. 

A third implication questions the strategy 
of using a child’s drawings as an index of his 
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perceptions or mental constructions. Piaget 
and his colleagues (Inhelder & Piaget, 1958) 
sometimes infer the child’s conception of an 
event from his drawing. The Geneva investi- 
gations regard developmental differences in 
the child’s pictorializations as isomorphic 
with his understanding of an event. The 
present data suggest that there is considerable 
slippage between the child’s comprehension 
and graphic reproductions. For although re- 
flective children drew more complete ‘scenes 
than the impulsive subjects, their verbal de- 
scriptions of the scenes were not more com- 
plete with respect to the essential components 
of the picture. 

A final implication concerns the validity 
of inferences about personality constructs 
that are based on content variables derived 
from tests where the speed of response may 
affect the content of the verbal interpretation. 
Interpretations of inkblots and pictures are 
exemplary in this regard. For example, pure 
color responses to inkblots are typically asso- 
ciated with shorter response times than move- 
ment or form responses. Thus, responses such 
as “fire,” “blood,” “explosion,” which are 
usually scored as aggressive in meaning, are 
offered with less reflection than the responses 
“butterfly,” “cat,” “woman eating” (Kagan, 
Moss, & Sigel, 1963). Similarly, an interpreta- 
tion of Card 3 BM of the TAT that includes 
reference to the ambiguous revolver in the 
background is likely to be given by a subject 
who pauses to study all aspects of the picture. 
The subject who quickly reports his first 
association to the picture is less likely to 
include the revolver and, as a result, less 
likely to report an aggressive theme. System- 
atic study of the relation between degree of 
reflection over a cognitive product and the 
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seems a profitable research enterprise. 
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CREATIVITY AND ADAPTIVE REGRESSION * 


CYNTHIA WILD 


Yale University 


This study provides systematic evidence bearing on the psychoanalytic concept 


of “regression in the service of the ego,” which postulates shifts from more to 


Jess regulated thinking in the creative process. The performance of 30 art 
students was compared with that of 26 teachers and 26 schizophrenics on the 


Word Association and Object Sorting tests, given under (a) spontaneous in- 
; structions, and instructions to induce; (b) regulated; and (c) unregulated 
ñ thought. A shift score consisted of the difference between an individual’s score 
under regulated and unregulated conditions. Art students had significantly 


higher shift scores on both measures than the other 2 groups combined and 


showed more favorable reactions to unregulated thought than did teachers or 


schizophrenics. 


__ Recent systematic work on creativity has 
. been stimulated by psychoanalytic theory, 
where the current emphasis is on cognitive 
and adaptive concepts dealing with adjust- 
¡ment to the external world, on relatively 
neutralized, conflict-free spheres of ego activ- 
ity, like problem solving, and on the con- 
trolled use of more primitive psychic mecha- 
nisms which promote rather than interfere 
with adaptation (Gill, 1959). One such con- 


, cept is “regression in the service of the ego,” 


‘first presented by Kris (1952) and further 
„elaborated by Schafer (1958). It refers to a 
momentary, at least partially controlled low- 
ering of the level of psychic functioning, 
where constructive use is made of more primi- 
tive, drive-dominated, nonlogical (primary 
` process) modes of thought. According to this 


=~ View, the inspirational phase of creativity in- 


zÁ 


volves regressive thought processes, but in the 


“service “of the final production, so that all 


direction and control of thought is not lost, 
as occurs in psychotic primitivization of 
thinking. In the elaborative phase of creativ- 
ity, more conventional, neutral, logical (sec- 
‘e }This study is based on a dissertation submitted 


to Yale University in partial fulfillment of the 
Tequirements for the PhD. Further detailed informa- 


; ‘tion is available in the dissertation. The study was 
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ondary process) modes of thought prevail, 
when the product of inspiration is synthe- 
sized, criticized, and integrated. Thus, this 
view postulates the successive or simultane- 
ous interaction of more and less regulated 
thinking in creative activity. 

Utilizing a manual developed by Holt 
(1959) for scoring formal and content mani- 
festations of primary process on the Ror- 
schach, Holt (1960) and von Holt, Seng- 
stake, Sonoda, and Draper (1960) report 
positive relationships between such primary 
process manifestations and problem-solving 
ability on the Stroop Colour-Word Inter- 
ference Test and Hanfmann-Kasanin Con- 
cept Formation Test. In a study directly 
relating to the concept of regression in the 
service of the ego, Pine and Holt (1960) 
found that for 27 undergraduates, quality 
scores on seven tests of imagination were 
unrelated to amount of expression of primary 
process on the Rorschach, but related to 
measures of the effectiveness of control of 
primary process. Further support for this 
conceptualization is found in research by 
Barron (1955, 1957) on originality and 
Getzels and Jackson (1960) on creativity 
and in theoretical statements by Wallas 
(1926), Hutchinson (1949a, 1949b, 1949c), 
Thurstone (1952), and Werner (1957), 
which also suggest that the creative process 
involves both more primitive and more real- 
istic thinking. 

The major purpose of this study is to 
provide further systematic evidence relating 
to this hypothesis by seeing whether a 


162 


“creative” group differs from “normal” and 
psychotic control groups in their ability to 
shift adaptively from more to less regulated 
thought and their tendency to find unregu- 
lated thinking easy and enjoyable. Shifting 
seems particularly applicable to the concept 
of regression in the service of the ego, for it 
involves both a capacity to engage in un- 
regulated thinking and an ability to return 
adaptively to more regulated thought, imply- 
ing some degree of control of regression. Simi- 
larly, a favorable attitude toward an un- 
regulated response implies some distance and 
control, as well as enjoyment. Further, it 
implies more freedom from the anxiety and 
guilt that such loose thinking is likely to 
arouse without the danger of total loss of 
control that often accompanies psychotic 
primitivization of thought. 

The creative group consisted of students 
at a professional art school connected with a 
university. High school and elementary teach- 
ers were selected for the normal group. For a 
second comparison group, schizophrenics were 
chosen as the clinical population most likely 
to show uncontrolled pathological regression. 
The predictions were that art students will 
be highest, schizophrenics lowest, and teach- 
ers intermediate in shifting under regulated 
and unregulated instructions on certain 
tests and favorable reactions to unregulated 
thinking. 


METHOD 
Subjects 


The sample consists of 82 subjects, including 30 
art students, 26 teachers, and 26 schizophrenics. 

All art students were actively producing works of 
art at the time of testing. Two were majoring in 
sculpting and 28 in painting. Eighteen men and 12 
women volunteered, a sample roughly corresponding 
to the proportion of men to women at the art 
school. Their mean age was 25.17, with a range 
from 21 to 35. Their mean intelligence level, con- 
sisting of the average score on the Information and 
Block Design subtests of the Wechsler Adult Intel- 
ligence Scale, was 13.47, with a range from 9.5 to 
17.0. The group’s mean number of years of education 
was 16.73. t 

In order to match the teachers with the art 
students by group on sex, age, intelligence, and 
education, records at a teachers’ college were 
perused; and approximately 67% of teachers who 
were contacted agreed to be subjects. All but 1 
were doing graduate work in education at the time 
of testing. 
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Of the 26 schizophrenics, 23 were patients at 
Connecticut state hospitals and 3 were patients in 
private hospitals. All had a clear-cut diagnosis of 
schizophrenia and showed no evidence of organic 
brain damage. In order to screen out patients who 
were unable to grasp the instructions, all subjects 
were asked to judge immediately after reading each 
character sketch whether the character was above 
average, average, or below average on nine traits; 
like imaginativeness and curiosity. Testing was. 
discontinued with all patients who, in the ex- 
perimenter’s judgment, did not appear to dis- l 
criminate between the two characters. Three pa- l 
tients and no other subjects were excluded by this 
criterion.” Thus, since the instructions were complj- « 
cated, only patients in good contact were tested, 

All three groups are well matched on sex and | 
age. The mean difference between art students and | 
schizophrenics on intelligence approaches significance 1 
($ < .10). However, since the premorbid intellectual — 
functioning of the patients was almost certainly. 
above their present level, the two groups are prob-* 
ably better matched on intellectual potential than | 
these data show. While art students and teachers | 
are well matched on education, the differences of 
both art students and teachers from schizophrenics’ 
are significant (p <.001) and average about 2 years, 

Tt was practicably impossible to find enough patients 

who had completed college or done graduate work, 
not only because most of the patients were state 
hospital inmates, but also because the most likely 
age of onset of schizophrenia is in the late teens | 
or early twenties during the college years. K 
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Ratings : 


To investigate the relationship between creativity 
and shift within the art student group, four art, 
school faculty members were asked to independently | 
rate the art students’ creativity after all the testing | 
was completed. First, they checked off the students . 
whom they knew well enough to rate and then 4 
ranked those known to them on the global variable if 
“creativity,” leaving the definition of creativity up M 
to each rater. Three of the four raters had already | 
made such ratings on 28 other students, with inter- 1| 
rater reliabilities of -80, .50, and 47. 3 | 


Procedure 


Art students and teachers came to the experi- | » 
menter for the 2-3 hour testing procedure and re- 
ceived $2.00 an hour, while schizophrenics were seen 
in their hospital Settings and were not paid. All 
testing was done by the experimenter and, insofar ., 
as possible, everything the subject said was recorded 
verbatim. Initially, every subject was given the 
same, very general rationale for the study as an 


?The three groups do not differ significantly in 
their responses to these 18 questions for the two 
characters, so that group differences cannot be at- A 
tributed to differential understanding of the character »<3 
sketches. 


investigation of different kinds of thinking, and no 
mention was made of creativity. 

“Then the Word Association (WA) and Object 
Sorting (OS) tests were given in that order to 
-every subject (spontaneous condition), For both 
- ‘tests, the instructions and mode of administration 

. described by Rapaport were employed (Rapaport, 

Schafer, & Gill; 1945, 1946). While both tests can 

be taken in a regulated and unregulated manner, 
. since they allow for a wide leeway of response, they 

appear to require somewhat different cognitive ca- 
pacities, The WA test is a verbal test, calling for 

; quick, spontaneous, almost automatic thought, 

whereas the OS test requires more delayed, planned 
- „thinking. Thus, a demonstration of group differences 

in ability to shift on both tests would provide some 
evidence that shifting is a general cognitive approach, 

' «The WA test consisted of three lists of 30 words, 

one. list for each condition, The three lists were 

used the same number of times under each condition. 
» Each list consisted of 18 words taken from the 
iA - Kent-Rosanoff list (Rosanoff, 1944) and 12 from the 

Rapaport list. The OS test consists of 33 common 

objects, like silverware and tools, which the subject 

is asked to group together. Only Part I of the test 

*+was used, where 7 objects are presented to the sub- 

ject in turn and he is asked to put with each ob- 

ject those of the 32 remaining objects that belong 

with it and then to verbalize why they belong 
together, 

All subjects took the tests under the spontane- 

ous condition first, followed by the regulated and 

*, unregulated conditions in counterbalanced order, 

Character sketches were used to induce more and 

less regulated thinking. The experimenter presented 

the subject with either the “R” (regulated) or “U” 

(unregulated) character sketch. In brief, the “regu- 
lated” character was described as a conventional, 
cautious, reliable person, who prefers an orderly, 

Structured universe and values good common sense. 
|» „Suspicious of unusual ideas, he does not allow his 

thoughts to wander from ordinary, well-traveled 
paths, The “unregulated” character was described as 
having Dovel thoughts and whimsical yet acute per- 

* ceptions that May startle other people. He points 

to the contradictions in experience and enjoys 

fanciful speculations. After evaluating the character 
$ on nine traits, the subject was asked whether he 
liked or disliked, admired or did not admire the 
Person, The subjects were then told: 


f ‘e a Now, Td like you to take the Word Association 
Test you had before as this person would take it. 
So respond to each word that is read to you with 
he one word you feel would first come into his 

. ‘mind. This may not be the first word that comes 

+ Mto your mind—you may want to find another 

Word you think he would be more likely to give. 


Corresponding instructions for the OS test were: 
» Now, I'd like you to take the Object Sorting Test 
the way this person would take it. 


After the subject had taken both tests under R 
and U Instructions, exactly the same procedure was 
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followed for the other condition. Then, in order to 
obtain a measure of the subject’s reaction to un- 
regulated thinking, he was asked whether he pre- 
ferred the first or second character, which one he 
found easier to be, which one he felt he was actually 
more like, and which character he would rather be 
if he had a choice. 

At this point, the Rorschach was administered to 
all subjects, except for eight schizophrenics who had 
had it within 2 months prior to testing. Immediately 
after the test, the subject filled out a questionnaire, 
checking on a 5-point scale: how much he enjoyed 
the Rorschach, how easy he found it, how certain 
he was of his percepts, and how satisfied he was 
with his responses, Because the Rorschach is con- 
sidered to stimulate fantasy and less regulated think- 
ing, it was expected that the art students would 
react more favorably to it than the other two 
groups, 

The Block Design and Information subtests of 
the Wechsler were given at the end of the session, 


RESULTS 


Shift Scores on the Word Association (WA) 
and Object Sorting (OS) Tests 


All responses on the WA and OS tests 
were classified as original or conventional, 
with the rationale that an ability to give 
original responses reflects more flexibility of 
thought processes than does a capacity to 
give only safe, conventional ones. Each word 
on the WA test was scored conventional if it 
appeared on the Kent-Rosanoff or Rapaport 
norms and original if it did not. 

The experimenter and a trained clinical 
psychologist classified each response on the 
OS test as conventional or original according 
to norms given by Rapaport for conventional 
responses.* Interscorer reliabilities for indi- 
vidual total scores computed for each condi- 
tion and for shift scores are: spontaneous 
condition, .92; regulated condition, .87; un- 
regulated condition, .90; shift scores, .87. 


The OS test was also rated for integration- 
disintegration of thought according to Rapaport’s 
criteria for distinguishing organized and conceptual 
from disorganized and pathological modes of 
thought. The experimenter and an untrained scorer 
did blind ratings of each response on a 7-point 
scale of integration-disintegration, with an inter- 
scorer reliability of .92 for individual total shift 
scores. Since this set of scores was highly correlated 
with conventionality-originality scores (+.83 for 
shift scores) and since the patterns of results were 
very similar, only the results from the conven- 
tionality-originality scores will be presented. Where 
differences occur, they will be noted. 
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TABLE 1 


MEAN SHIFT SCORES FOR Art STUDENTS, TEACHERS, 
AND SCHIZOPHRENICS WITH £ TESTS OF THE 
ADJUSTED MEANS COMPARING ART 
STUDENTS AGAINST TEACHERS 
AND SCHIZOPHRENICS 


COMBINED 
Test studeits | Teachers ieee t 
WA 21.53 17.27 16.08 3.05* 
Os 5.90 4.73 3.83 2.94* 
*p <.005. 


The averaged scores of the two raters were 
used in the statistical analyses. Except for 
F tests, all probabilities presented are one- 
tailed, unless otherwise noted. 

Shift scores consist of the difference in the 
corresponding measures under the regulated 
and unregulated conditions. Since spontane- 
ous level of originality was found to be posi- 
tively correlated with shift scores—+-.22 for 
both Mmeasures—analyses of covariance were 
carried out for shift scores, with spontaneous 
level of originality as the predictor variable, 

These analyses show that the total between- 
group difference on shift scores for both the 
WA and OS tests is significant (p < .025).4 
Thus, as predicted, the three groups differ 
in their ability to shift from conventional to 
original responses on both tests. 

The mean shift scores for the three groups 
are presented in Table 1. In both cases, the 
means are in the predicted order, with art 
students showing the greatest ability to shift 
and schizophrenics the least, with teachers 
intermediate. Individual comparisons of the 
three means by the Tukey test (Scheffe, 
1959) showed that all significant differences 
were between art students and the other two 
groups. Therefore, art students were com- 
pared against teachers and schizophrenics 
combined by ¢ tests of the adjusted means 
(Lindquist, 1953). Since this same pattern 
of results emerged in all further analyses of 
shift scores, only the comparison of art stu- 
dents against teachers and schizophrenics 
combined will be presented. As predicted, on 
both the WA and OS tests, art students shift 
significantly more from conventionality to 
originality than teachers and schizophrenics. 


*This difference was not significant for the 
integration-disintegration scoring ($ < .20). 
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Analyses of variance of spontaneous scores 
were carried out to determine whether the 
groups differed in initial level of regulated or 
unregulated thinking. On the WA 3 
is a tendency (p< .20) for the 
differ on spontaneous originality; and on the 


OS test, the between-group difference on } 


spontaneous originality is significant ($ 
< .025). Table 2 shows that again art stu- 
dents differ significantly from teachers and 
schizophrenics on the two measures. However, 


test, there . f 
groups to` 


on the OS test, individual comparisons show _ 


that teachers give significantly more original’ 
responses than schizophrenics and do not 
differ from art students, Thus, with this 
exception, there is some tendency for the art 


students’ spontaneous mode of thought to lie . 


further toward the unregulated end of the 


continuum than that of either teachers or. 


schizophrenics." 


Two further analyses of covariance were. 


carried out to determine whether the groups 
differed in the extent to which they shifted 
in a regulated or unregulated direction. 
Spontaneous scores are the predictor variable, 
with scores under the regulated and unregu- 
lated conditions as the criterion variables. 
The analyses show that for both tests, the 
group differences on unregulated scores are 
significant (p < :001), while those on regu- 
lated scores are not, although there is some 
tendency for the groups to differ on regulated 
scores on the WA test (p < .20). A £ test of 
the adjusted means (Table 3) shows that on 
both tests art students become significantly 
more original under unregulated instructions 

” For integration-disintegration scores, art students 
were significantly more disintegrated spontaneously 


than teachers and schizophrenics combined (f 
< 0005). 


TABLE 2 


MEAN SPONTANEOUS Scorrs ror ART STUDENTS, 


TEACHERS, AND SCHIZOPHRENICS WITH £ TESTS 
OF THE MEANS COMPARING ART STUDENTS 
AGAINST TEACHERS AND SCHIZO- 
PHRENICS COMBINED 


gs F 
Test tadents Teachers jaa f 
WA 10.57 8.35 8.54 2158S 
os 3.40 3.17 1.96 1.85* 

* 05. 
=b $ i025. 
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TABLE 3 


MEAN UNREGULATED AND REGULATED SCORES FOR 
Art STUDENTS, TEACHERS, AND SCHIZOPHRENICS 
WITH £ TESTS OF THE ADJUSTED MEANS 
COMPARING ART STUDENTS AGAINST 
TEACHERS AND SCHIZOPHRENICS 


COMBINED 
at students | Teachers rear f 
Unregulated 
WA 21.63 15.54 13.58 | 5.16** 
* OS 5.92 4.65 3.65 | 3.93** 
Regulated * 
« WA 11.20 9.00 8.54 1.97* 
Os 2.02 1.92 1.83 04 


than teachers and schizophrenics. Under regu- 
lated instructions, art students are less con- 
` ventional on the WA test than teachers and 
_ schizophrenics (p < .05); but on the OS test, 
* the groups perform similarly under regulated 
instructions. Therefore, the group differences 
on shift scores are accounted for primarily 
by the art students moving further in an un- 
regulated direction than either teachers or 
schizophrenics, who again do not differ from 

One another. 

- The group means in Tables 2 and 3 show 
that for the WA test, instructions to become 
more regulated resulted in art students and 
teachers becoming slightly less conventional 
than they had been spontaneously, while 


* , schizophrenics did not shift at all from their 


Spontaneous level. The mean difference be- 
tween conditions for all subjects is not sig- 
nificant for the WA test. However, on the 
OS test, the instructions to become more 
‘regulated were effective (t = 3.73, p < .001, 
two-tailed). The difference between the effi- 
Cacy of the instructions for the two tests can 
be explained as follows. The WA instructions 
establish a set to respond quickly with the 
first word that comes to mind—usually a 
+ Synonym, antonym, or other conventional re- 
» SPonse—and the norms for the test correspond 
to this set. Instructions to role play would 
tend to interfere with this relatively automatic 
set by inducing a process of selection and 
Censorship, so that the response is not so apt 
to be a conventional word. The OS test, on 
the other hand, permits more freedom of re- 
Sponse. There is no time pressure and no 
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one correct answer, so that the subject has 
more opportunity to think about his responses 
and can be more original spontaneously and 
more conventional under regulated instruc- 
tions than on the WA test. The relatively 
automatic set to respond conventionally on 
the WA test may also account for the groups 
being more similar under the spontaneous 
condition of the WA than of the OS test. 

Shift scores on the WA test are positively 
correlated with those on the OS test over all 
subjects (+.56, p < .0005). For art students 
the correlation is .64 (p < 0005), for teach- 
ers +.54 (p < .005), and for schizophrenics 
+.30 (ns). These positive correlations, as 
well as the similarity in the patterns of re- 
sults from the two tests, provide some evi- 
dence that the ability to shift is a general 
cognitive approach. They also seem to indi- 
cate that this approach is a more general 
trait in normal than in schizophrenic subjects. 

Table 4, giving the frequencies of shift 
scores for the three groups, shows that there 
is a great deal of overlap among them. The 
overlap is more striking on the WA test, 
where 76% of all subjects have shift scores 
between 10 and 24, 


TABLE 4 


FREQUENCIES OF SHIFT SCORES FOR ART STUDENTS, 
TEACHERS, AND SCHIZOPHRENICS 


0-32 
Range 
students | Teachers iain 
WA 

35-39 

30-34 5 

25-29 6 2 2 
20-24 6 9 6 
15-19 8 6 6 
10-14 4 6 11 

5-9 1 3 

0-4 1 

pes ALY Gad 
0-9 
OS-conventional-original 

9 3 1 
7.5-8.5 6 5 3 

6-7 6 2 3 
4.5-5.5 7 6 4 

34 7 6 7 
1,5-2.5 1 4 6 

0-1 2 3 
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In sum, the prediction that art students are 
more able to shift from their spontaneous 
level of thinking than teachers or schizo- 
phrenics appears to be confirmed. But the 
prediction that teachers and schizophrenics 
would differ in their ability to shift is not 
confirmed. The groups do not differ in their 
ability to become more regulated; and clearly 
the shift in art students’ thinking is primarily 
toward the less regulated end of the con- 
tinuum. For every comparison made so far, 
except for spontaneous scores on the OS test, 
teachers and schizophrenics resemble one an- 
other, while the significant differences are 
between the art students and the other two 
groups. 


Further Qualitative Analysis of the OS Test 


In order to explore the question of how 
art students’ original responses differed from 
those of teachers and schizophrenics, a more 
comprehensive qualitative scoring system was 
developed for the OS test. This test was 
selected for further analysis because, since 
each response is longer than the single-word 
responses on the WA test, it is easier to score. 
Stemming from the concept of regression in 
the service of the ego, the general expectation 
was that artistic originality would involve 
loosening of both formal and content aspects 
of responses to a pathological degree, but also 
include adaptive elements indicating some 
control over and distance from such loose 
modes of thought. The protocols were studied 
with this orientation and a scoring manual 
worked out from these data. While the find- 
ings may reflect a particular sampling bias 
and hence are suggestive only, they seem suf- 
ficiently consistent and interesting to warrant 
presentation. P 

Every item on the test was judged by the 
experimenter and a trained clinical psycholo- 
gist for the presence or absence of five vari- 
ables: pathology, according to criteria worked 
out by Lovibond (1954); drive; humor; 
response tone; and unusual property, like 
sensuous, where objects are grouped together 
because of similarity in texture or weight, for 
example. Responses with one or more of these 
properties were classified as original, while 
responses with none of them were clas- 


sified as conventional; responses classified 
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as conventional met Rapaport’s criteria for 
conventional responses. x 

A measure of adaptive regression was ob- 
tained by taking the ratio of adaptive original 
responses to total number of original re- 
sponses. Adaptive original responses were: 
original responses without pathology and 
original responses with pathology, but also 
with an adaptive feature of humor, unusual 
property, or pleasant tone. 

The overall interscorer reliability for these 
ratio scores of adaptive regression is .75.° 
As expected, the original responses of art 
students given under the unregulated condi- 
tion contain significantly more adaptive re- 
sponses than do those of teachers and schizo- 
phrenics combined (p< .001). Of the 28 
subjects in the former group, 20 were above 
the general median; of the 38 subjects in the 
latter two groups, only 9 were above it. 

Several striking features emerge from the 
qualitative judgments. Under the unregulated 
condition, art students gave more responses 
containing pathology than the other two 
groups. Contrary to expectation, schizo- 
phrenics gave the fewest pathological re- 
sponses under all conditions, although the 
groups do not differ strikingly in the amount 
of pathology given under the spontaneous 
and regulated conditions. The same general 
pattern holds for the number of responses 
containing drive given by the three groups. 
Art students’ responses have more drive 
representations than those of the other two 
groups under the unregulated condition; but 
there are no striking group differences under 
the spontaneous and regulated conditions. 
Schizophrenics’ responses show the least drive 
under all conditions. The art students’ scores 
on pathology and drive are consistent with 

®The scores were pooled by classifying as adap- 
tive any response so judged by one or both raters. 
The ratio scores used in the statistical analysis 
consist of the percentage of each subject’s original 
responses which were judged to be adaptive. In order 
to reduce the instability of this measure, only sub- 
jects who gave three or more original responses 
under the unregulated condition were included, 
leaving 28 art students, 22 teachers, and 16 schizo- 
phrenics. A frequency distribution of the ratio scores 
was made over all subjects and a median test per- 
formed. Since teachers and schizophrenics again did 


not differ from one another, art students were 
compared against the two former groups combined, 


s» the findings presented above, which showed 
aE „that art students became far less regulated 
1 than the other groups under the unregulated 
= condition, On the other hand, their responses 
a1 | clearly include more adaptive features under 
i the unregulated condition than do those of 

j teachers and schizophrenics. 


Character Questionnaires 


Since individual chi-square comparisons of 
teachers and schizophrenics showed no, sig- 
, nificant differences, again art students were 
compared against teachers and schizophrenics 
- combined. 

"+ The results show that all groups tended to 
like and admire U and not to admire R. 
However, as predicted, teachers and schizo- 
phrenics liked R significantly more than art 
+ students ( < .025), and art students liked 
U significantly more than teachers and schizo- 
i. phrenics (p < .025). Moreover, the art stu- 
dents unanimously voted for U as the char- 
acter they would most like to be if they could 
choose, and differ significantly from the other 

two groups in this respect (p < .005). 


*, Rorschach Questionnaire 


+ Analyses of variance of the four questions 
show that the groups differ significantly on 
how much they enjoyed the test (p < .001), 
how easy they found it (p < .025), and how 
Confident they were of their responses (p < 
£ , 05). They did not differ significantly on 
+, degree of satisfaction with their responses. On 
i = all four questions, the mean of the art stu- 
T: dent was more positive in attitude than the 
Mean of either of the other groups. 


a 


. Ratings of Creativity of Art Students 
‘The raters’ rankings were received in dif- 
; t forms, and the four raters did not 
Know the same students, so that it was im- 
i le to obtain indices of interrater relia- 

| bility. However, in spite of the disorganized 
+e ‘condition of the ratings, the results are all 
= Consistent with the prediction that art stu- 
dents rated high on creativity would differ 


Fat 7 Since 8 schizophrenics had had Rorschach within 

. z prior to testing and 1 teacher did not 

$. follow instructions, the questionnaires of 18 schizo- 

< \renics, 25 teachers, and 30 art students were used 
ý in the statistical analysis, 


X 


> 
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TABLE 5 


MEAN SHIFT SCORES FOR THE Upper (Most CREATIVE) 
AND LOWER (LEAST CREATIVE) THIRDS OF THE 
ART STUDENT GROUP, EXCLUDING THREE 
Supyects oN WHom RATERS 
STRONGLY DISAGREE 


Test Uppers Lower* t 
WA 14.33 9.44 1.83* 
os 6.72 5.61 1.26 
aN =9, 
*p <05. 


from those rated low on their ability to shift. 

The correlations of ratings with shift scores 
on the WA and OS tests are in the predicted 
direction, but not significant (WA, + .28; 
OS, + .26). Table 5 presents the mean shift 
scores of the upper and lower thirds of the 
group, excluding three subjects on whom there 
was strong interrater disagreement. The re- 
sults for both tests are again in the predicted 
direction. The mean difference between the 
high and low creative groups on the WA test 
is significant ( < .05); but on the OS 
test, the difference is not significant (p< 
.20). Table 6 presents the mean shift scores 
for the four subjects at least two raters agree 
are high in creativity and for the six subjects 
at least two raters agree are low in creativity. 
It can be seen that this comparison increases 
the group differences on both tests. The mean 
difference is significant again for the WA 
test (p< .05) and approaches significance 
on the OS test (p < .10). 

There is a significant correlation of + .37 
(p < .05, two-tailed) between creativity and 
intelligence in the art group, so that intelli- 
gence seems to have influenced the raters. 
But there are no significant correlations be- 
tween shift scores and intelligence in any of 
the groups, although low, positive correlations 
tend to be present. This finding suggests that 


TABLE 6 


MEAN SHIFT Scores ror 10 SUBJECTS AT Least Two 
RATERS AGREE ARE HIGH or Low ON CREATIVITY 


Test Highs* Lows? t 
WA 19.00 10.83 2.41" 
os 7.62 | 5.75 1.60 
| | 
oN =4. 
bN =6. 
*p <.05. 
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shifting is related more to personality struc- 
ture than to intelligence. 


Discussion 


Spontaneously, there is some tendency for 
art students to be more original than school- 
teachers and schizophrenics, and they clearly 
shift their thinking in an unregulated direc- 
tion more than the other two groups. Con- 
sistently, they seem able to engage in unreg- 
ulated thinking with more pleasure and facil- 
ity. They like the unregulated character and 
dislike the regulated character more than the 
other two groups, and all 30 of them would 
prefer to be the former if they had a choice. 
They took the Rorschach with more enjoy- 
ment, ease, and assurance than did schizo- 
phrenics. Although the evidence from the 
qualitative analysis of original responses can 
be suggestive only, their original responses 
appear to differ from those of the other two 
groups, whose originality is apt to be more 
pathological, without the accompanying adap- 
tive features found in the art students’ un- 
regulated responses. 

These findings are all consistent with the 
concept of regression in the service of the 
ego, which postulates that creators possess 
the capacity of gaining easy access to id material 
without being overwhelmed by it, of retaining con- 
trol over the primary process, and, perhaps spe- 
cifically, the capability of making rapid or at least 
appropriately rapid shifts in levels of psychic 
function . . . [Kris, 1952, p. 25]. 


However, these results are also open to the 
interpretation that the art students differ 
from schoolteachers and schizophrenics be- 
cause of particular subgroup differences, not 
creativity. But in addition to the fact that 
the groups were selected to represent different 
levels of creativity, the results from the rat- 
ings suggest that within the art student sam- 
ple shifting is positively related to creativity. 
It would be of value in subsequent research 
to see whether this difference between more 
and less creative art students existed between 
more and less creative individuals in other 
specialty groups. 

Except for the Rorschach questionnaire, 
teachers and schizophrenics, contrary to pre- 
diction, were very similar. Sampling factors 
seem to provide the most likely explanation 
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for these results. As mentioned above, only 
patients in good contact were tested because 
the instructions were complicated. Therefore, 
this sample would be more apt to have re- 
tained ego functions like conventional modes 
of association and concept formation than 
other patients. In the previously cited study 
with the Object Sorting test, Lovibond (1954) 
found that 34% of the schizophrenic patients 
in his sample did not show signs of thought 
disorder by the criteria he set up, and the 
present sample may include a large propor- 
tion of such intact patients. Furthermore, 
inability to shift does not necessarily indicate 
pathology; and the teachers may have shown 
less ability to shift than the art students for 
different reasons from the schizophrenics. 
The three groups appeared to have very 
different general approaches to test taking. 
First, it was striking to see how involved 
many of the art students became with the 
tests. They seemed to work to their utmost, 
throwing themselves completely into the task, 
often with great gusto and enjoyment. For 
example, the energy they poured into the OS 
test seems reflected in the many “action” 
responses they gave under the unregulated 
condition. Responses were scored “action? 
when actual manipulation of the objects 
formed the basis of the grouping. For exam- 
ple, one art student balanced the ball on the 
bowl of the pipe, saying, “Bubble pipe!” 


Teachers rarely seemed to become as involved - 


or fascinated with the test stimuli; and schizo- 
phrenics seemed noticeably more remote and 
detached from the objects. s 
The art students also appeared to take the 
instructions as a real challenge and to set 
high standards for themselves in meeting 
their self-imposed goals. At least part of their 
tendency to give more original responses 
under the regulated condition of the WA 
test seemed due to their trying to give words 
that would be appropriate for their interpre- 
tation of the regulated person, rather than 
giving an easier, more conventional response. 
For example, one subject responded “Arthur 
Murray” to “dance” and “Tray” to “dog.” A 
third impression of the art students’ approach 
was the sense of assurance they appeared to 
have in their own internal standards, often 
giving the most arbitrary responses with an 


i 
l 
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is air of absolute certainty. In contrast, the 
\__.performance of teachers and schizophrenics 
mi seemed characterized by doubt and uncer- 
.` tainty under the unregulated condition. 
. ° Another impression given by many of the 
> art students was that they were very inter- 
ested in and intrigued by their own responses 
< and inner life and enjoyed discovering some- 
thing new about themselves. This attitude 
J seemed in striking contrast to that of schizo- 
=“ phrenics and, to a lesser extent, that of teach- 
. . ers. The latter two groups appeared to be cau- 
' tious and fearful about self-revelation and in- 
trospection, while art students appeared to 
"be much more exhibitionistic about flam- 
boyantly displaying their loose thinking. 
There were many exceptions to these im- 
pressions, but such factors appear to be rele- 
« vant to creativity. The capacities for complete 
involvement in the task at hand, for setting 
^ up high, individual, even arbitrary internal 
standards, which are independent of external 
ones, for looking within oneself with curi- 
osity and enjoyment, and for self-display in 
personal productions seem like important 
facets of the creative person. Granting that 
_ the ability to shift from more to less regu- 
lated thinking is an aspect of creativity, it is 
only one component among many and should 
be considered in the total personality config- 
uration of the creative individual. The char- 
_ acteristics mentioned above would seem to 
) ı provide a favorable context for engaging in 
“4 adaptive unregulated thought. 


> 
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DREAM REPORTING FOLLOWING ABRUPT AND 
GRADUAL AWAKENINGS FROM DIFFERENT 
TYPES OF SLEEP’ 
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LEROY JARET, ann IRVING SLESER? 
State University of New York, Downstate Medical Center 


The study was designed to identify conditions under which dream reports are 


likely to be given when Ss are awaken 


ed from different types of sleep. As has 


been found in previous studies, awakenings from periods of Stage-1 sleep with 


rapid eye movements (REM) 


led to reports of cognitive experiences more 


often than awakenings from periods of Stage-2, -3, or -4 sleep without rapid 


eye movements (non-REM), 
labeled as thoughts rather than 


and the non-REM reports were frequently 
dreams by the Ss. Thinking reports occurred 


more frequently when the awakenings were gradual than when the awakenings 
were abrupt. Reports of cognitive activity decreased in frequency as a function 
of time in Stage-2, -3, or -4 non-REM sleep after the end of a REM period. 


The same decrease was obtained 


when the Ss entered non-REM sleep from a 


waking state, without an intervening REM period. 


As a person falls asleep, there are consistent 
changes in EEG activity which have been 
classified into four sleep stages (numbered 1 
through 4 in the order of their occurrence) 
by Dement and Kleitman (1957a). During 
the course of sleep, a person returns (or 
emerges) to a Stage-1 sleep for relatively pro- 
longed periods at intervals of about 90 min- 
utes. These emergent Stage-1 sleep periods 
differ from periods of Stage-2, -3 , or -4 sleep 
in a number of physiological and psycholog- 
ical functions. 

Several of these differences are important 
for the present study. 

1. Eye-movement activity: Emergent Stage- 
1 sleep periods are accompanied by rapid, 
conjugate eye movements which do not occur 
during Stage-2, -3, or -4 sleep (Aserinsky & 
Kleitman, 1953; Dement & Kleitman, 1957a). 
These rapid eye movements (REMs) are so 
characteristic that Stage-1 sleep with rapid 
eye movements has been called REM-period 
sleep. 

2, Frequency of reports of preawakening 
cognitive activity: When awakened from 


~REM-period sleep, subjects report that they 


were experiencing something before the awak- 


1This study was supported by the National Insti- 
tutes of Health, United States Public Health Service, 
Grants MH-03885, MH-05518, and MH-K3-16,619. 

? Now at the Research Center for Mental Health, 
New York University. 


170 


ening and can recall the content of their ex- 
perience in about 80% of the awakenings. 
Awakenings from Stage-2, -3, or -4 sleep yield 
such reports less frequently (Dement & Kleit- 
man, 1957b; Foulkes, 1962; Goodenough, 
Shapiro, Holden, & Steinschriber, 1959; Or- 
linsky, 1962; Rechtschaffen, Verdone, & 
Wheaton, 1963). 

3. Dreamlike versus thoughtlike reports: 
Thoughtlike reports are more often obtained 
from Stage 2, 3, or 4 than from REM pe- 
tiods (Foulkes, 1962; Rechtschaffen et al., 
1963). 

4. Subjective judgments of sleep depth: 
Subjects judge that they were in a deeper 
sleep during REM periods than during 
Stage 2, 3, or 4 (Rechtschaffen et al., 1963). 

The emergent Stage-1 REM periods also 
differ from nonemergent Stage-1 periods which 
occur as the subject falls asleep. During these 
nonemergent Stage-1 periods, rapid eye move- 
ments usually do not occur (Dement & Kleit- 
man, 1957b), arousal thresholds are lower 
(Dement & Kleitman, 1957a), and reports 
are often thoughtlike or hypnagogic (Dement 
& Kleitman, 1957a; Foulkes, 1962). 

Dement and Kleitman (1957a, 1957b) 
have proposed that the rapid eye movements 
may reflect scanning activity in the visual 
imagery of a dream, and that REM sleep 
may be used as an objective, operational defi- 
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nition of the dream state. The proposal that 
dreams occur during REM periods has re- 
ceived support from studies on the relation- 


` „ships between eye-movement activity and 


‘characteristics of subsequent dream reports 


` (Berger & Oswald, 1962; Dement & Wol- 


pert, 1958; Goodenough, Lewis, Shapiro, & 
Sleser, 1965; Kamiya, 1962*; Roffwarg, 
Dement, Muzio, & Fisher, 1962). Further 


_ Support has come from studies on the rela- 
` tionships between measures of the duration 
,of the REM period preceding an awakening, 


subjective estimates of the duration of the 


. dream, and word counts of the dream narra- 


tive (Dement & Kleitman, 1957b; Dement 


_ & Wolpert, 1958). 


The proposal that dreams or hypnagogic 
phenomena occur only during Stage-1 sleep 
has been more controversial. In their early 
work, Dement and Kleitman (1957b) found 


*- dream recall in only 7% of awakenings from 


Stage-2, -3, and -4 sleep, but in recent studies, 
much higher percentages have been obtained 
(Foulkes, 1962; Goodenough et al., 1959; 


' Orlinsky, 1962; Rechtschaffen et al., 1963). 


These discrepancies can probably be ac- 


» counted for on the basis of procedural dif- 


ferences among the various studies, but what- 
ever the reasons, the large number of Stage-2, 
-3, and -4 reports that may be elicited under 
Some conditions raises a question about the 
usefulness of the assumption that dreams oc- 


: , Cur only during Stage-1 sleep. In approaching 


this question, it seems critical to understand 
more fully the specific conditions under which 
Teports may be obtained from Stages 2, 3, and 
4. 


If we assume that a dream occurs only 
during Stage-1 sleep, there appear to be at 
least two possible ways of accounting for the 


. Stage-2, -3, and -4 reports. 
s+ One possibility is that they represent 


arousal artifacts. This possibility has been 


‘discussed by Foulkes (1962) and by Roff- 
: Warg et al. (1962). In the study by Roff- 


Warg et al., a case is reported in which Stage-1 
sleep, accompanied by rapid, conjugate eye 
Movements, was observed during an arousal 

* “Behavioral and Physiological Concomitants of 
Dreaming,” progress report to the United States 


Public Health Service, Research Grants M-2116 and 
M-5069, February 1962, 
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from non-REM sleep. Incidents of this sort 
have occasionally been observed in our lab- 
oratory as well. Because these examples are 
infrequent, it is not clear whether the awak- 
ening stimulus began, coincidentally, just 
before a spontaneous change from non-REM 
to REM-period sleep, or whether the stimulus 
actually induced a brief REM period before 
the subject was fully awakened. There are 
many occasions, however, in which an awak- 
ening from Stage-2, -3, or -4 sleep is gradual 
and the arousal is acompanied by a prolonged 
Stage-1 EEG record without rapid, conju- 
gate eye movements. If reports of arousal ex- 
perience do occur, then they might be ex- 
pected with greatest frequency in these cases. 
They might also be described as thoughtlike 
in nature. 

In a study of the correlation between the 
duration of Stage-2, -3, and -4 awakenings 
and dream reporting, Foulkes failed to ob- 
serve any tendency for reports with cognitive 
content to occur more frequently when the 
awakening was prolonged. However, in a 
series of studies of REM-period awakenings 
in which the abruptness of awakening was 
experimentally varied (Shapiro, Goodenough, 
& Gryler, 1963; Shapiro, Goodenough, Lewis, 
& Sleser, 1965), evidence was found that the 
subject may report thoughtlike experiences 
more often from gradual awakenings. 

A second possibility to account for reports 
from Stage-2, -3, and -4 sleep is that the 
subject may be recalling experiences which 
occurred earlier at night during REM pe- 
riods, or during Stage-1 sleep periods which 
occurred as the subject fell asleep. This possi- 
bility suggests a number of hypotheses. 

1. Content of preceding experience: If 
emergent Stage-l1 REM periods represent 
dreams, and nonemergent Stage-1 sleep pe- 
tiods without rapid eye movements more 
often represent hypnagogic experiences, then 
one might also expect a difference in dream 
reporting between Stage-2, -3, or -4 awaken- 
ings: when the previous Stage 1 was an un- . 
interrupted REM period; and when the 
previous Stage 1 was a nonemergent period 
without rapid eye movements. If the Stage-2, 
-3, or -4 awakening (non-REM awakening) 
followed a previously uninterrupted emergent 
Stage-1 REM period, during which a dream 
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presumably occurred, then the content of the 
non-REM awakening should be dreamlike 
since it involves the recollection of the pre- 
vious uninterrupted REM-period dream. If, 
however, the Stage-2, -3, or -4 awakening 
followed a nonemergent Stage-1 period, with- 
out rapid eye movements, and presumably in- 
volving a hypnagogic experience, not a dream, 
then the content of the non-REM awakening 
would be more thoughtlike since it involves 
the recollection of a hypnagogic experience 
rather than a dream. 

2. Interval between previous Stage-1 period 
and awakening: If a memory phenomenon is 
involved, one might also expect that the per- 
centage of awakenings which yield dream re- 
ports would decline as a function of the time 
interval between the end of a Stage-1 sleep 
period and the Stage-2, -3, or -4 awakening. In 
fact, there are data (Dement & Kleitman, 
1957b; Kamiya, 1962 *; Wolpert & Trosman, 
1958) indicating a much greater chance of ob- 
taining a dream report if the subject is awak- 
ened a few minutes after the end of a REM 
period than at a later time during Stage-2, 
-3, or -4 sleep. 

The present study was designed to investi- 
gate the effects of the abruptness of awaken- 
ing on dream reporting during Stage-2, -3, and 
-4 sleep; the relationship between dream re- 
porting and the duration of the interval be- 
tween the end of Stage-1 sleep and the Stage- 
2, -3, or -4 awakening; and the difference in 
dream reporting from Stage-2, -3, and -4 
awakenings preceded by different kinds of 
Stage-1 sleep. In addition, the design of the 
study made it possible to examine the rela- 
tionships among EEG, behavioral, and sub- 
jective measures of sleep depth. 


METHOD 


The subjects were 10 male college or medical 
students, selected from a larger pool of volunteers 
to work as paid subjects in sleep-and-dream studies, 
The equipment and general procedures used in the 
study have been described in detail elsewhere (Good- 
enough et al, 1959; Shapiro et al., 1963). Briefly, 
each subject slept in the laboratory during his nor- 
mal sleep period for a number of nights (not con- 
secutive). Monopolar frontal and occipital or bipolar 
fronto-occipital EEGs were continuously recorded. 
Electro-oculograms (EOGs), which made possible 
the detection of horizontal, rapid, conjugate eye 
movements, were also continuously recorded. 
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Each subject was told that a bell would ring 
several times during the night. He was instructed 
to respond to the bell by picking up a bedside tele- 
phone (which automatically turned off the stimulus), 
to describe what was “going through his mind” in 
as much detail as possible, to label his experience as | 
a thought or a dream, to judge whether he was . 
awake or asleep, and, if asleep, to give a subjective 
estimate of his sleep depth as light, medium, or 
deep. The subject was asked to give his reports 
immediately after picking up the phone and before 
the experimenter spoke to him, in order to minimize 
any possible experimenter bias. After nf subject ` 
gave his report, the experimenter conducted a brief 


inquiry to clarify any ambiguity in the spontaneous, ° 


report. The complex response required to turn off 
the bell and the regular inquiry conducted after 3 
each report insured that the subject was fully 
aroused during each awakening. 

The subjects were sometimes awakened abruptly 
and sometimes gradually. In the abrupt awakening 
procedure, the awakening stimulus was presented at 
approximately 80 decibels (re .0002 dynes/cm*) | 
until the subject automatically turned it off by pick- 
ing up the phone. The reaction time between stimu- 
lus onset and the telephone response was recorded. 
In the gradual awakening procedure, the stimulus 
was first presented at an intensity far below the 
arousal threshold (approximately © decibels) and 
increased in 4-decibel steps until the subject turned 
it off by answering the telephone. At each step the 
stimulus was on. for approximately 10 seconds, and 
there were approximately 5 seconds between steps. 
The intensity of the stimulus required to elicit the 
telephone response (arousal threshold) was recorded. 

The design of the study involved awakening each 
subject on 36 occasions including 12 Stage-1 REM- 
period awakenings, 16 awakenings from Stage-2 
non-REM periods, and 8 awakenings from Stage-3 
or -4 non-REM periods. For the Stage-2, -3, and -4 


awakenings, half of each type followed uninter- , “ 


rupted REM periods and half followed experi- 
mentally interrupted REM periods. The “inter- 
rupted” awakenings were not included if the subject 
had rapid eye movements during the nonemergent 
Stage-1 period that occurred as he returned to 
sleep. For the Stage-2 non-REM awakenings, half 
of each type were taken within 10 minutes after 
(mean time approximately 5 minutes), and half 
more than 20 minutes after the first spindle (mean 
time approximately 30 minutes), signaling the return 
to Stage-2 non-REM sleep after the experimental 
awakening or uninterrupted REM period. The Stage- 
3 and -4 non-REM awakenings all occurred after 
at least 20 minutes of Stage-2, -3, or -4 sleep.* In 
addition, half of each type of awakening were abrupt 
and half were gradual. 


4It was originally our intent to obtain eight 
Stage-3 and -4 awakenings within 10 minutes of the 
return to non-REM sleep, but this plan was aban- 
doned because the first appearance of delta did not 
often occur within 10 minutes of the onset of non- 
REM sleep. 
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TABLE 1 
DESIGNED DISTRIBUTION OF AWAKENINGS FOR EACH SUBJECT 


From non-REM periods 


3 Stage 2 Stage 3 or 4 
"Method of | From REM 7 
f iods Following Following 
E periods Following uninterrupted Following interrupted | uninterrupted | interrupted 
k REM periods by REM periods by REM REM 
3 a periods by periods by 
ii y 30 Sf 30’ 30° 30’ 
| Abrupt 6 2 2 2 2 2 2 
” Gradual 6 2 2 2 2 2 2 


. 

Table 1 summarizes the number of awakenings of 
each type on each subject. 
» In general, each subject was studied for the num- 
* ber of nights required to complete the set of awak- 
nings, In a few cases, when the subjects did not 

wish to continue, the study was discontinued before 
; the required number of 36 awakenings was com- 
* pleted. In a few other cases, more than the planned 
number of awakenings were collected. All subjects 
Were awakened at least once under each condition, 
however. When there were more or less than the 
Planned number of awakenings of a given type, the 
Scores were adjusted to prevent confounding in the 
data analysis. 

In all, there were 357 awakenings conducted on 
98 nights of study. All of the subjects’ reports were 
tape recorded and the tapes were transcribed for 
Scoring by a judge who had not seen the subject and 
Who had no knowledge of the awakening condition. 
All Teports were categorized according to a system 
developed in our earlier studies of the effects of 


P; 


abruptness of awakening from REM-period sleep. 
i , The reports were first divided into a group in which 
4 the subject believed that he had an experience which 
i Occurred before the bell rang and could describe some 


, Content, and a group in which no content was re- 
1 ported. The content reports were subdivided into 
‘those in which the subject labeled his experience a 
‘dream, and those in which he labeled it a thought. 
2 No-content reports were subdivided into those 
in Which the subject said that he was asleep, and 
those in which he said he was awake. When the 
_ ‘Subject expressed any doubt, the report was classed 
` as thinking or awake. The subjects’ subjective rat- 
tgs of sleep depth were scored from 1 (awake) to 

_ 4 (deeply asleep). 
' Word counts were also conducted on the reports 
i With content in order to measure the length of the 
narratives. In addition, the narratives were sub- 
jected to content analyses designed to evaluate the 
: ike versus thoughtlike quality of the report. 
‘The content analyses were conducted by a “blind,” 
independent judge’ who was given transcripts in 

Tra 


, _°We are indebted to Elaine Karp for serving in 
Capacity. 


which the subject’s comments concerning depth of 
sleep, etc., were censored. The five content scales 
were as follows: 

Bizarreness—a 3-point scale for the most bizarre 
content element in each narrative ranging from 0 
(ordinary elements as in day fantasies) to 4 (un- 
likely elements in unusual combinations) to 1 (fan- 
tastic or distorted elements). 

Activity—a 3-point scale for assessing the level of 
physical action or activity by the characters in the 
narrative ranging from O (a low level of action) to 
+ (intermediate activity) to 1 (a high level of 
physical activity). 

Self-representation—a 2-point scale evaluating the 
presence (scored 1) or absence (scored 0) of the 
dreamer as a protagonist of the action. 

Visual imagery—a 2-point scale evaluating the 
presence (scored 1) or absence (scored 0) of ref- 
erences to a visual experience. 

Overt reference to laboratory—a 2-point scale eval- 
uating the presence (scored 1) or absence (scored 0) 
of explicit inclusion of some element of the experi- 
mental situation, 

Wilcoxon’s matched-pairs signed-ranks test was 
used as a test of significance. 


RESULTS 


Dream Reporting from REM and Non-REM 
Sleep 


Data comparing REM and Stage-2, -3, and 
-4 sleep are summarized in Table 2. These 
data confirm the findings from recent studies. 
There were significantly more reports of cog- 
nitive experiences given from REM-period 
awakenings than from Stage-2, -3, or -4 
awakenings, and the REM-period reports 
were usually longer. However, there were 
many Stage-2, -3, and -4 reports in which the 
subjects were able to recall some content. 
Over half of all Stage-2, -3, and -4 reports 
with content were labeled as thoughts by the 


174 


TABLE 2 
DREAM REPORTING FROM REM anp Non-REM SLEEP 
Measures of report 
characteristics from 
Report characteristic 
REM Non-REM 
(Stage 1) | (Stage 2, 3, 4) 

Content 

Total (%) 84** 45** 

Dreaming (%) 76** 21** 

Thinking (%) 08* 24* 
No content 

Asleep (%) EZA 36** 

Awake (%) 04** 108%. 
Word count (M) 1153s 34** 
Bizarreness (M) pokes Alt. 
Activity (M) AG 23s 
Self-representation (M) note A0** 
Visual imagery (M) .64** PY bea 
Laboratory reference (M) | .16 26 


* REM-non-REM difference significant, p < .05. 

** REM-non-REM difference significant, p < 01. 
subjects, and content analyses of these reports 
clearly support the hypothesis that the nar- 
ratives had many thoughtlike features. 


Dream Reporting from Different Kinds of 
Non-REM Sleep : 


Reports from different kinds of Stage-2, -3, 
and -4 non-REM sleep are compared in Ta- 
bles 3 and 4. 

It was expected that the content of the 
Stage-2, -3, and -4 reports would be labeled 
by the subjects as dreams more frequently 
after uninterrupted REM periods, when dream 
experiences presumably occur. We also ex- 
pected that the content would be labeled as 
thoughts more frequently when the preceding 
REM periods had been experimentally inter- 
rupted, and a Stage-1 period without rapid 


TABLE 3 


DREAM REPORTING FROM STAGE-2, -3, AND -4 Non- 
REM SLEEP FOLLOWING INTERRUPTED AND 
UNINTERRUPTED REM PERIODS 


% of reports following 
Recall category 
Interrupted | Uninterrupted 
REM periods | REM periods 
Content 
Total 49 42 
Dreaming 23 20 
Thinking 26 22 
No content 
Asleep 33 39 
Awake 18 19 
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eye movements (presumably hypnagogic in 
nature) had occurred immediately prior to 
the onset of the Stage-2 period as the subject 
returned to sleep. 


These expectations were not confirmed’ 
(Table 3). For all Stage-2, -3, and -4 periods, « 


slightly more thinking reports were given from 
awakenings which followed interrupted REM 
periods (26%) than from awakenings which 
followed uninterrupted REM periods (22%), 


but slightly more dreaming reports were also’ 


obtained following the interrupted periods — 


(23%) than the uninterrupted periods 
(20%), and none of these differences was sig- 
nificant. č 


TABLE 4 


DREAM REPORTING FROM STAGE-2 Non-REM SLEEP 
AS A FUNCTION OF ELAPSED TIME FROM 
PRECEDING STAGE-1 PERIOD 


% of all reports following 
Stage-1 periods by 
a E e ai ze 
5! 30! 
Content 
Total 56 33 
Dreaming 28 20 
Thinking 28 13 
No content 
Asleep 25 46 
Awake 19 21 


For all Stage-2 periods, many more reports 
with content were obtained from the awaken- 
ings 5 minutes after the onset of Stage-2 sleep 
(56%) than from the awakenings 30 minutes 
after the onset of Stage-2 sleep (33%) (Ta- 


ble 4). This difference was significant (p < | 


05), confirming the findings of Dement and 
Kleitman (1957b), Kamiya,’ and Wolpert 
and Trosman (1958), 


No significant differences were found 


among the types of Stage-2, -3, and -4 awak- 


enings in the data based on word counts or 
content analyses, 


Dream Reporting as a Function of the 
Method of Awakening 


The data on dream reporting as a function 
of the method of awakening are summariz 
in Table 5. The data bear on a number of 
specific hypotheses, based on the assumption 
that some reports of cognitive content from 
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TABLE 5 


DREAM REPORTING AS A FUNCTION OF THE METHOD OF AWAKENING FROM REM ann Non-REM SLEEP 


% of reports from 
: Recall category (oy RE All stages 
i= 
3 Abrupt Gradual Abrupt Gradual Abrupt Gradual 
5 Content 
k Total 86 83 42 49 64 66 
g Dreaming 81 70 22 21 52 46 
Thinking 05 13 + 20 28 12 20 
ve No content 
Asleep 09 14 40 32 24 23 
Awake 0S 03 18 19 12 11 


=, 4 
_ „7 non-REM awakenings may be descriptions of 
» hypnopompic experiences, and these hypoth- 
eses were generally confirmed. 
When the data from all stages of sleep are 
; Pooled, the only significant difference between 
abrupt and gradual awakenings is in the 
tendency to give thinking reports. As we have 
found in two previous studies (Shapiro et al., 
, 1963, 1965), thinking reports are given more 
frequently from gradual awakenings than 
/ from abrupt awakenings (p< .01). Con- 
‘sidering the REM-period awakenings only, 
_ this difference is accompanied by a tendency 
for dreaming reports to occur more fre- 
quently from abrupt awakenings than from 
gradual awakenings (p< .05). This result 
_ also confirms the findings from our previous 
Studies. For Stage-2, -3, and -4 awakenings, 
4 however, abrupt awakenings do not yield 
, More dream reports. For non-REM awak- 
“enings, the increase in thinking reports 
with gradual awakenings is apparently ac- 
companied by a decrease in no-content asleep 
reports. There was a significant interaction 
(Ż < .05) between the REM/Stage-2, -3, and 
4 and the abrupt/gradual variables in the 
| tendency to give no-content asleep reports, 
„and in the percentage of reports with some 
^ Content, 
Perhaps these findings can best be sum- 
. Marized by saying that the tendency for 
REM-period awakenings to yield dream re- 
„Ports and for Stage-2, -3, and -4 non-REM 
awakenings to yield no-content reports is 
most marked when the awakenings are abrupt. 
en the awakening is gradual, the prevail- 


ing type of report tends to decrease in fre- 
quency and is replaced by thinking reports. 

This view receives additional support from 
an analysis of the relationship between reac- 
tion times on the abrupt awakenings and the 
tendency to give thinking reports. For the 
30-minute Stage-2, -3, and -4 abrupt awaken- 
ings, thinking reports never occurred when 
the subject was able to reach the phone within 
15 seconds of the start of the bell. The mean 
reaction time for all 30-minute Stage-2, -3, 
and -4 abrupt awakenings was 17 seconds. In 
contrast, the mean reaction time was 38 sec- 
onds for the awakenings which led to a think- 
ing report, and these reaction times were in- 
variably among the longest recorded for the 
subjects. 

These findings suggest that it is possible to 
account for the thinking reports which occur 
more than 10 minutes after the end of a 
Stage-1 period by assuming that they are 
reports of hypnopompic experiences. It does 
not seem possible to account for the Stage-2, 
-3, and -4 dreaming reports in this way, 
however. Not only did they occur as often 
with abrupt awakenings as with gradual 
awakenings, but they came from abrupt 
awakenings with relatively short reaction 
times (mean reaction time for 30-minute 
awakenings leading to a dream report was 11 
seconds). Nor is it possible to account in this 
way for all of the thinking reports from 
awakenings which followed a Stage-1 period 
by less than 10 minutes. Many of these 
awakenings had relatively short reaction 


times. 
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Relationships among Subjective Ratings of 
Sleep Depth, Reaction Times, and Arousal 
Thresholds 


The subject’s judgment of his own depth 
of sleep given at the time of awakening is 
obviously related to whether he labels his 
experience a dream or a thought. If he 
judges that he was deeply asleep, it seems 
likely that he will regard the content of 
what was in his mind as a dream. If he 
feels that he was awake, it seems more likely 
that he will report that he was “thinking.” It 
is of interest, therefore, to study the occa- 
sions on which the subject tells us that he was 
“awake,” but other indices of sleep depth in- 
dicate that he was in fact asleep. These oc- 
currences indicate that a perfect correspond- 
ence by no means exists between subjective 
judgments of sleep depth and behavioral or 
EEG measures. 

Table 6 shows that when a subject said he 
was “awake,” his threshold and reaction 
times are among the highest recorded for him. 
“Awake” subjective judgments thus come 
from the longest arousal periods. By any 
criterion, these awake reports are obviously 
errors in judgment and apparently reflect 
what may be the hypnopompic phenomenon 
previously used to interpret the relationship 
between reaction times and thinking reports. 
This view is also supported by the finding 
that the subjects judged that they were deeper 
asleep when the awakening was abrupt than 
when the awakening was gradual. For Stage- 
2, -3, and -4 periods, the mean rating was 
2.4 for abrupt awakenings and 2.0 for grad- 
ual awakenings (p < .05). For REM periods, 
the difference was in the same direction but 
was not significant. 


TABLE 6 


RELATIONSHIPS BETWEEN SUBJECTIVE RATINGS AND 
BEHAVIORAL MEASURES OF SLEEP DEPTH 


Sleep-depth rating 
——m 
Deep | Medium |.Light | Awake 


Behavioral measure 


Mean reaction time 

(seconds) from abrupt 

awakenings 14 9 6 14 
Mean arousal threshold 

(decibel) from gradual 

awakenings 52 41 36 49 
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TABLE 7 


BEHAVIORAL MEASURES AND SUBJECTIVE RATINGS oF | 
Steep Dert in REM anc Non-REM Strep = 


REM |Non-REM 
Index of sleep depth (Stage 1) | (tage, 
Mean reaction time (seconds) d 
from abrupt awakenings hs 13* 
Mean arousal threshold (decibel) j 
from gradual awakenings 39 43 
Mean subjective rating from 1 
(awake) to 4 (deep sleep) 2.9** UE he 
% of reports in which the sub- j 
> ject believed he was awake 10** 38** 
Mean subjective rating without À 
awake reports 3A 3.0 


d 
q 
=b So 
When a subject judged that he was asleep, 
his subjective judgment coincided much more“ 
accurately with the behavioral indicators. | 
The differences between Stage-1 REM and 
Stage-2, -3, and -4 non-REM sleep are sum- 
marized in Table 7. Reaction times were sig- 
nificantly shorter in REM periods than in the 
other sleep stages. Arousal thresholds were 
also lower in REM periods, but this difference » 
was not significant. However, subjective rat- 
ings of sleep depth showed a difference in the’ 
opposite direction. REM sleep was judged to 
be deeper sleep by these subjects, thus con- | 
firming Rechtschaffen’s finding. Since the 
awake judgments apparently represent a qual- 
itatively different phenomenon than the other | 
judgments of sleep depth, these ratings were | 
further analyzed by discarding the awake | 
judgments and recomputing the mean ratings 4 
of sleep depth. These data are also presented | 
in Table 7. When the subjects believed that 
they were asleep, the mean ratings of depth of 
sleep were about the same for REM and 
Stages 2, 3, and 4. However, the subjects | 
believed that they were awake significantly 
more often during Stage-2, -3, and -4 sleep. { 
Tn 387% of all Stage-2, -3, and -4 awakenings, f 
the subjects mistakenly believed that they 
had been awake before the bell rang. if 
The data on sleep depth for different types. | 
of non-REM sleep are summarized in Table 8: 
Some interesting differences are apparent 1 
the table. Considering all the data from 
awakenings which occurred 30 minutes after | 
a Stage-1 period, there were significant dif í 
ferences between Stage-2, -3, and -4 sleep i 1 
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both reaction times (< .05) and arousal 


thresholds (p < .05). In both cases, Stage-2 


“sleep was lighter. However, the subjective 


„ratings were not different for these stages. 
Among Stage-2 awakenings, the reaction 


, times were clearly shorter (p < .01) when 


the awakening followed a Stage-1 period by 5 
minutes than when the awakening followed a 
Stage-1 period by 30 minutes. However, it 
made no difference whether the awakening 
followed an interrupted or an uninterrupted 
REM period. For the arousal thresholds, there 


` ewas no difference between 5-minute and 30- 


minute awakenings, but the awakenings 
which followed interrupted REM periods had 
significantly lower thresholds than the awak- 
enings which followed uninterrupted REM 
periods (p < .01). The reasons for this dis- 
„crepancy in results between the two measures 
Of reaction to an awakening stimulus are not 


:, Clear at this time. 


The subjective judgments tended to be of 
deeper sleep for the 30-minute awakenings 
than for the 5-minute awakenings, and deeper 
for the awakenings which followed uninter- 
tupted REM periods than for interrupted 


_ REM periods, as might be expected. Only 


the latter difference was significant, however 
($ < .05). 

DISCUSSION 
Dream Reporting 


The discovery of periods of rapid, conjugate 
eye movements during sleep, and the proposal 
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that the REM periods may be used to deter- 
mine when dreams occur, have had a remark- 
ably stimulating effect on experimental dream 
research. The proposal that dreams occur 
during REM periods is not likely to be dis- 
carded as a working assumption while it con- 
tinues to play such a useful role. However, 
there is a growing body of data that is be- 
ginning to pinpoint some problems for cur- 
rent theories of the dream process. 

It is clear that many reports of cognitive 
activity may be obtained when subjects are 
awakened from Stage-2, -3, or -4 sleep, and 
these reports require some explanation. Any 
adequate theory of the occurrence of the 
dream process should meet the minimum re- 
quirements that it be useful for predicting 
when dream reports will be given, and some- 
thing about the formal characteristics of these 
reports. None of the theories thus far pro- 
posed can meet these requirements. 

The assumption that some of the Stage-2, 
-3, or -4 reports are descriptions of hypno- 
pompic arousal experiences receives some sup- 
port from the findings of this study. It seems 
reasonable to propose that arousal experiences 
would be labeled as thoughts by the subjects, 
that these experiences would be reported by 
the subjects when the recall of a vivid dream 
is not possible, and when the awakening is 
gradual rather than abrupt. In line with these 
expectations, it has been found that thinking 
reports occur more frequently from Stage-2, 
-3, or -4 sleep than from REM-period sleep, 


$ TABLE 8 
INDICES OF SLEEP DEPTH DURING Non-REM SLEEP 


From Stage 2 


From Stage 3 or 4 


À Following Following 
it inte ted Following interrupted 
Index of sleep depth Oy une an REM periods by | RANE by | REM periods by 
8 5 30 CA 30’ 30’ 30’ 
` Eo sie time 
Seconds) from abrupt 
awakenings p 4 9 5 10 29 25 
Mein oe threshold 
‘ecibel) from gradual 
awakenings 44 45 29 27 65 46 
ie ete rating 
‘om 1 (awake) to 4 
(deep sleep) ) 21 26 1.9 2.0 24 2A 
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more frequently among subjects who claim 
that they rarely recall dreams at home than 
among subjects who claim that they are 
“good” dream recallers (Lewis, Goodenough, 
Shapiro, & Sleser, in press); and more 
frequently when the awakening is gradual 
rather than abrupt. However, it is not easy 
to account for the Stage-2, -3, or -4 dream 
reports, or even for all of the thinking reports 
in this way. In the data from this study, there 
were a substantial number of thinking reports 
as well as dreaming reports given from abrupt 
awakenings shortly after the subjects entered 
Stage-2, -3, or -4 sleep. 

Tf the Stage-2, -3, and -4 reports repre- 
sent descriptions of experiences which oc- 
curred during earlier Stage-1 sleep periods, it 
should be possible to predict something about 
the nature of the reports from a knowledge of 
the characteristics of the preceding Stage-1 
periods. In searching for characteristics of 
Stage-1 periods that might be useful in pre- 
dictions of this sort, it seemed to us that the 
distinction between nonemergent non-REM 
periods and emergent REM periods was the 
most dramatic difference among Stage-1 pe- 
riods that might be found, and offered the 
greatest hope for predictive success. How- 
ever, what seemed like a reasonable predic- 
tion, that the experiences reported would be 
more dreamlike if the preceding Stage-1 
period was an emergent REM period, and 
would be more thoughtlike if the preceding 
Stage-1 period was a nonemergent non-REM 
period, was not confirmed. A substantial 
number of dream reports were given by the 
subject from Stage 2, 3, or 4 shortly after 
falling asleep and with no intervening REM- 
period sleep, 

If these reports represent descriptions of 
experiences which occurred during the pre- 
ceding Stage-1 sleep period, then the data 
suggest that cognitive activity may be just 
as dreamlike in nonemergent non-REM pe- 
tiods when the subject falls asleep as it ap- 
parently is during emergent REM periods. 
This conclusion is inconsistent with published 
data on the nature of Teports obtained from 
nonemergent Stage-1 periods (Dement & 
Kleitman, 1957a; Foulkes, 1962), and is not 
easily reconciled with the view that rapid, 
conjugate eye movements during Stage-1 
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sleep invariably accompany visual scanning 
activity during dreams. i 

The obvious alternative is to conclude that 
dreams may occur during Stage-2, -3, or -4 | 
sleep. This conclusion is one that has been ' 
favored by an increasing number of research- _ 
ers (Foulkes, 1962; Kamiya, 1961; Recht- | 
schaffen et al., 1963), but it is far from a A 
specific assumption about the dream process 
that might prove useful in generating predic- © 
tions about when Stage-2, -3, and -4 reports 
occur. It suggests no explanation for the de- A 
crease in the tendency to give reports as 4 
function of the time after onset of Stage-2, 
-3, or -4 sleep. Nor is it immediately helpful 
in understanding why these reports so often 
are thoughtlike rather than dreamlike in 
character. 

These considerations suggest the need for 
much more data on the physiological and 
psychological events that occur, particularly 
during and shortly after the time when a 
subject falls asleep. It is at this time that the — 
Stage-2, -3, and -4 reports are frequent and 
appear most difficult to account for by a 
theory based on the REM-period criteria of 
dreaming. Such data might make it possible | 
to predict when the non-REM awakening 
will yield dream reports. 


Sleep Depth 


In the planning stages of this study, we | 
were pessimistic with regard to the value of — 
the reaction time to the abrupt awakenings’ 
as a behavioral measure of sleep depth. Cer- 
tainly the behavior involved in locating and 
lifting the telephone in a darkened room is 
complex, and one would suspect that the time 
involved should be multiply determined. It 
was somewhat surprising, therefore, to find 
that the reaction times were reasonably well 
related to EEG and subjective measures of 
sleep depth. In fact, these relationships were 
more consistent than were the relationships 
involving intensity (arousal) thresholds. 

Corresponding to the EEG records, the 
reaction times showed regular fluctuations 
that appeared to be unaffected by the fact 
that some REM periods were interrupted by 
experimental awakenings. Arousal thresholds, *. 
on the other hand, were generally lower when 
an awakening occurred during the preceding 
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REM period. In addition, arousal. thresholds 
were about the same for Stage-1 REM pe- 
` riods as they were for Stage-2 non-REM 
. periods. The finding that thresholds are no 
lower during Stage-1 sleep may be under- 
. stood, however, if dreams occur during Stage- 
1 REM-period sleep. 

It is conceivable that when a subject is 
absorbed in a dream, a gradually increasing 
external stimulus must be more intense before 

` it becomes effective as an arousal-producing 
agent. It is also conceivable that with’ an 
* awakening stimulus that increases in intensity, 
the bell may become incorporated into an 
‘ongoing dream and the arousal consequently 
delayed. Arousal delays of these sorts may 
be relatively common during Stage-1 REM- 
period sleep. Reaction times, on the other 
«hand, may not be influenced by this phe- 
nomenon. In terms of the EEG records, at 
:, least, arousal almost invariably takes place 
within a second or two, using our abrupt 
awakening method. The differences in reac- 
tion times are probably in great part a func- 
tion of recovery rates for cognitive orienta- 
tion, coordination, etc., reflecting some kind 
of preawakening psychophysiological tonus or 
readiness for response. If this view is correct, 
then it need not be surprising that reaction 
times are more closely related to the EEG 
Stages of sleep than are the arousal thresholds. 
Subjective judgments appear to be even 
more complex as a measure of sleep depth. 
' They are clearly influenced by events that 
occur during the awakening process and may 
be based on cues from the awakening stimu- 
lus itself. The inconsistency involved in many 
-of these judgments is most apparent in the 
large number of times the subjects said that 
they were awake, particularly when reaction 
times were long or arousal thresholds un- 
Usually high. As we have noted, these frequent 
mistakes in judging sleep depth seem related 
oe to the occurrence of some thinking reports as 
_ arousal artifacts. 
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STUDIES IN EFFICIENCY: 


MUSCLE-ACTION PATTERNS IN REACTION TIME AS RELATED 
TO INHIBITION OF EYELID CONDITIONING? 


HAROLD D. FISHBEIN 2 


University of Pennsylvania 


On the basis of the concept of psychological efficiency proposed by Wishner, it 
was predicted that efficiency in inhibiting eyelid CRs would be related to effi- 
ciency in reaction time (RT). 60 Ss were run 1 day per wk. for 3 wk. in an RT 
session followed immediately by an eyelid conditioning session. 30 of the Ss 
were run under passive instructions on the Ist eyelid conditioning session and 
under instructions to inhibit eyelid responses on the 2nd and 3rd sessions (PI 
procedure). The remaining 30 Ss were run under instructions to inhibit eyelid 
responses on the Ist and 2nd sessions, and under passive instructions on the 
3rd session (IP procedure). The prediction was confirmed in the PI procedure, 


but not the IP procedure. 


The primary purpose of this experiment 
was to study the relation between efficiency 
in reaction time (RT) and efficiency in the 
inhibition of eyelid conditioning. The second- 
ary purpose was to determine the stability 
of an individual’s RT efficiency. 

Wishner (1962a) defined the concept of 
efficiency as follows: given a set of task 
requirements for an individual, efficiency is 
some function of the ratio of focused (F) to 
diffuse (D) behavior in meeting task require- 
ments, where F is energy expended directly 
on the task and D is energy expended in 
nontask directions. 

In its broadest sense, efficiency refers to 
the appropriateness of behavior which people 
adopt in meeting environmental demands. It 
is thought that appropriateness can best be 
analyzed in terms of the ratio of relevant to 
irrelevant behavior in meeting these demands. 
Furthermore, it is assumed that efficiency is 
a general characteristic of the individual. 
Hence, a given individual’s efficiency score 
on a particular task indicates how appropriate 

1This report is based on a doctoral dissertation 
submitted to the University of Pennsylvania in 1963. 
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2Now at the University of Cincinnati. 


his behavior was on that task relative to 
others and further indicates what his relative” 
performance will be on other tasks. Thus far, 
the efficiency concept has successfully been 
applied to RT (Niebuhr, 1955); to semantic 
generalization (Peastrel, 1964); to GSR con- 
ditioning (Wishner, 1962b); and to operant 
conditioning (Letchworth & Wishner, 1962). 

The present study follows the work of 
Wishner, Peastrel, and Fishbein (1964). ` 
These authors used the pattern of muscle- 
action potentials in a simple RT task as a 
measure of efficiency, and related this to rate 
of GSR conditioning under two sets of in- 
structions. When the instructions were, in 
effect, to not condition, a significant positive . 
correlation was found between efficiency in 
RT and rate of conditioning. When the in- 
structions were, in effect, to condition rapidly, 
a nonsignificant negative correlation was 
found between efficiency in RT and rate ot 
conditioning. The interaction between effi- 
ciency in RT and type of instructions on 
rate of conditioning was significant. 

Is efficiency a general characteristic, or is 
it specific to a few tasks? One purpose of. 
the present experiment was to attempt to 
answer this question by relating RT efficiency 
to eyelid conditioning, If RT efficiency were 
related to GSR conditioning, but not to eyelid 
conditioning, this would suggest that efficiency — 
is not a general characteristic, and revisions 
in the concept would be necessary. Analysis 
of the task requirements in eyelid condition- 
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ing revealed two situations for which the 
task requirements were clear, and for which 
`- differential predictions would be made for ef- 
‘ficient and inefficient subjects. Using a light 


_* conditioned stimulus (CS), and an air puff 


unconditioned stimulus (UCS), for example, 
these task requirements took the forms of 
instructing subjects to blink as soon as the 
light appeared and before feeling the air 
puff, and of instructing subjects to refrain 
< from blinking while the CS was on. In the 
former case, one would predict that effictent 


` + subjects would produce more blinks during 


the CS-UCS interval than inefficient subjects. 
In, the latter case, one would predict that 
efficient subjects would produce fewer blinks 
than inefficient subjects. 
Previous studies concerned with individual 
_ differences in eyelid conditioning have uti- 
“lized so-called passive or neutral instructions. 
_, Analysis of the passive instructions, however, 
Indicates that the task requirements under 
these instructions are not clear. These in- 
Structions usually tell the subject to relax 
and to try not to control his natural eyelid 
responses. This is vague as to what is expected 
from the subject. For what is a natural eyelid 
response when the subject is made acutely 
aware of his eyelid responses by both the 
instructions and by being regularly struck on 
the cornea of his eye by puffs of air? Where 
the instructions are vague, it may be assumed 
that subjects will adopt some hypothesis as 
: to what behavior is being required of them. 
Thus, the resulting behavior may or may not 
Coincide with the experimenter’s expectations. 
Under instructions to refrain from blinking 
during the CS-UCS interval (inhibit instruc- 
tions) the task requirements are clear at 
the beginning of the experiment; however, 
the subject may consider other tasks during 
the experiment which conflict with the origi- 
hal instructions, Hence, blinking during the 
.CS-UCS interval to diminish the effects of 


, 


the UCS may become a more powerful moti- 


. Vator than following the experimenter’s in- 
structions, However, paying subjects to re- 
frain from blinking, and giving them feedback 
after every trial may overcome this. 

The hypothesis underlying this experiment 
Was that efficiency in inhibiting eyelid condi- 
tioned responses (CRs) under instructions 
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to do so, was related to efficiency in focusing 
during the performance of an RT task, in 
such a way, that subjects who were efficient 
in RT would produce fewer CRs than sub- 
jects who were relatively inefficient in RT. 


METHOD 
Subjects and General Procedure 


Subjects were paid volunteer undergraduate stu- 
dents. On each day, subjects were run first in an 
RT session in which their muscle-action potentials 
were measured. Immediately afterwards, they were 
taken to another laboratory and run in one of two 
eyelid conditioning procedures. Thirty subjects were 
used in each of the two conditioning procedures, 
for a total of 60 subjects. For both procedures, the 
sessions were held the same time each week, for 
3 weeks. In one of the procedures, the subjects 
were run for the first two eyelid conditioning sessions 
under inhibitory instructions and the third session 
under passive instructions (IP procedure). In the 
other procedure, the subjects were run for the first 
session under passive instructions and the second and 
third sessions under inhibitory instructions (PI 
procedure). One procedure was utilized for all sub- 
jects run during a given 3-week interval. During 
alternate 3-week intervals, the other procedure was 
used for all subjects. Subjects were assigned in this 
manner in order to minimize any effects of com- 
munication between subjects in the two conditioning 
procedures, 

RT procedure. The same procedure was utilized 
as in Wishner et al. (1964). Briefly, in each session, 
6 practice trials and 50 test trials of an identical 
nature were run. Muscle-action potentials were re- 
corded continuously through bipolar placements from 
the extensor digitorum communis of each arm. The 
chair on which the subject sat had arms to which 
telegraph keys were affixed. In front of the subject 
stood a panel on which two white lights and two 
green lights made up the four corners of a rectangle, 
the green lights on the bottom. A trial consisted of 
the onset of the green lights to which the subject 
was instructed to depress the two telegraph keys 
with the center finger of each hand. This was 
followed 3-6 seconds later by the onset of the right 
white light to which the subject was instructed to 
react as quickly as possible by lifting his right-hand 
finger from the telegraph key. This was followed 
3 seconds later by the left white light onset to 
which the subject was instructed to lift his left-hand 
finger with no concern for speed, which ended the 
trials. The intertrial interval was 5 seconds. 

Eyelid conditioning procedure. The subject sat in 
a dentist’s chair in the center of a soundproofed 
room. A 20-inch square translucent white Plexiglas 
panel was on the wall approximately 4 feet in front 
of the subject. Under the IP procedure, for each 
of the first two sessions, Day 1 and Day 2, subjects 
received 100 trials under inhibit instructions. On 
Day 3, subjects received 20 trials under inhibit 
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instructions and were then shifted to 100 trials under 
passive instructions. Under the PI procedure on 
Day 1 subjects received 100 trials under passive 
instructions. On Day 2, subjects received 20 trials 
under passive instructions and were then shifted to 
100 trials under inhibit instructions. On Day 3, 
subjects received 100 trials under inhibit instructions. 

A trial consisted of a 4 second white noise ready 
signal, followed by the light CS after a random 
3-, 4-, or 5-second interval. A 550-millisecond CS, 
a 500-millisecond CS-UCS interval, and a 50-milli- 
second 2-pound-per-square-inch (psi) air puff were 
used. A random 15-, 20-, or 25-second intertrial 
interval was employed with an average of 20 seconds. 
Under inhibit instructions feedback information was 
given within 1 second, where possible, after UCS 
termination. The red light from the feedback box 
was flashed for any pen deflection 1 millimeter or 
greater from the highest point of the base line, 
during the CS-UCS interval, The green light was 
flashed in all other cases, Subjects were paid at the 
end of the third session, 

Under inhibit instructions, subjects were informed 
of all the interstimulus intervals, that is, the time 
between the ready signal and CS, the CS-UCS inter- 
val, and the intertrial interval. The subject was 
told that every time he was able to keep his eyelid 
open while the CS was on and until he felt the 
air puff he would win a penny which would be 
signified by a flash of the green light from the 
feedback system, and that every time he even slightly 
closed his eye while the CS was on and before 
he felt the air puff, he would lose a penny which 
would be signified by a flash of the red light from 
the feedback system. Each subject was further told 
that he would start each session with $1.00 credit 
and that there would be 100 test trials. 

Under passive instructions, subjects were informed 
of none of the interstimulus intervals. Subjects were 
told that they would receive $1.50 for that session 
independently of their performance. The standard 
Towa State University passive-type instructions were 
then read. These instructions stress that the subject 
remain relaxed throughout the entire period and that 
he not try to control the natural responses of his eye, 


Apparatus 


RT laboratory. The RT lights were programed 
with a Western Union tape driver. Muscle-action 
potentials were recorded on a Grass Model III D 
EEG with Grason-Stadler integrators, in raw and 
integrated form. The integrated data were printed 
out every second on a Victor adding machine in 
units proportionate to accumulated microvoltage. All 
data on muscle-action potentials will be expressed in 
these units. 

Eyelid conditioning laboratory. A 7.5 watt par- 
tially shielded light was placed on the ceiling directly 
in back of the headrest and was kept lit at all times 
to maintain light adaptation and thus eliminate beta 
responses (Grant & Norris, 1946). The information 
feedback system consisted of a 6 X 6X1 inch box 
situated below the Plexiglas screen, containing visible 
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green and red lights, The CS consisted of an abrupt 
change in illumination and hue (to reddish-orange) 
of a 2.5 inch circular area of the Plexiglas panel from 


03 millilambert to 1.73 millilamberts. The UCS was ` 


delivered through a nozzle (1-millimeter diameter: 
orifice) set .5 inch from the center of the subject’s 
right cornea. 


Hunter timers controlled durations of the ready © 


signal, CS, UCS, and CS-UCS intervals. Trials were 
automatically presented by a programmer. Only the 
feedback system was manually operated. 

A potentiometer-polygraph system (similar to that 
used by Spence, 1953) was used to detect and record 
eyelid movements. Each subject wore an adjustable 
heddband which supported a padded aluminum plate. 


Mounted on this plate were the air nozzle and a. 


microtorque potentiometer. A light thread connected 
a plastic tab to the shaft of the potentiometer. 
Movements of the shaft caused a resistance change 
which was amplified and recorded on a Brush instru- 
ment polygraph. This instrument also recorded onset 


and termination of the CS. Paper speed was 120 § 


millimeters per second. 


Data Analysis 


The ratio of muscle-action potentials from the — 


right arm to muscle-action potentials from the left 
arm was taken as the index of efficiency. This ratio, 
called Prep R/L was computed for each trial by 
dividing Prep R, which was the accumulated micro- 
voltage from the right arm for the interval com- 
mencing 1 second after green light onset and ending 
at right white light onset, by Prep L which was the 
accumulated microvoltage from the left arm for this 
period. These measures were the same as those used 
by Wishner et al. (1964). Prep R/L, Prep R, 
Prep L, and RT to the right white light were each 
averaged for the 50 test trials each day, and for the 
total 150 test trials. 

For the eyeblink data, a response was scored if 
both of the following criteria were met: the eyelid 
closure departed from the base line within the CS-UCS 
interval; the deflection was at least 1 millimeter in 
magnitude within 50 milliseconds from its commence- 
ment. In cases of multiple responses, the second 
response was scored (there were no triple responses 
recorded) only if the pen had returned to the bas? 
line during the CS-UCS interval after the first 
response. Responses with latencies less than 150 
milliseconds were not counted as CRs. At most, one 
CR per trial was scored. Each subject was given @ 
score representing the mean number of CRs for each 
day, and a score representing the mean number of 
CRs for the first 200 trials under inhibit instructions. 


RESULTS 


For two subjects in the IP procedure, 
muscle-action potential measures were lost 


because of equipment failure. Hence, compat | 


sons of the mean performance of three R 
sessions for these measures are based upon 
58 subjects. 
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In Table 1 are the results of the split-half, 
first-half-second-half product-moment corre- 


+ lations of the muscle-action potential meas- 
. ures. These correlations are uncorrected for 


reduced sample sizes. As can be seen the per- 
formance of subjects within a given session 
was moderately stable, the minimum correla- 
tion equaling .60 ( < .01 in all cases). 

In Table 2 are the intersession product- 
moment correlations of Prep R/L, Prep R, 
and Prep L, as determined by the mean of 
50 test trials per session. In all casesy the 
performance of subjects between adjacent 
sessions was more stable than between the 
first and third sessions. As can be seen, all 
correlations were significant (p < .05 in all 
cases). 


TABLE 1 


Sprit-Hatr, Frrst-Hatr-Seconp-HaLr PEARSON r 
CORRELATIONS OF Prep R/L, Prep R, AND 
Prep L ror Eacu RT SESSION 


Measure Day 1 Day 2 Day 3 
Prep R/L 78 60 71 
Prep R 82, 80 19 
Prep L Wal .92 719 


The correlations between RT and Prep 
R/L were .26 on Day 1, .25 on Day 2, and 
06 on Day 3, The correlations on Days 1 
and 2 were significant (p< .05 in both 
cases). Thus, high-efficient subjects tended to 
have faster RTs than low-efficient subjects. 
The correlations between Prep R and Prep L 
were .60 on Day 1, .69 on Day 2, and .66 
on Day 3. 

Table 3 presents the intercorrelations be- 
tween the various muscle-action potential 


“Measures and the number of CRs under in- 


hibit instructions, the mean of I, and Ip (the 
first and second days, respectively, under 
inhibit instructions), and the number of CRs 


“under passive instructions. For the PI pro- 


cedure, the only significant negative correla- 


> tions were found between Prep R/L, the 
‘efficiency ratio, and number of CRs under 


inhibit instructions, The highest correlation 
between efficiency on any given day and 
number of CRs in Ix or Is was found when 
both scores, efficiency and conditioning, were 
made on the same day. This suggests that 
changes in the efficiency ratio from session 


TABLE 2 


INTERSESSION PEARSON 7 CORRELATIONS OF 
Prep R/L, Prep R, AND Prep L 


Days correlated | Prep R/L | PrepR | PrepL 
land 2 .49** 53%" ote 
2 and 3 Ai** .62** 713** 
land3 -29* 25% .65** 
*p <.05. 
Dp <01 


to session may reflect true variations in ef- 
ficiency. However, this is only suggestive in 
that differences between the highest and low- 
est correlations of efficiency with number of 
CRs in I; or Ip were not significant (p > .20). 

For the IP procedure, the only significant 
correlations were the negative correlations 
between Prep L with number of CRs under 
inhibit instructions. None of the correlations 
between the muscle-action potential measures 
and number of CRs under passive were 
statistically significant in either the IP or 
PI procedures. 

Figures 1 and 2 represent the performance 
of high-efficient and low-efficient subjects 
under the IP and PI procedures, respectively. 


TABLE 3 


PEARSON r INTERCORRELATIONS BETWEEN Prep R/L, 
Pree R, AND Prep L AND NUMBER or CRs FOR 
Each CONDITIONING SESSION: PI anp IP 
PROCEDURES SEPARATED 


Measure hth Ih Ik Passive 
PI procedure 
M Prep R/L =.39* | —.31 | —.56**| —.02 
M PrepR —.14 | —.09 | —.17 .05 
M Prep L .29 16 .37* 23 
Day 1 Prep R/L —.16 | —.32 03 
Day 2 Prep R/L} —.32 | —.48** 
Day 3 Prep R/L —.21 | —,54** 
Day 2 PrepR —.13 
Day 3 PrepR —.30 
Day2PrepL , 15 
Day 3 PrepL 30 
IP procedure 

M Prep R/L ely: .09 19 21 
M PrepR —.24 | —.27 | ~.22 05 
M PrepL —.42* | —.38* | —.40* | —.16 
Day 1 Prep R/L| .08 
Day 2 Prep R/L| .05 
Day 3 Prep R/L} .12 
Day 1 PrepR —.32 
Day 2 PrepR —.26 
Day 1 PrepL —.39* 
Day 2 PrepL —.37* 
*p <.05. 
ep <.01 
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IP procedure 


CRs 


Per Cent 


123456789101 234 
Day I 


Day 2 
Fic. 1, Percentage of CRs plotted in blocks of 10 trials, under the IP procedure, for high- and low-efficient 
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n=18 
n=lo 


High efficient subjects 
Low efficient subjects 


e—e 


56789100}: 2 12345678910 
Day 3 


subjects, (Efficiency scores are based on 3-day efficiency.) 


The conditioning data is plotted in blocks of 
10 trials for each day of the experiment. 
Subjects were categorized as high efficient 
or low efficient on the basis of their 3-day 
efficiency. Figure 3 shows the mean number 
of CRs for 2 days of conditioning under 
inhibit instructions for high-efficient and low- 
efficient subjects. 


PI procedure 


Per Cent CRs 


12345678 90121 2 


Day | 


x Day 2 
Fic. 2. Percentage of CRs plotted in blocks of 10 trials, under the PI procedure, for high- and low- 


The basic technique utilized for hypothesis 
testing was analysis of variance. This was 
accomplished by splitting groups at the 
median on the basis of their mean Prep R/L 
scores for the three RT sessions (3-day ef- 
ficiency). In the IP procedure, 18 subjects 
were classified as high efficient, and 10 as low 
efficient. In the PI procedure 11 subjects were 


nell 
n=19 


High 
Low 


efficient subjects 
efficient subjects 


345678901 2345678910 


Day 3 


efficient subjects. (Efficiency scores are based on 3-day efficiency.) 
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Mean Number of CRs 


PI 


Procedure 
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B—H High efficient subjects 


Low efficient subjects 


re 


Fic. 3. Mean number of CRs for 2 days under inhibit instructions for high- and low-efficient subjects. 
(Efficiency scores are based on 3-day efficiency.) 


classified as high efficient and 19 as low 
efficient. The Snedecor (1956) procedure for 
correcting this disproportionality was utilized. 
Table 4 indicates that the interaction be- 
tween efficiency and mean number of CRs 
under inhibit instructions for the PI and IP 
procedures was significant (p < .05). Under 
the PI procedure, high-efficient subjects sig- 
nificantly inhibited CRs better than low- 
efficient subjects (p < .05). Under the IP 
procedure, low-efficient subjects tended to 
inhibit better than high-efficient subjects, but 
not significantly so (p> .20). These results 
are representative of all the analyses of vari- 
ance done relating efficiency in RT to condi- 
tioning under inhibit instructions. Thus, 
when efficiency, as determined either by the 
mean of three RT sessions, or by scores from 
` the single RT session run on the same day 
-as I, or Is, was related to number of CRs 
on I; or Ig, a significant interaction was found 
such that in the PI procedure, high-efficient 
Subjects produced fewer CRs than low- 
efficient subjects (Figure 2), and under the 
IP procedure, no significant difference in per- 
formance was found between _high-efficient 
and low-efficient subjects (Figure 1). The 
sole exception was found when 3-day ef- 
ficiency was related to number of CRs on Ih, 
where the interaction was not significant 
($ > .20), although the same configuration 


seen on Figure 3 was found. A repeated 
measures analysis of variance (Winer, 1962) 
relating 3-day efficiency to conditioning scores 
for the first 50 trials and second 50 trials 
of I, in the PI procedure found the inter- 
action to be significant (F = 64, df=1, 
p < .025). A ż test between the mean per- 
formance of high- and low-efficient subjects 
on the last 50 trials showed that high- 
efficient subjects produced significantly fewer 
CRs than did low-efficient subjects (p < .05). 
This can be clearly seen on Figure 2. 
Inspection of Figure 2 shows that in the 
PI procedure high-efficient subjects tended to 
give fewer CRs under passive instructions 
than did low-efficient subjects. In light of 
this higher responding of the low-efficient sub- 
jects under passive instructions, further sta- 
tistical analyses were carried out with the aim 


TABLE 4 


ANALYSIS OF VARIANCE RELATING EFFICIENCY TO MEAN 
NuMBER or CRs UNDER INHIBIT INSTRUCTIONS 
FOR THE PI AND IP PROCEDURES 


Source af MS F 
Efficiency (A) 1 255.2 — 
Procedure (B) il 344.3 — 
AXB 1 3307.5 6.60* 
Within 54 501 

*p <.025, 
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of determining whether the predicted results 
found under inhibit instructions were due to 
some chance experimental biasing. 

Analyses of covariance for the PI pro- 
cedure eliminating the effects of performance 
differences on Day 1 from I; and Is, respec- 
tively, indicates that in I1, the performance 
of high- and low-efficient subjects was not 
significantly different (F=.6, df= D 
> .20), but that with the effects of Day 1 
performance eliminated from Ie, high-efficient 
subjects produced fewer CRs than low- 
efficient subjects (F = 5.5, df = 1, p< .05). 
The same results were found when the ef- 
fects of performance differences in the 20 
trials under passive instructions on Day 2 
were eliminated from I, and Is, respectively. 
Thus, analysis of covariance did not change 
the pattern of results in the PI procedure. 

Repeated measures analyses of variance for 
the IP and PI procedures were done relating 
3-day efficiency to performance on day under 
inhibit instructions, I; and Iz, In the IP 
procedure, the relative performance of high- 
and low-efficient subjects did not change 
from I; to Iz (F=11, df=1, p> .20), 
but that for the PI procedure the interaction 
was significant (F= 4.6, df=1, p< .05), 
indicating that the performance of high- 
efficient subjects slightly improved from IL 
to Iz, whereas the performance of low- 
efficient subjects deteriorated from I, to In. 
These results can be seen on Figures 1 and 2. 

In the PI procedure under inhibit instruc- 
tions, for either I, or Is, when subjects were 
categorized into above or below median Prep 
L, Prep R, or latency of RT, ¢ tests indicated 
no significant differences in eyelid condition- 
ing between the above and below median 
groups (% > .20 in all cases), Under the IP 
procedure, high Prep L subjects gave signifi- 
cantly fewer CRs than low Prep L subjects 
for Is (p < .05), but not for Iı (p> .20). 
However, no significant performance differ- 
ences were found between above median 
Prep R subjects and below median Prep R 
subjects, or above median RT subjects, or 
below median RT subjects for either I, or Iz 
($> .20 in all cases). Thus, in the PI pro- 
cedure, only the efficiency ratio, and not its 
components, or RT was related to ability to 
inhibit CRs. In neither the PI nor IP pro- 
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cedures, when efficiency, Prep R, Prep L, and 
latency of RT were related to performance 
under passive instructions, were significant 
performance differences found between above 
median subjects and below median subjects 
(p > .10 in all cases). 


Discussion 


The basic finding of this experiment was 
that in the PI procedure high-efficient sub- 
jects inhibited CRs better than low-efficient 
subjects under instructions to inhibit CRs, 
Under the IP procedure, low-efficient sub- 
jects tended to inhibit CRs better than high- 
efficient subjects, but this difference was not 
significant (Figure 3). The interaction be- 
tween conditioning procedure and efficiency 
was significant (Table 4). Thus, the experi- 
mental hypothesis was confirmed under the 
PI procedure, but not in the IP procedure. 

One possible interpretation of this finding 
is that the PI procedure is a more sensitive 
test of efficiency in inhibiting CRs than the 
IP procedure. It seems plausible to suppose 
that in order for individual differences in 
efficiency to become manifest, some minimum 
amount of responding must be present at the 
time the inhibit instructions are given. Thus, 
in the PI procedure, subjects were required 
to inhibit a relatively high ongoing level of 
responding, whereas in the IP procedure sub- 
jects started out with essentially zero level of 
responding. This interpretation suggests that 
Procedures which either increase or decrease 
initial responding at the time inhibit instruc- 
tions are given, should either increase orde- 
crease the magnitude of the relationship be- 
tween efficiency in RT and efficiency in eyelid 
conditioning. For example, it would be ex- 
pected that this relationship would be of a 
higher magnitude in a group which received 
200 trials under passive instructions preceding » 
inhibit instructions, than a group which re- 
ceived 25 trials under passive instructions pre- 
ceding inhibit instructions, because subjects 
in the former group would presumably be 
responding at a higher level at the time 
inhibit instructions were given than would 
subjects in the latter group. In conjunction 
with this interpretation, it should be noted 
that when subjects in the PI procedure were 
switched from passive to inhibit instructions, 


mces between high- and low-efficient 
ts became manifest only in the last 
block of 5 trials on Day 2 (Figure 2). It 
would be interesting to determine whether 
of prior experience under passive in- 
ctions also affects the rapidity with which 
and low-efficient subjects differentiate 
ves on subsequent performance under 
ibit instructions. 
_ Interpretation of the finding that Prep L 
opposite correlations with conditioning 
der the PI and IP procedures is difficult. 
f one assumes that tension level reflects 
gnitude of drive (Malmo, 1958), then 
the direction of the correlations of Prep 
o conditioning one must interpret the IP 
edure as being a complex situation in 
which high drive leads to slow conditioning 
and the PI procedure as a simple situation in 
ich high drive leads to high conditioning 
Spence, 1958). Intuitively, this argu- 
Ment does not seem plausible. Indeed, one 
uld expect the PI procedure to be more 
plex than the IP procedure in that under 
it instructions in the PI procedure, sub- 
are required to inhibit high levels of 
nding. 
hen the results of the present study are 
en in conjunction with those from the 
er et al. (1964) study, there appears to 
some generality of the relation between 
siency and the inhibition of CRs. Recall 
in the Wishner et al. study, a significant 
ve relation was found between efficiency 
T and rate of conditioning under instruc- 
to, in effect, resist conditioning; whereas 
n the instructions were, in effect, to condi- 
Tapidly, no significant relation was found 
een efficiency in RT and rate of condi- 
ig. Further, in neither the present study 
or the Wishner et al. study was RT syste- 
tically related to inhibition of CRs. This 
f a relationship underscores the value of 
ilizing the efficiency ratio in predicting indi- 
dual differences in conditioning. 
it was argued in the introduction that the 
. requirements under passive instructions 
ambiguous, and hence, a prediction could 
made relating efficiency in RT to 
tioning under these instructions. The 
ce of significant correlations among effi- 
p: x 


by . 
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ciency and number of CRs under passive in- 
structions tend to support the a priori analy- 
sis of the task requirements under passive 
instructions. Taken in conjunction with the 
predicted significant correlations between effi- 
ciency and number of CRs under inhibit 
instructions, the value of the method of analy- 
sis required by the concept of efficiency is 
also supported. 
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SEX OF THE DOMINANT PARENT ON SEX- 
ROLE PREFERENCE, IDENTIFICATION, 

AND IMITATION IN CHILDREN 
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This study investigated the effects of sex of the dominant parent on sex-role 
preferences, parent-child similarity, and the childs’ imitation of the parent in 3 
age groups. Parental dominance was found to’ facilitate imitation in both boys 
and girls. Maternal dominance was related to disruption in the formation of 
masculine sex-role preferences in boys and low father-son similarity. Parental 
dominance had little effect on sex-role preferences in girls or in mother-daughter 
similarity, but paternal dominance was related to increased father-daughter 


similarity. 


Most theories of identification agree that 
identification is based on a process or proc- 
esses whereby the child through imitation, 
modeling, or introjection acquires traits, char- 
acteristics, and values similar to the parent. 
In normal development the boy is assumed 
to identify with the father, and the girl with 
the mother which results in the preference for 
and adoption of appropriate sex-role behavior. 
Psychoanalytic theory has stressed the role of 
fear of punishment and identification with the 
aggressor, while. learning theory has stressed 
the facilitating effects of reward in promoting 
identification, A third theory (Parsons, 1955) 
has emphasized the importance of total pa- 
rental power in the development of identifica- 
tion. According to Parsons, the child identifies 
with the parent because he is powerful in his 
ability to dispense both rewards and punish- 
ments. Several recent experiments support the 
position that parental power or dominance 
plays a major role in identification (Hether- 
ington & Brackbill, 1963; Mussen & Distler 
1959). 

In identifying with parents, children may 
acquire traits and values which are particu- 
larly characteristic of either male or female 
roles in our society. This would be directly 
related to the formation of sex-role_ prefer- 
ences. They may also imitate parental behay- 
iors which are not sex typed but which are 
equally appropriate in males and females, It 
would be expected that the rate of develop- 
ment and type of sex-role preference would 


? 


be related not only to parental behavior but 
also to social pressures to conform, and the 
status of a given sex in the culture. Parent- 
child similarity in traits which are not sex 
typed should have fewer extra familial social 
sanctions bearing on them and_ therefore 
should be more directly and consistently re- 
lated to reinforcements and imitative models 
provided by the parents. Since the child has 
relatively few social contacts outside the 
family in the preschool years it would be ex- 
pected that identification on both sex-typed 
and non-sex-typed measures would be closely 
related to family power structure in 4- and 5- 
year-old children, However in older children 
sex-typed behaviors should be increasingly in- 
fluenced by social norms. Boys will be en- 
couraged by peers and adults outside the 
family to develop masculine sex-role prefer- 
ences and girls feminine sex-role preferences 
and these preferences should therefore be less 
directly related to parental dominance in older 
than in younger children. It would also be 
predicted that because of the greater prestige 
and privileges of males in our culture, girls 
will be slower and less consistent in develop- 
ing appropriate sex-role preferences than boys 
(Brown, 1956, 1958). In contrast parent-child 
similarity on non-sex-typed traits should be 
closely related to parental dominance in both 
preschool and school aged boys and girls. 
This similarity would be expected to increase 
with age as identification is more fully estab- 
lished. 
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IDENTIFICATION IN CHILDREN 


If children identify with the dominant par- 
ent and this parent is the same sex as the child 


‘this should facilitate the development of nor- 


ge- 


. mal sex-role preferences. If the dominant par- 
* ent is the opposite sex of the child this should 
+ strengthen cross-sex identification and may 
«retard the development of normal sex-role 

preferences. This disruption in identification 
and sex-role preferences should be particu- 
larly marked in boys from mother-dominant 

“homes since the acquiescing father supplies a 

socially inappropriate model for the son. In 

“contrast, girls from father-dominant homes at 

least have parental models whose power rela- 
tionships are appropriate for their culturally 
defined sex roles, Although it could be argued 


e that a dominant mother does not provide a 


normal sex-role model for girls, maternal 
dominance may not preclude the mother hay- 
ing other feminine traits with which a daugh- 
», ter will identify. 

If imitation is involved in identification, 
childrens’ performance on an experimental 
task involving imitation of the parents and a 
measure of parent-child personality similarity 

should be positively related. Past research 

_ (Bandura, 1962; McDavid, 1959) suggests 
that although children tend to imitate the 
most powerful model there is a greater readi- 
ness for both boys and girls to imitate a male 
model. Thus children should imitate and have 
Personality traits similar to the most domi- 
nant parent, particularly if it is the father. 

' The findings of McDavid also suggest that 
girls will imitate increasingly with age while 
boys will imitate less. 

The present study investigated the effects 
of parental dominance on sex-role preferences, 
Parent-child trait similarity, and imitation of 
boys and girls of three different age levels. 


METHOD 
Subjects 


+ Subjects were three groups of 36 boys and 36 girls 
ages 4-5, 6-8, and 9-11 enrolled in nursery schools 


Ei or in elementary schools in a public-school system. 


Half of the boys and girls in each group came from 


mother-dominant homes and half from father-domi- 
nant homes, 


Experimenters 


nae Of the subjects in each condition were run by 
ate experimenters and half by female experimenters. 
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Since no experimenter differences were found in a 
preliminary analysis of the data, this variable was 
not considered in the final analysis. 


Parental Dominance Measure 


The parental dominance measure was adapted from 
a procedure by Farina (1960). Farina’s problem 
situations were modified to make them more suitable 
for all age levels in the present study. Each parent 
was seen individually in a quiet room in his own 
home. He was read 12 hypothetical problem situa- 
tions involving child behavior and asked how he 
would handle them when he was by himself. Both 
parents were then brought together and asked to ar- 
rive at a compatible solution on handling these chil- 
drens’ problems. The discussion of each problem con- 
tinued until both parents said the terminating signal, 
“agreed.” The experimenter participated only mini- 
mally in the discussion in order to clarify scoring 
responses, All interviews were tape-recorded and 
scored later. 

The scoring procedure was identical with that 
used by Farina (1960) which involved seven indices 
of parental dominance. If six of the seven indices in- 
dicated paternal dominance the family was classified 
as father dominant; if six of the indices indicated 
maternal dominance the family was classified as 
mother dominant. A total of 326 couples were run 
in order to obtain the 108 mother dominant and 108 
father dominant families for the study. 


Procedure 


The study was comprised of three procedures: the 
It Scale for Children (ITSC; Brown, 1956), a parent- 
child similarity task, and an imitation task. 


ITSC 


The ITSC, a projective test of sex-role preference, 
was administered to each subject at school. In this 
test the child is presented with a drawing of an am- 
biguous child figure referred to as “It,” and is asked 
to choose what It would like in a series of 36 pic- 
ture cards of objects and figures identified with 
masculine or feminine roles. Scores can range from 
0, exclusively feminine choices, to 84, exclusively 
masculine choices. It is assumed that “the child 
will project himself or herself into the It-figure on 
the basis of his or her own sex-role preference, and 
will attribute to It the childs’ own role preference 
[Brown, 1956, p. 5].” 


Parent-Child Similarity Measure 


A list of 130 adjectives was given to 10 advanced 
education. students who were asked to categorize the 
adjectives as more characteristic of males, or fe- 
males, or as neutral (equally applicable to both 
sexes). They were also asked to check whether each 
adjective was descriptive of the behavior of children 
or adults or both. Forty adjectives which had been 
rated as neutral by 90% of the judges and as de- 
scriptive of both adults and children by all were in- 
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TABLE 1 
ANALYSIS OF VARIANCE oF ITSC Scores 


Source df MS F 

Parental dominance (D 1 | 1,247.07] 4.67* 
Sex of child (S) my 1 | 32,047.04 120.03** 
Age (A) 2 | 2,055.70 7.70** 
DXS 1 | 4,329.12] 16.21** 
DXA 2 265.29) <1 
SXA 2 | 6,520.29) 24.47** 
DXSXKA 2 154.92] <1 
Error 204 267.00 

Total 215 

* 

ab Sio 


cluded in the final list (e.g., friendly, honest, im- 
aginative, humorous, pleasant, capable, etc.). These 
adjectives appeared to give a broad view of non- 
sex-typed personality traits. 

Parents were asked to give the name of someone 
who knew them well enough to rate them on the 
adjective check list. Different raters were used for 
each parent. The lists were sent to the raters with 
assurances of the confidentiality of the responses and 
were returned by mail to the experimenter. If the 
list was not returned in 2 weeks, raters were con- 
tacted by phone. Eleven of the raters refused to co- 
operate and others suggested by the parents were 
substituted. Children were rated on the same list by 
their teachers. In an attempt to control for response 
bias raters were asked to mark 15 adjectives which 
were most like the ratee X, 15 which were most un- 
like the ratee O, and to leave 10 blank, 

The procedure was repeated on 45 of the families 
1 month later. The test-retest reliabilities were .82 
for ratings of mothers, .86 for fathers, and .79 for 
children. 

Similarity scores were based on the number of 
identical responses in the mother and child, and 
father and child lists, Two blanks, X-X or 0-0 pair- 
ings on an item were scored as similar. 


Imitation Task 


Each child was run in his home on the imitation 
task twice, once with each parent as a model. The 
child was instructed that he and his parent were par- 
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ticipating in a study attempting to evaluate what 
things people think are prettiest. While the child 
watched, the parent, who had been coached before 
the experimental session, was asked to indicate by 
pointing and naming which picture in each of 20 
pairs of pictures he thought was prettiest. The par- 
ent repeated this procedure three times consistently — 
selecting the predetermined prettiest pictures. The 
child then went through the series once selecting the“ 
pictures he thought were prettiest. One month later : 
the same procedure was repeated with the other par- 
ent and a second series of pictures. Order of pres- 
entation of list and parent models was balanced for 
male and female children. An imitation measure of- 
number of similar responses was derived for mother- i 
child and father-child, 1 


RESULTS 


Separate analyses of variance were run on 
ITSC scores, parent-child similarity measures, q 
and parent-child imitation scores. Differences _ 
between means were tested by Duncan multi- 
ple range tests; those reported are significant 
at $ < .05. 


Sex-Role Preference 


The summary of an analysis of variance of 
ITSC scores for all subjects is presented in $ 
Table 1. As was expected, parental dominance 4 
influenced sex-role preference. More appro- — 
priate sex-role preferences occurred when the | 
father is dominant than when the mother is 
dominant. Duncan range tests indicated that 
the differences in sex-role preferences for girls 
from mother- and father-dominant homes were 
nonsignificant, however these differences for 
boys were significant at all ages. Mother í 
dominance was related to less masculine sex- 
role preferences in boys. 

As predicted, girls were later in developing 
feminine sex-role preferences than were boys 
in masculine sex-role preferences, Boys at age 


TABLE 2 
MEANS AND STANDARD Deviations or ITSC Scores 


Mother dominant 


Father dominant 


Ages Boys Girls Boys Girls 
M SD M M SD M SD 
4-5 52.4 14.3 41.8 15.9 67. 
: F 7 3 13.1 3 16.9 
ee 53.7 22.3 55.1 143 66.9 14.9 ie 16.1 
61.0 20.9 24.1 15.9 74.2 212.7 | @ 19.1 15.9 


7 
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4-5 had already developed a preference for 
the masculine role which continued and in- 
«< creased slightly but nonsignificantly through 
ages 9-11. In contrast, girls showed a signifi- 
-‘cant increase in preference for the feminine 
role in the age 9-11 group. The means and 
standard deviations of ITSC scores for the 
_ three age groups of boys and girls from 
mother- and father-dominant homes are sum- 
marized in Table 2. A high score indicates 
masculine sex-role preferences. 


` Parent-Child Similarity 


A summary of the analysis of variance of 
» mother-child and father-child similarity meas- 
ures obtained from the adjective check list 
“ratings by friends and teachers is presented in 
Table 3. Duncan range tests indicates that 4- 
and 5-year-old children obtained lower par- 
ent-child similarity ratings than the two older 
groups who did not differ from one another. 
Dominance played an important role in 
parent-child similarity. Children tended to be 
more similar to the dominant parent than the 
passive parent. Mother dominance appeared 
sto inhibit father-child similarity. This simi- 
clarity was lower than mother-child similarity 
in father-dominant homes. The disrupted 
“identification of boys was again particularly 
marked in mother-dominant homes. In father- 
dominant homes the boy identified signifi- 
cantly more with the father than the mother, 
í however in mother-dominant homes this rela- 
tion was reversed. In fact, the mother-son 
similarity in mother-dominant homes did not 
» „differ from the father-son similarity in father- 
dominant homes. : 
„In contrast, girls in father-dominant homes 
identified equally strongly with both parents. 


IDENTIFICATION IN CHILDREN 
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TABLE 3 


ANALYsIS OF VARIANCE OF PARENT-CHILD 
SIMILARITY SCORES 


Source af MS F 

Parental dominance (D) 1 498.38 | 18.06** 
Sex of child (S) 1 2.38 | <1 
Age (A) 2 273.17 9.90** 

xs 1 107.98 3,91* 
DXA 2 29.82 1.08 
SXA 2 58.42 2.13 
SXDXA 2 15.91 | <1 
Error (a) 204 27.59 
Similarity (Sim) 1 | 56490 | 20.43" 
Sim X D 1 | 2,380.08 | 86,08** 
Sim X S 1 630.74 | 22.81** 
Sim X A 2 23.85 | <1 
Sim X D XS 1 158.92 5.15": 
SimX DXA 2 59.55 2,15 
Sim X SX A 2 39,82 1.44 
SimX DXSXA 2 52.78 1.91 
Error (b) 204 27.65 

Total 431 

*p <.05. 

++p <.01 


Also their mother-daughter similarity did not 
differ from that in mother-dominant homes, 
On the other hand, girls in mother-dominant 
homes identified notably more with mother 
than father. The father-daughter similarity 
scores in mother-dominant homes were sig- 
nificantly lower than similarity scores in any 
other group. The means and standard devia- 
tions of parent-child similarity scores for all 
groups are summarized in Table 4. 


Parent-Child Imitation 


A summary of the results of the analysis of 
variance of imitation scores is presented in 
Table 5. In the childs’ imitation of the parent, 
as in sex-pole preferences and parent-child 
similarity, parental dominance was a signifi- 


TABLE 4 


MEANS AND STANDARD DEVIATIONS 


OF PARENT-CHILD SIMILARITY SCORES 


Mother dominant 


Father dominant 


Ages: 4-5 6-8 9-11 45 6-8 9-11 
Similarity M SD M SD M SD M SD M SD M SD 
-Mother-daugh 211 | 4.3 | 20.1 
ghter 191 | 53 | 198] 45 | 219] 58 | 187] 5.3 | 211] 4 A | 5.9 
Paother-son 123 | 22 | 204 | 61 | 233 | 53 | 158] 60 | 152] 43 | 153 | 45 
Pather-daughter 116 | 41 | 120| 46 | 12.7 | 3.6 | 182] 5.8 | 19.5 | 6.0 | 183 | 6.3 
_ Father-son 121 | 44 |162| 45 | 15.0 | 43 | 17.6 | 68 | 23.3 | 54 | 236) 48 
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TABLE 5 


ANALYSIS OF VARIANCE OF PARENT-CHILD 
IMITATION SCORES 


Source df MS F 

Parental dominance (D) 1 1.12, <1 2 
Sex of child (S) 1 108.00 10.58** 
Age (A) 2 5.56 | <1 
DXS 1 01 | <1 
DXA 2 20.10 1.97 
SXA 2 24.02 2.35 
DXSXA 2 497 | <1 
Error (a) 204 10.21 
Imitation (I) 1 1.56 | <1 
IxD 1 | 560.34 | 58.30** 
IXS 1 15.57 1.62 
IXA 2 23.16 2.41 
IXDXS 1 6.25 | <1 
IXDXA 2 4.89 | <1 
IXSXA 2 3.71 | <1 
IXDXSXA 2 7.50 | <1 
Error (b) 204 9.61 

Total 431 

*b <.05. 

<0, 


cant factor. Children of both sexes imitated 
the dominant parent more than the passive 
parent. The prediction that children would 
imitate the father more than the mother was 
not confirmed. There were no significant dif- 
ferences between imitation of the dominant 
mothers and fathers, or between passive moth- 
ers and fathers, As was expected on the basis 
of past research, girls imitated more than 
boys; however no differential trends with age 
for the two sexes were found. The means and 
standard deviations of imitation scores are 
presented in Table 6. 


Relations among Measures 


The ITSC scores, father-child and mother- 
child similarity Measures, and father-child 
and mother-child imitation scores were corre- 
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lated separately for boys and girls in mother- 4 
and father-dominant homes for each age |’ 
group. The results suggest that these meas. 
ures are meaningfully related only for older 
girls from father-dominant homes. In this 
group, girls who imitate their mother are 
rated as being similar to them on the adjeg; | 
tive check list (r = .58, p < .01) and have. 
feminine sex-role preferences (7 = — 68, $ 
< .01). When these girls have feminine sex- |. 
role preferences they are rated as similar to 
their mothers (r= — .67, p< 01), when 
they have masculine sex-role preferences they 
are rated as similar to their fathers (r = 42, 
P < .05). It appears that in homes where par- 
ents serve as culturally appropriate sex-role 
models with the father more dominant than 
the mother, girls’ feminine sex-role preferences 
are closely related to imitation of the mother 
and similarity to the parents on other traits 
which are not sexually defined. ‘ 

In contrast, there are few systematic or 
meaningful relationships for girls from mother- 
dominant homes or for boys from either 
mother- or father-dominant homes. There was 
a tendency for parent-child imitation to be 
telated to parent-child similarity in these 
groups, but there was little relationship be- 
tween sex-role preference and the other meas: 
ures. In fact, for boys there were no signifi- 
cant correlations between the ITSC and any 
other measures at any age. 


Discusston 


i 


The results appear to support partially 4 
theory of identification based on parental,» 
power. Parental dominance influenced chil- | 
dren in imitation of parents, sex-role prefer- 
ences, and similarity to parents in non-Sex |, 


TABLE 6 
MEANS AND STANDARD DEVIATIONS oF PARENT-CHILD Imitation SCORES 


Mother dominant 


Father dominant 


Ages: 4-5 6-8 9-11 4-5 6-8 9-11 

Imitati M | sD sD 
imitation M | sp | mu SD M SD M SD a 
Mother-daughter |133 | 3.4 | 13.4 | 37 |147| 3a | a 7| 26 10.7 | 38. 
Mother-son 12.3 | 32 | 12.0 | 26 |123 | £5 | 109] Z8 102 38 9.1 | 28 
Hather-daughter | 10.1 | 3.1 | 112] 30 | its | os | 3 1.8 | 31 | 142 | 31 | 144 | 32 
Father-son POTS 23 UAO8 He 31 atte On| ist. ng) ae | sil iio. | 3a 
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` typed traits. Inversions of the normal pa- 


rental dominance pattern were related to more 


. disruption in the identification of boys than 


of girls. Boys from mother-dominant homes 
‘acquired non-sex-typed traits like the mother 
and also more feminine sex-role preferences 
“than boys from father-dominant homes. Con- 
“trary to expectations these differences in sex- 
role preferences were present at ages 4-5 and 
were sustained through ages 9-11. The pre- 
dicted decrease in the relationship between 
“sex-role preference and parental dominance 


- with age did not occur. It appears that later 


social pressures on boys to acquire masculine 


- preferences do not adequately counteract the 


early developed more feminine preferences of 
boys in mother-dominant homes. The predic- 


' tion of an increasing similarity of the child 


and dominant parent on non-sex-typed traits 
with age is supported. This similarity is sig- 
nificantly less in 4- and 5-year-olds than in 


‘solder children. Since the period from 3 to 6 


years is considered to be a critical formative 
one in which identification is rapidly chang- 
ing and growing, this increase in similarity 
following the preschool years might be antici- 
pated. After this marked increase in identifi- 


. cation, a stabilizing of identification on non- 


sex-typed traits appears to occur early in the 
school aged years. 

Tt was surprising that girls from mother- 
and father-dominant homes showed no dif- 
ference in sex-role preference at any age, since 


„it might be assumed that a dominant mother 


offers her daughter a rather “unfeminine” role 
model. However, it should be remembered that 
the measure of maternal dominance in this 
study was one of dominance relative to the 
spouse and not to other members of her own 
sex, Thus a mother could be more dominant 
than a passive husband and still not be domi- 
nant or “unfeminine” relative to other women. 


< The significance of parental dominance rela- 


ve to other members of the same sex re- 
Mains to be investigated. It is possible that 


. Since the feminine role in our culture is less 


well defined, less highly valued (Lynn, 1959; 
McKee & Sherriffs, 1957), and later acquired 
than the male role, these factors attenuate any 
differences due to maternal dominance. 

It could be argued that lack of appropriate 
paternal dominance rather than the presence 
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of maternal dominance led to the obtained 
findings. If boys and girls initially both 
identify with the mother, the socially appro- 
priate sex-role behavior of paternal dominance 
may be necessary to facilitate the shift in 
identification models for the boy. Since nor- 
mal identification for girls involves sustaining 
and intensifying the mother-child relationship, 
father dominance may contribute only to 
cross-sex identification and do little to disrupt 
the girls’ primary identification, Social pres- 
sures may also encourage the child to identify 
with the like-sexed parent unless his behavior 
is culturally inappropriate. Since the feminine 
role is less well defined than the masculine 
role, either dominant or passive behavior in 
mothers may be regarded as more acceptable 
than passivity in fathers. Evidence for this 
is provided not only by the different results 
for boys and girls on the ITSC, but also by 
the parent-child similarity on traits not influ- 
enced by sex typing. Paternal dominance fa- 
cilitated cross-sex identification in girls but 
did not disturb like-sex identification. Thus 
the mother-daughter and father-daughter simi- 
larity in father-dominant homes, and mother- 
daughter similarity in mother-dominant homes 
did not differ significantly. However, neither 
the sons nor daughters in mother-dominant 
homes identified with the passive father. 

This interpretation of the role of dominance 
in identification appears to be consistent with 
the psychoanalytic stress on the great impor- 
tance of “identification with the aggressor” in 
boys’ identification, and its lesser importance 
in the identification of girls. 
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EFFECTS OF MANIPULATED SELF-ESTEEM ON 
PERSUASIBILITY DEPENDING ON THREAT 
AND COMPLEXITY OF COMMUNICATION + 
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Self-esteem, manipulated by inducing success or failure on an ego-involving task, 
had the following predicted effects on persuasibility: (a) Lowered self-esteem 
increased persuasibility when the advocated opinion was not threatening and the 
communicator’s position was clearly stated. (b) Lowered self-esteem decreased 
persuasibility when acceptance of the advocated opinion would generally result 
in substantially increased threat to Ss, (c) Lowered self-esteem decreased per- 
suasibility when the communication was complex and contained misleading, 
closure-inducing phrases. In the first 2 cases, self-esteem affected acceptance 
components; and in the last case, it affected learning components of persuasi- 


bility. 


The purpose of this experiment was to in- 


vestigate some possible relationships between * 


self-esteem and persuasibility. It was expected 


*, that these relationships would vary with dif- 


m 


; 


3 


ferent characteristics of a communication. 
The hypothesis is widespread that self- 
esteem is inversely related to various forms 
of conformity behavior, including persuasibil- 
ity. Evidence for the inverse relation between 
self-esteem and persuasibility has been re- 
ported by Janis (1953b, 1954, 1955), Janis 
and Field (1959), Janis and Rife (1959), 
and, for children, Lesser and Abelson (1959). 
However, it seems probable that different 
types of communication would interact with 


` self-esteem and produce different effects on 


persuasibility. We reasoned that the findings 
of increased persuasibility with lower self- 
esteem probably depended on quite specific 
‘characteristics of the communication. By 
varying crucial aspects of the communication, 
we predicted varying relationships between 


self-esteem and persuasibility. 


i 
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Kasparek and Karen White for assistance in con- 
ducting the experimental sessions. 


Simple Hypothesis 


When the advocated opinion is nonthreat- 
ening and clearly and unambiguously stated, 
there is an inverse relationship between self- 
esteem and persuasibility. This hypothesis, 
without any limiting conditions being imposed 
on the content and form of the communica- 
tions, is the one which has been tested in the 
research cited above. It may be derived from 
the following considerations: Subjects feeling 
low in self-esteem think more unfavorably of 
themselves and their opinions, and therefore, 
in attempting to enhance their self-esteem 
are more likely to accept the opinions of oth- 
ers, In addition, it is assumed that being in 
agreement with others has, for most people in 
our culture, an acquired reward value, and 
that reassurance resulting from agreeing with 
others enhances self-esteem. 


Fear Hypothesis 


In cases where acceptance of a clelfly and 
simply stated opinion would cause a substan- 
tial threat to the self, there is a direct positive 
relationship between self-esteem and persuas- 
ibility. It is assumed, except perhaps in some 
cases of pathological depression, that people 
with threatened self-esteem do what they can 
to compensate for or minimize their feelings 
of inferiority, that is, they try to raise their 
level of self-esteem. They are likely to defend 
against accepting any opinion that would 
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make them feel still more insecure and 
threatened. 


Complexity Hypothesis 


There is a direct positive relation between 
self-esteem and persuasibility when the com- 
munication is complex, and especially when it 
contains the kind of complexity which results 
from the presence of misleading, closure- 
inducing phrases. The simple and fear hy- 
potheses concern the effect of self-esteem on 
the acceptance of the communicator’s opinion, 
and assume negligible effects on learning the 
content of the communicator’s position. These 
hypotheses assume that people understand the 
communication but do or do not accept its 
opinion, according to the conditions specified 
in the hypotheses, The complexity hypothesis, 


however, concerns the effect of threatened: 


self-esteem on the learning of the communi- 
cator’s position. If people do not learn the 
position, they cannot be “persuaded” by it. 
The distinction between acceptance and learn- 
ing, as distinct components of persuasibility, 
has also been made by Janis (1953a). 

Several studies have shown that anxiety 
facilitates performance on simple tasks and 
interferes with performance on complex tasks 
(Child, 1954). The complexity hypothesis 
applies these findings regarding task com- 

' plexity and anxiety to the relation of self- 
esteem and persuasibility. For present pur- 
poses it is assumed that threatened self-esteem 
functions as anxiety. 

Taylor and Spence (1952) present the 
theory that when complex tasks evoke many 
competing response tendencies, high anxiety 
may lead to poorer performance by increasing 
the probability that conflicting or erroneous 
responses will occur. Child (1954) has pointed 
out that high anxiety may result in more 
task-irrelevant responses, which interfere es- 
pecially with performance on complex tasks. 

In studying the effects of threatened self- 
esteem on closure, Dittes (1959, 1961) has 
specified circumstances under which condi- 
tions of threat lead to increased probability 
that competing erroneous responses will occur. 
He has proposed that the act of closure com- 
monly acquires a reward value as a means of 
enhancing self-esteem and that damage to 
self-esteem increases the strength of the tend- 
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ency to achieve closure as one means of re- 
storing or enhancing self-esteem. 

Applied to persuasibility, Dittes’ theory * 
states that the reading of a communication 
without obtaining closure (i.e., a feeling of 
having understood the communication) is, 
threatening to self-esteem, and that subjects 
with already lowered or threatened self- 
esteem are particularly motivated to avoid 
this further threat and therefore impose clo- 
sure readily. When a communication is sim- 
ple, such a tendency to form rapid closure | 
should facilitate performance; but when as 
communication is complex, and particularly 
when it contains misleading, closure-inducing ` 
phrases, such an effort toward rapid closure is ` 
likely to result in misunderstanding the com- 
munication. 

The theoretical positions of Taylor and 
Spence, Child, and Dittes all lead to the hy- 
pothesis that if a communication is complex, 
and especially if it also contains misleading, 
closure-inducing cues, subjects with threatened 
self-esteem will not learn the content of the 
communication as well as subjects with en- 
hanced self-esteem, 


Interactions between Hypotheses 


The effects postulated by all the hypotheses 
of this study are presumably operating with 
all communications, It is assumed that they 
can interact with each other and, depending 
on the relative strength of the various proc- 
esses which are activated, result in a net re- 
lationship between self-esteem and persuasi- 
bility which, theoretically, can range anywhere 
between a correlation of — 1.00 and + 1.00. 
Thus, in testing the fear hypothesis, we se- 
lected a threatening opinion which we felt 
would activate processes which would over- 
ride the effects postulated by the simple hy- 
pothesis. In a similar manner, again in order 
to override the effects hypothesized by the 
simple hypothesis, we emphasized the com- 
plexity and misleading, closure-inducing fea- 
tures of the portion of the communication 
used to test the complexity hypothesis. 


Relation between Manipulated and Predispo- 
sitional Self-Esteem 


Previous studies of the relation between 
self-esteem and Persuasibility have been con- 


: 
l 
5 


j 
| 
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A. 


i 


SELF-ESTEEM 


cerned with predispositional self-esteem, as 


| _ measured by subjects’ self-report on question- 


naires. In this study we have experimentally 
. manipulated self-esteem by varying failure or 


= success on an ego-involving task. It was, how- 


* ever, a secondary purpose of the study to 
- compare the effects on persuasibility of pre- 
dispositional and manipulated self-esteem. 
Although we recognize that there are prob- 
ably several important differences in manipu- 
“lated and predispositional self-esteem, we 
expected that the hypotheses advanced above 
“would hold for both manipulated, situation- 
ally affected levels of self-esteem, and predis- 
positional levels of self-esteem. In presenting 
our hypotheses we have cited studies which 
supported analogous hypotheses using self- 
report measures of predispositional self- 
esteem or predispositional anxiety. 
Cohen (1959), though, has proposed that 
` people high in predispositional self-esteem 
tend to use avoidance defenses which lead 
them to reject threatening communications 
and accept optimistic ones; while people low 
in self-esteem tend to use sensitizing defenses 
‘ which lead them to reject optimistic com- 
munications and accept threatening ones. 
Thus, Cohen’s hypotheses concerning predis- 
Positional self-esteem are contrary to the 
predictions of our simple and fear hypotheses. 
Cohen’s hypotheses have been partially sup- 
ported by Goldstein (1959) and Leventhal 
„and Perloe (1962). Dabbs (1962), though, 
found that the optimism-pessimism of com- 
munications produced no effect. He did find 
. that the effect of predispositional self-esteem 
on persuasibility depended on whether the 
communicator was portrayed as a “coper” or 
“noncoper,” 


METHOD 


a . b 5 
Subjects were made to experience either success 


Ws failure on an ego-involving task. They then gave 
7 their Opinion on three questions (pretest), read the 
communication, and then again, in the light of what 


„ they had just read, stated their opinion on the same 


three questions (posttest). Then, in order to meas- 
Ute possible differences in learning factors, the sub- 
Jects were given an information test which asked 

» for the communicator’s opinion on these same three 
questions, At the end of the experimental session 
Subjects completed a questionnaire which included 
ees intended to check on the effectiveness of 
the experimental manipulations. 
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Two self-report measures of predispositional self- 
esteem were administered to subjects at the begin- 
ning of the experimental session. The questionnaires 
were presented to subjects as unrelated to the later 
procedures and were completed in a room separate 
from the one in which the experimental procedures 
were conducted. The questionnaires were the 23-item 
measure of “feelings of inadequacy” which Janis and 
Field (1959) found was positively related to persuasi- 
bility in male high-school students, and the 12-item 
scale previously found by Dittes (1959, 1961) to in- 
teract with the effects of manipulated self-esteem on 
closure. 


Subjects 


The experimental materials were completed by 165 
Yale freshmen who were recruited from their dormi- 
tories and paid for participating in the experiment. 
To minimize any effects of interpersonal interaction 
and anticipated opinion of others, all work was 
anonymous, After a brief general introduction, sub- 
jects began work on the tasks, which were all self- 
administered. The various experimental materials 
were presented as a series of unrelated tasks. Each 
subject took a folder of experimental materials as he 
entered the room. The folders containing the experi- 
mental manipulations had been shuffled in advance. 


Success-Failure Manipulation 


Subjects took Items 1-10 and Items 31-40 of the 
Space Relations Test published by the Psychological 
Corporation. This test is difficult enough to prevent 
subjects from having much subjective certainty about 
the quality of their performance and it is therefore 
open to manipulation of reported performance re- 
sults, and thus of success or failure. The test is also 
abstract enough to allow it to be represented as 
measuring almost anything; this provided oppor- 
tunity for making it an ego-involving task. A spe- 
cial answer sheet was devised, so that answers were 
recorded, by carbon, onto a keyed sheet so as to 
permit self-scoring after the seal was broken, Ano- 
nymity was emphasized by repeated written instruc- 
tions during the procedures that subjects should not 
write their names on the materials and that the ex- 
perimenter’s interest was only in group averages and 
not individual results. 

In order to make it an ego-involving task, the 
written instructions described the Space Relations 
Test as one measuring 


skills of abstraction . . . of central importance in 
almost all occupations . . . highly related to per- 
sonal effectiveness and professional success. 


Fictitious keys and norms were used to affect the 
subjects’ feelings of having performed well or poorly 
on the Space Relations Test. For subjects assigned 
to the success condition (N = 79), a special answer 
key was devised, to maximize the number of appar- 
ently correct responses. A fictitious set of norms was 
provided so that all success subjects would find their 
scores ranging among the high percentiles of Yale 
freshmen. For subjects in the failure condition (W 
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= 86) a false key was provided indicating that many 
of the common responses were incorrect. A set of 
norms was also devised in such a way as to make 
the scores of all failure subjects range in the lower 
percentiles as compared with other Yale freshmen. 
To guarantee that subjects would inspect the norms 
carefully, a form was provided on which they were 
required to record their own score and percentile. 

A final questionnaire was intended to check on the 
effectiveness of the experimental manipulations. On 
four items on this questionnaire, success subjects re- 
ported that their results were above average, higher 
than they had expected, pleasing, and esteem en- 
hancing, while subjects in the failure condition re- 
ported that their results were below average, lower 
than they expected, disturbing, and not esteem en- 
hancing. The differences between the success and fail- 
ure subjects on these four items were all significant 
at well beyond the .001 level of probability. 


Communication and Measures of Opinion 
Change 


The communication was about 1,200 words long 
and was entitled, “How Much Should We Spend on 
Cancer Research?” The stated source of the article 
was either a low prestige, poorly qualified author 
(Mr. Lewis—sophomore student ‘at Andover High 
School) or a high prestige, well-qualified author (Dr. 
Lewis—Executive Director of the American Associa- 
tion for Medical Research).2 Different segments of 
the communication were constructed to test the dif- 
ferent hypotheses, The threatening portion occupied 
the first segment, and the complex, closure-inducing 
portion occupied the final segment. The point which 
was neither threatening nor complex was introduced 
in the middle of the communication. For reasons to 
be given later, two counterbalanced forms of the 
final segment were used. 

Simple hypothesis, Both forms of the communica- 
tion clearly expressed the belief that a diffuse re- 
search program is now the best strategy for using 
research facilities in doing research on cancer. Opin- 
ion was measured, on a 6-point scale, by asking 
subjects: 


Consider the following two strategies for using re- 
search facilities in doing research on cancer: (1) a 
massive, concentrated crash Program, focusing on 
a few clear problems, vs. (2) a widespread, diffuse 
program of many people working in many different 
places and in many different ways. Which do you 
think is now the best strategy ? 


This item was intended to Provide a measure of 
opinion change in which impulsive closure would 
play a negligible role and in which affective reac- 


2The manipulation of qualifications and prestige 
of communicator did not show any statistically sig- 
nificant effects on any of the measures of opinion 
change used in this report. The main effects and 
first- and second-order interactions with the failure- 
success conditions and with Forms A and B of the 
communication were all nonsignificant. 
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tions to the content of the question would be rela- 
tively slight. i 
Fear hypothesis, The first two paragraphs clearly | 
stated and elaborated the opinion, “At present the 
whole cancer problem seems almost hopeless.” The 
communication included statements such as: s 


When your chances of getting a disease are as- 
high as one in four, you might as well admit that. 
it COULD happen to you! .. . your chances of 


escaping cancer are not better than just that—four 1 


to one... . What is more, cancer is becoming a 
bigger problem every year. . . . You can’t be vac- 
cinated against cancer, ... You can’t even rely 
on the latest wonder drug in your doctor’s black 4 
bag to pull you through once you have it. i 


In order to measure opinion change on this issue, 
subjects gave one of six responses ranging from 
“Agree very much” to “Disagree very much” to’ the 
following statement: “At present the whole cancer 
problem seems almost hopeless.” q 

This item was intended to provide a measure of 
opinion change where any effects of impulsive closure 
would be negligible and where it would be probable 
that subjects would feel threatened by the content 
of the item, and the relevant portion of the com- 
munication. 

Complexity hypothesis. The complexity hypothesis 
states that subjects with lowered self-esteem, when 
given a complex communication, will be more likely 


to misread the communication and not learn the | 


communicator’s intended opinion, and hence not be — 
influenced by it. 

To test the complexity hypothesis, we constructed 
a portion of the communication as follows: The com- - 
munication presented a complex argument in favor 
of one opinion—on the relative costliness of a crash 
program versus a diffuse program of cancer research. 
It also provided—by means of underlining, quota- 
tions marks, and otherwise making selected phrases 
prominent—certain closure-inducing cues which could 
lead the subject to a conclusion the opposite of that 
actually expressed in the total communication. 

In this way we ensured that at least one compet 
ing erroneous response would be very high on the 
response hierarchy, and also made it easy for a sub- 
ject, Particularly if he felt threatened and was seek- 
ing closure, to jump to a quick, but incorrect, con- 
clusion as to the communicator’s opinion, Through 
this technique we were able to study an extreme casé 
of the common situation in which a rapid, impulsive 
closure-producing reading of a complex communica- 
tion leads to a misunderstanding of the communi- 
cator’s actual opinion. 

Two counterbalanced forms of the communication 
were used to test the complexity hypothesis. One 
form actually argued that a diffuse research program 
1s more expensive than a concentrated program; but 
the complex closure-inducing portions of the com- 
munication made it easy for subjects, if they were 
So inclined, to come to the conclusion that the ar 
thor was arguing in favor of the opposite position 
that is, that a crash research program is more eX 
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pensive than a diffuse research program. The second 
form argued in the opposite direction. 

+ Counterbalanced forms were used in order to con- 

trol for the possibility of an effect of the success-fail- 

. ure manipulation on subjects’ preferences for spend- 

ing more or less money on cancer research. Opinion 

change results showed no significant interaction be- 

tween the two forms of communication and the suc- 
= cess-failure manipulation (t= —.51). 

The item used in testing the complexity hypothe- 
sis was: “Which type of strategy in cancer research 
do you think is more expensive, a massive concen- 

. trated crash program, or a widespread diffuse 
program?” Subjects gave one of four responses 
ranging from “The crash program is much more 

° expensive” to “The diffuse program is much more 
expensive.” To aid comparison with other scales, 
these four responses are expanded to a 6-point scale. 

4 The information test for this item provides a 
Measure of whether or not subjects correctly learned 

the communicator’s position on the issue. Subjects 

" were asked to indicate on a similar 4-point scale 
whether the author regarded the diffuse or crash 
program as more expensive. Subjects whose responses 
were consistent with the misleading closure-inducing 

'. phrases, rather than with the communicator’s actual 
opinion, were regarded as having misunderstood or 
misread the communication. 


RESULTS AND DISCUSSION 


ld _The proportion of subjects changing in the 

. direction advocated by the communication 
was used as a measuse of opinion change. 

ı The mean change on each item showed the 
same pattern of results as the proportion of 
Subjects changing positively. 


Simple Hypothesis 


An inverse relationship between manipu- 
lated self-esteem and persuasibility was ob- 

a Setvetl when the advocated opinion was not 
threatening and the communicator’s position 
‘was clearly stated. As shown in Row 1 of 


® The investigators wish to thank Robert P. Abel- 
Son for suggesting use of the arc-sine transformation 
(Walker & Lev, 1953, p. 424) in analyzing the data. 

+ » Straightforward applications of the arc-sine trans- 
formation were used to test for interactions of the 

. failure and success conditions with other variables. 

» This test, unlike most other tests for analyzing in- 

` teractions among proportions, does not require the 
assumption that no main effects exist in order to 
test for the significance of interaction terms. 

For the benefit of those readers who distrust 
Statistical transformations of this type, chi-square 
tests of the main effects for all the hypotheses of 
this study gave p values of approximately the same 
magnitude as those obtained using the arc-sine 
transformation. 
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TABLE 1 


PROPORTION OF SUBJECTS CHANGING IN DIRECTION 
ApvocaTED BY COMMUNICATION 


Experimental 
conditions 
Hypothesis | Opinion question t 
Failure | Success 

1. Simple Diffuse versus crash 

program as best 

strategy 50 34 2.08* 
2. Fear Cancer problem 

seems hopeless 31 AB =2:22% 
3. Complexity | Diffuse versus crash 

program as most 

expensive 26 ‘43 | —2.34* 


* p <.02, one-tailed test. 


Table 1, failure subjects showed significantly 
more opinion change than success subjects 
(p < .02). 

The fact that the difference between the 
mean information scores of the two groups 
was so small (Row 1 of Table 2) permits us 
to conclude that the experimental manipula- 
tion probably had its effect on acceptance of 
the communicator’s position rather than on 
learning his position on the issue. 

There were no differences in initial opinion 
between the failure and success groups that 
could account for the observed differences in 
opinion changes. Similarly, with the opinions 
used to test the fear and complexity hypothe- 
ses, initial positions were virtually identical; 
all ¢’s were <.60. 


Fear Hypothesis 


A direct positive relationship between ma- 
nipulated self-esteem and persuasibility was 
observed when acceptance of the advocated 
opinion would generally result in substantially 


TABLE 2 


MEAN INFORMATION SCORE: LEARNING OF 
COMMUNICATORS POSITION 


Experimental 
conditions 
Hypothesis | Opinion question t 
Failure | Success 
1. Simple Diffuse versus crash 
program as best 
3 strategy ih 5.0 4.8 79 

Fe ancer problem 

scat seems hopeless | 4.9 | 4.8 |  .55 


3. Complexity | Diffuse versus crash 
program as most 
expensive 4.1 5.2 


* p <.001, one-tailed test. 
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increased threat to subjects. Row 2 of 
Table 1 shows that, as compared with success 
subjects, the subjects in the failure condition 
showed significantly less acceptance of the 
threatening opinion, “At present the whole 
cancer problem seems almost hopeless” (p 
< .02). Since there is no significant difference 
between the groups on the information test 
for this item (Row 2 of Table 2), it is again 
concluded that the experimental manipulation 
probably affected the acceptance component 
of opinion change, rather than learning 
components, 


Complexity Hypothesis 


A direct positive relationship between ma- 
nipulated self-esteem and persuasibility was 
observed in the case where the communica- 
tion presented a complex argument in favor 
of an opinion and contained misleading, 
closure-inducing phrases, As shown in Row 3 
of Table 1, success subjects showed signifi- 
cantly more opinion change than failure 
subjects (p < .01). 

Difference in learning, not acceptance. On 
this item, it was predicted that the experi- 
mental manipulation of self-esteem would af- 
fect opinion change via learning components 
of persuasibility. Row 3 of Table 2 shows 
that the information score on this item, unlike 
that of the other items, was significantly 
higher for success subjects than for failure 
subjects ( < .001). Thus, on the complexity 
item, the finding of more opinion change by 
success subjects (Row 3 of Table 1) can be 
adequately explained by the fact that signifi- 
cantly more of the success subjects had cor- 
rectly learned the communicator’s actual 
opinion, 


Results with Predispositional Self-Esteem 


Our results provided little illumination con- 
cerning comparisons between the effects on 
persuasibility of manipulated and predispo- 
sitional self-esteem. Neither of the two meas- 
ures of self-esteem was significantly related 
to any of the opinion measures, nor was 
any interaction between predispositional self- 
esteem and the success-failure conditions sig- 
nificant. We are not clear about the reasons 
for failure to replicate previous findings ob- 
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tained using these measures of self-esteem; 
perhaps the intrusion of the experimental 
manipulation swamped any overall effects of 
predispositional self-esteem. 


A Final “Complexity” Hypothesis 


The results of this experiment, and other 
recent work, emphasize that the relatiori be- 
tween self-esteem and persuasibility is quite 
complex. Our results or others’ indicate that 
increased self-esteem may enhance or decrease 


persuasibility, or have no effect, depending, 


on how it interacts with such variables as 
the following: the characteristics of the com- 
munication, the characteristics of the com- 
municator, and whether self-esteem is con- 


ceived as predispositional or situationally |” 


manipulated, Finally, our results emphasize 
that self-esteem may affect persuasibility 


through differential learning of the communi- _ 


cation as well as through the more commonly 
considered process of the acceptance of the 
communication, 
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EGO INVOLVEMENT AND THE ABSOLUTE JUDGMENT 
OF ATTITUDE STATEMENTS'* 


CHARLES D. WARD 
University of Maryland 


Personal involvement in a social issue was manipulated both experimentally and 
by S selection while controlling for extremity of attitude. The effects on the 
absolute judgment of relevant attitude statements then was observed, The more 
highly involved was S, the further from his own position the statements were 
judged. This effect was not significant for statements on 2 irrelevant attitude 
scales. The Involvement X Item Position interaction was not significant. These 
results were interpreted as providing support for the notion of involvement- 
influenced judgment of attitude statements. A possible cognitive interpretation 


of the phenomenon was suggested. 


An assumption common in many attitude- 
scaling methods is that a subject’s judgments 
of attitude statements are independent of his 
own attitude on the issue. Hinckley (1932) 
was the first person to attempt to test this 
assumption for the method of equal-appearing 
intervals. Hinckley compared judgments of 
Northern white, Southern white, and Negro 
subjects on statements pertaining to the social 
position of the Negro, He concluded that the 
judgments of the three groups did not 
differ in spite of their apparently extreme 
divergences in attitude on the issue. 

However, Hovland and Sherif (1952) re- 
peated Hinckley’s study in an attempt to 
bring a rapprochement between studies such 
as Hinckley’s in the area of attitude measure- 
ment and those which had demonstrated the 
influence of motivational factors on judg- 
ments, Hovland and Sherif concluded that 

Judges with an extreme position and strong per- 
sonal involvement will be highly discriminating in 
accepting items at their own end of the scale. They 
will correspondingly display a strong tendency to 


lump together statements at the end of the scale 
which they reject [p, 824]. 


A further conclusion was that 


A greater degree of displacement will occur for the 
“neutral” items and a smaller degree for the sharply 
defined pro and con statements at the extremes 
[p. 824]. 


1 This article is based upon a thesis submitted in 
partial fulfillment of the requirements for the MA 
degree at the University of North Carolina. The 
author is indebted to Harry S. Upshaw for his 
advice and assistance in carrying out this research 
and in preparation of the manuscript for publication. 


The striking difference between their results 
and those of the earlier studies on attitude 
measurement was explained by Hovland and 
Sherif as being due to the earlier studies’ 
lack of highly involved subjects who held 
extreme positions, For Hinckley’s study, this 
lack seemed to be due to the fact that a 
“carelessness” criterion had been applied 
which had the unintended effect of excluding 
those subjects who were most extreme in their 
opinions, 

The Hovland and Sherif (1952) study 
appears to reconcile previous attitude meas- 
urement studies with those demonstrating 
motivation-influenced judgment. However, 
Hovland and Sherif treated involvement and 
extreme attitude as a single variable opera- 
tionally. Consequently, it is not clear whether 
the displacement effects they observed were 
due to involvement alone, to extreme atti- 
tudes alone, or to some combination oï the 
two. This important question is left unan- 


swered in two later studies (Hovland, 1959;" 


Hovland, Harvey, & Sherif, 1957). Involve- 
ment and extremity of attitude are not identi- 
cal conceptually, although empirically they 
are very difficult to separate. Extremity of 
attitude, for example, typically is curviline- 
arly related to involvement, so that judges 
with extreme attitudes tend to be more 
involved than are judges with intermediate 
positions, 

The current theoretical position of Hovland 
and Sherif has developed beyond that which 
was presented in their 1952 study, but the 
notion of involvement-influenced judgment is 
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still a central one. Sherif and Hovland (1961) 

distinguish between two kinds of involve- 
.’ ment. The first is task involvement which is 
~ ‘induced in the subject by the experimental 
instructions or by the experimenter’s emphasis 
» on the importance of the task. The second is a 
„much more intense involvement for the sub- 

ject,.in which the stimulus items 


or the task itself is related to an issue important 

į to his very conception of himself. ... It is ego- 

“involvement of this second kind, aroused by a 

stimulus item related to his stand on an issue,* by 

` ca person or group to whom the individual is com- 

mitted, that is of special interest in the area of 

- attitude measurement and attitude change [pp. 
196-197]. 


The present study was designed to investi- 
| gate the effects of this second kind of involve- 
ment, and in so doing, to examine some of the 
basic propositions set forth by Hovland and 
_ Sherif in their 1952 study which are implicit 
“in their current theoretical position. The basic 
requirement for such an investigation is a 
method for varying involvement which does 
not at the same time vary extremity of atti- 
_ tude, Such a method is suggested by a tech- 
nique used by Charters and Newcomb 
(1952), who varied the religious involvement 
of their subjects by manipulating the salience 
of relevant religious groups. Their intention 
was to observe the possible effects of this 
Variation on answers to an attitude question- 
naire, This operation was carried out by 


“simply heightening the individual’s awareness of his 
membership in the specified group by vivid remind- 
ets of this membership [p. 415]. 


Charters and Newcomb view this as a ma- 
hipulation of the “potency” of the religious 
membership group, but it also can be viewed 
as a maniplation of the involvement of the 
Subject in the religious issue. Sherif and 
Sherif (1956) thus state that 


oes and Newcomb ... showed that when 
Ca ‘olic subjects are situationally ego-involved as 
eae their reactions to general statements are 
"ER ntially different from their reactions to the 
me items when Catholic group reference is not 
activated [p, 638]. 


: The subjects’ task in the Charters and 
ewcomb study was that which Coombs 


aoe has called “Task A,” a preference 
- In order to test the effects of involve- 


e 
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ment on the judgment of attitude statements, 
however, it is necessary to observe the effect 
of ego involvement on an attribute judgmental 
task, a “Task B.” The Charters and New- 
comb technique can influence Task A judg- 
ments and therefore the measured attitudes 
of the subjects. Consequently, a special prob- 
lem arises of applying the technique to sub- 
jects’ Task B judgments while at the same 
time controlling for the extremity of their 
attitudes, One possible solution to this dif- 
ficulty is to use subjects who already have 
such extreme attitudes on the issue that ex- 
perimentally induced variations in involve- 
ment are not likely to make their attitudes 
still more extreme. The effectiveness of such 
subject selection could be determined after 
the application of the Charters and Newcomb 
technique by giving all subjects the same 
Task A preference scale. If there are no 
differences in the resulting attitudes of the 
subjects, in spite of their different levels of 
involvement, then it would seem safe to- 
conclude that any observed differences in the 
Task B judgments are due to differences in 
involvement alone. 


METHOD 
Subjects 


The subjects in this experiment were 60 white 
students at the University of North Carolina, All 
subjects had uniform and extremely favorable atti- 
tudes toward the Negro. Forty of these subjects pre- 
viously had picketed local movie theaters for several 
months in order to protest against the theaters’ 
policy of racial segregation, An uneasy truce of 
about 2-months duration had existed between the 
pickets and the theaters when the present experiment 
was conducted. During this time the theaters had 
not been picketed even though they had not yet 
integrated. The picketing group, the “Citizens’ Com- 
mittee for Open Movies,” was a voluntary organiza- 
tion which did not actively solicit members. Conse- 
quently these 40 subjects had sought membership 
on their own initiative in order to express publicly 
their views on the issue. The other 20 subjects had 
not been pickets but they were known to possess 
equally favorable views toward the Negro. Many 
had friends who did picket the movie theaters, This 
sample of 60 students was composed of both males 
and females, and undergraduate as well as graduate 
students. 


Experimental Design 


There were two major independent variables in 
the experimental design. The first variable was in- 
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volvement. There were three different involvement 
conditions: the “pickets-salient” condition, composed 
of pickets whose membership in the picketing group 
had been made salient; the “pickets-nonsalient” 
condition, composed of pickets whose membership 
in the picketing group had not been made salient; 
and the “nonpickets” condition, composed of stu- 
dents who were not members of the picketing group 
but who had equally favorable attitudes toward the 
Negro. Twenty subjects, matched for sex and 
graduate-undergraduate status, served in each in- 
volvement condition. 

Since the subjects in all three conditions had 
extremely favorable attitudes toward the Negro, it 
is very likely that all originally felt at least mini- 
mally involved in the Negro issue. The 40 picketing 
group members, however, had voluntarily sought out 
the picketing group for membership while the non- 
pickets had not. Consequently it seems safe to 
conclude that the pickets were even more involved 
in the issue than were the nonpickets. Through use 
of the Charters and Newcomb technique, involve- 
ment also was varied among the pickets, Great 
emphasis (described below) was placed on picketing 
group membership for subjects in the pickets-salient 
condition in order to situationally ego involve them 
as pickets. No mention of picketing group member- 
ship was made for subjects in the pickets-nonsalient 
condition, Thus it was intended for involvement 
to vary between the three conditions, with involve- 
ment greatest for the pickets-salient subjects, inter- 
mediate for the pickets-nonsalient subjects, and least 
for the nonpickets, 

For the purposes of the analysis a second inde- 
pendent variable was specified, the scale values of the 
attitude statements on the Negro scale. The items 
were divided according to their scale values into four 
nonoverlapping groups of equal numbers. This was 
done in order to see if the statements’ locations on 
the attitude continuum influenced the judgments of 
the subjects differentially according to their degree 
of involvement. 


Hypotheses 


Paraphrasing Hovland and Sherif (1952), the 
hypotheses of the present study in terms of the 
experimental design were: 


I. Judges with extremely favorable positions and 
strong personal involvement in an issue, in com- 
parison with judges with equally extreme Positions 
but less involvement, displace attitude statements 
toward the unfavorable end of the scale. 

II. The difference in judgments between judges 
who are highly involved and judges who are less 
involved is greater for the “neutral” items than for 
the more sharply defined pro and con » statements 
at the extremes. 

Hypothesis I, in terms of the present experi- 
mental design, predicts a significant involvement 
main effect. The item judgments of the pickets-salient 
subjects should be the most negative, those of the 
pickets-nonsalient subjects intermediate, and those of 
the nonpickets subjects the most favorable. Hypothe- 
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sis II predicts a significant interaction between in- 
volvement and item groups, with the differences in 


judgments between involvement conditions being, 


greatest for the midscale item groups. 


Questionnaire 


During the experiment a questionnaire which con-' 
sisted of several parts was given to each of the. 


subjects. The first page contained a brief description 


of the alleged purpose of the study, which varied b 


with the involvement condition. Brief instructions 


for the judgment task also were included. Part 1 
was composed of 35 statements from the Droba’ 


(1931) militarism-pacifism scale, a scale intended 


to be irrelevant to the involvement manipulation) 


An 11-point rating scale appeared under each item, 


The subjects were instructed to indicate their judg- 


ments of how militaristic each item was by checking 
the interval most appropriate for that item, Interval 


11 was the most militaristic interval, Interval 1, 


the most pacifistic. The internal ordering of the state- 
ments in this scale, as for the other scales, was 
randomly determined. 

Part II of the questionnaire contained 40 attitude 


statements on the social position of the Negro. Most, 


of these statments were from the original Hinckley 
(1932) scale, but some supplementary items were 
added at the pro-Negro end, most of which were 
Negro superiority items. These items were added 
in order to insure that the range of items would 


extend beyond the extremely favorable attitudes of 


the subjects, since Upshaw (1962) has demonstrated 
that out-of-range subjects judge items differently 
than do in-range judges even when their absolute 
positions are identical, 

Part III of the questionnaire contained 35 state- 
ments on the Cuban revolution. This scale, like 
the militarism-pacifism scale, was intended to be 
irrelevant to the involvement manipulation. There 
were two reasons for the inclusion of the two 
irrelevant scales. First, it helped disguise the true 
purpose of the study, especially from subjects. in the 
critical pickets-nonsalient condition. Second, the in- 
clusion of the two irrelevant scales served as 4 
control since the experimental variation in involve- 
ment was relevant only to the Negro issue, It was 
expected that the variation in involvement woul 
affect judgments of items on the Negro scale, but 
not on the two irrelevant scales. 

Part IV of the questionnaire consisted of a pel- 
sonal data sheet to be filled out by the subjects. 
In addition to name, sex, etc, the subjects were 
requested to list the organizations to which they 


belonged and the ways they had spent their leisure 
time during the past school year. These last tW0 


questions were added in order to measure the 


efficacy of the experimental involvement manipula- 


tion. It was expected that the subjects in the pickets- 
salient condition would list picketing more often 
for both these questions than would subjects in 
the pickets-nonsalient condition, For the nonpickets 
of course, the questions were irrelevant. i 
After the subjects completed the above question- 


naire they were given a supplementary attitude 
questionnaire containing 21 Hinckley statements on 
» the social position of the Negro. The scale values 
of these items, along with those in the main question- 
. naire, were found by calculating the medians of their 
placements by 50 judges who did not otherwise par- 
. ticipate in the experiment. The subjects were re- 
quested to endorse the statements which adequately 
“represented their own opinions. From this supple- 
- mentary questionnaire the individual attitudes of 
the subjects were determined. 


` : Procedure 


The procedure was different for each of the three 
‘involvement conditions. Subjects in the pickets- 
salieht condition were contacted directly. They were 
told that the reason for their selection was their 
membership in the picketing group, and the experi- 
menter identified himself as also being a member. 

.. The purpose of the study was explained as being 
~ an investigation of the possible transfer of uni- 
formity of judgments on the race issue to other 
important issues, thus justifying the inclusion of 
the two irrelevant scales. Most of the subjects in 
`, this condition were given the questionnaire in small 
groups, Before they filled out the questionnaire the 
experimenter led a brief discussion on the importance 
of the group’s picketing, the composition of the 
group’s membership, and possible ways to get more 
People to participate in the picketing in the event 
«that it should be resumed. 
The subjects in the pickets-nonsalient condition were 
` contacted indirectly. Prior to the experiment their 
Class schedules had been determined. Their instruc- 
n tors in these classes agreed to ask the subjects, along 
with one or more randomly chosen nonpickets in 
each class, to remain behind after a given class 
period. The experimenter then approached the sub- 
Jects and told them that they had been selected 
Tandomly to participate in a pretest of a national 
survey of college student opinion. The subjects were 
Slven cover letters further explaining the alleged 
) Paper, of the study, and were asked to sign up 
+ for individual appointments, Only pickets were in- 
cluded in this involvement condition. All interviews 
T given individually. During the debriefing ses- 
h concluded all interviews, only a small 
in Siena the subjects in this involvement condition 
study eae as to the true nature of the 
se karded. questionnaires for these subjects were 
ree Subjects in the nonpickets condition, having 
Hf Pee ed randomly, at first were contacted 
spiel T classroom along with the pickets in the 
PET -nonsalient condition. This proved to be an 
i nea method for locating extremely pro-Negro 
Hie S, however, so the remainder were contacted 
t Edk after their names had been suggested by 
nonni who had already been interviewed. These 
cele S subjects were told the same cover story 
They cae Pickets-nonsalient subjects had received. 
A aieia in small groups, although a 
e questionnaire individually. 
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RESULTS 


In order to see if the attitudes of the 
subjects were unreasonably discrepant on the 
Negro issue the attitude of each subject was 
determined. This was done for each subject 
by computing the mean scale value of all 
the items on the supplementary questionnaire 
which he had endorsed. The mean attitude 
of judges in the pickets-salient condition was 
9.216; in the pickets-nonsalient condition, 
9.402; and in the nonpickets condition, 9.152. 
An analysis of variance of these differences 
produced a variance ratio of 1.39 (df = 2/57), 
with an estimated probability value of about 
.24, From this it was concluded that the 
observed differences in attitude were not large 
enough to disturb the requirements of the 
present experiment. The fact that the most 
extremely favorable attitudes were held by 
the pickets-nonsalient subjects rather than by 
the pickets-salient subjects provided further 
support for this conclusion. 

For the analysis of the (Task B) judgments 
of the Negro items in the main questionnaire, 
the scale values of the items were treated 
as an independent variable. The 40 items 
were divided into four groups of 10 items 
each. Group 1 contained the 10 most nega- 
tive items, Group 2 the next 10 most negative 
items, Group 3 contained 10 positive items, 
and Group 4 contained the 10 most positive 
items. Each subject’s judgments of all these 
items then were recorded, and his mean judg- 
ment for each of the four groups of items 
was calculated. Thus for each subject four 
means were computed, each mean represent- 
ing that subject’s placement of the 10 items 
of a particular item group. 

The overall mean item placement by in- 


TABLE 1 


MEAN JUDGMENTS OF Item Groups BY 
LEVEL or INVOLVEMENT 


Involvement 
Item group 7 : 
| ster | Bette, | secon 
í 5.045 4.900 5.125 
2 6.970 7.395 7.690 
3 8.380 9.015 9.185 
4 8.700 9.265 9.370 
Combined mean| 7.274 7.644 7.843 
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TABLE 2 


ANALYSIS OF VARIANCE OF JUDGMENTS OF 
Necro ITEMS 


Source df MS F 

Between people 

Involvement (A) 2 6.66 2.92* 

Error (b) 57 2.28 
Within people 

Item position (B) 3 211.46 | 346.66** 

AXB 6 80 1.31 

People X B (Error w)| 171 61 

Total 239 
ep = 
ep <i. 


volvement conditions is shown in Table 1. 
From this table it can be seen that the greater 
the degree of involvement, the more nega- 
tively the Negro statements were judged. 
These differences in judgments were tested by 
means of an analysis of variance, employing 
techniques suggested by Green and Tukey 
(1960). The effects of Involvement, Item Po- 
sition, and interactions were tested, as shown 
in Table 2, The main effect for Item Position 
was significant (p< .001), but it should be 
kept in mind that this was assured by the 
original division of the items into four groups 
according to their scale values. The involve- 
ment main effect produced a probability value 
of approximately .06, while the Involvement 
X Item Position interaction was not signifi- 
cant, having a probability value greater than 
.25. In order to further investigate the In- 
volvement main effect, the Newman-Keuls 
multiple comparisons procedure described by 
Winer (1962) was employed. The results of 
this procedure, shown in Table 3, suggest 
that the mean judgments were more influ- 
enced by the manipulation of picketing group 
saliency than by the variation in picketing 
group membership. 


TABLE 3 


DIFFERENCES BETWEEN INVOLVEMENT 
Conprtion MEANS 


Pickets- Pickets- 


Salient nonsalient Nonbickets 
Pickets-salient — 
Pickets-nonsalient | .370* — 
Nonpickets 560** -199 = 


* p < .10, one-tailed, 
** p < .025, one-tailed, 
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Part IV of the questionnaire, it will be 
recalled, required the subjects to list theip 
activities during the past school year, as well _ 
as the organizations to which they had be, 
longed. Out of the 20 subjects in the pickets: * 
salient condition 9 listed picketing at least’ 
once. In the pickets-nonsalient condition only, 
5 of the 20 subjects listed picketing. The chi- 
square for this difference is not significant 
statistically, but the difference is in the 
expected direction. It is possible that the’ 
difference in involvement actually is under- , 


estimated by the difference in frequency of — 


mentioning picketing. This possibility is sug- 
gested by comments during the debriefing 
session of some of the subjects in the pickets- — 
salient condition who did not list picketing, » 


These subjects said that they knew that the ` 


TABLE 4 


MEAN JUDGMENTS By INVOLVEMENT LEVEL 
FOR IRRELEVANT SCALES 


Pickets-| Pickets- 

falint | sionealient |N OREN 
Militarism-pacifism scale| 5.115 5.342 4.900 
Cuban revolution scale 6.355 6,453 


| 6.110 


experimenter already was aware of their: 
picketing activity and therefore they thought 
it unnecessary to list picketing formally. 
The mean judgments by involvement con- 
ditions for the two irrelevant scales, the 
militarism-pacifism scale and the Cuban revo- 
lution scale, are shown in Table 4. Ten out 
of the 35 items on each scale were selected 
randomly for the analysis. The mean judg- 
ment of these 10 items was then calculated 
for each subject on each scale. Possible dif- 
ferences in judgments between subjects in the - 
three involvement conditions were tested by 
Means of two analyses of variance, one for 
each scale. The variance ratios for the 
militarism-pacifism and Cuban revolution 
scales, respectively, were 2.33 (df = 2/55) 
and 1.42 (df = 2/56), The probability value : 
for the variance ratio on the Cuban revolu- | 
tion scale was only .25, while that for the 
militarism-pacifism scale was about .11. Al 
though both of these probability values ate 
larger than that observed for the Negro scale, 
the probability value for the militarism- 


m scale is smaller than expected. How- 
involvement condition means for the 
rism-pacifism scale, unlike those for the 
‘scale, are not consistently in the 
| direction. 


Discussion 


results of this experiment lend sup- 
© the general notion of involyement- 
ed judgment. As predicted by Hy- 
I, judges with extremely favorable 


found to displace attitude statements 
d the unfavorable end of the scale, in 
‘ison to the judgments of judges with 
ually extreme positions but with less in- 
ent. Thus subjects in the pickets- 
‘condition judged the items to be least 
0 able, judgments of the pickets-nonsalient 
byects were intermediate, and the judgments 
the nonpickets subjects were the most 


findings is that significant differences in 
ts by involvement conditions were 
l only on the Negro scale. The second 
j g is that subjects in the pickets-salient 
Condition more frequently listed picketing as 
tivity in which they had participated 
an organization to which they had 
ed than did subjects in the pickets- 
lient condition, The suggestion of this 
d finding is that the manipulation of 


t conditions was successful. 

thesis II was that the difference in 
ents between highly involved and less 
ubjects is greater for the midrange 
an for the items at the extremes of 
Scale. From a motivational standpoint this 
erential displacement would be expected 
tuse of the tendency for midscale items to 
re ambiguous than extreme items. The 
sments of such ambiguous midscale items 
be expected to be especially likely to 


the present experiment, however, 
pport this second hypothesis. 
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The theoretical position of Hovland and 
Sherif entails a motivational interpretation of 
the effect of involvement upon judgment. 
However, the difference in judgments between 
highly involved and less involved subjects was 
not found to be significantly greater for mid- 
range items than for items at the extremes of 
the scale, as expected by such an interpreta- 
tion. A second implication of the motivational 
approach might be that items near the own 
position of a highly involved judge would be 
displaced by him toward his own position 
(“assimilation”), while only more discrepant 
items would be displaced away from his own 
position (“contrast”). In comparison to a 
judge with less involvement, the highly in- 
volved judge might be expected to be es- 
pecially motivated to reduce the dissonance 
between his own position and those items with 
which he is in general agreement. In the 
present experiment, however, there was no 
evidence of highly involved judges displacing 
items toward their own positions. Instead, the 
direction of item displacement was consist- 
ently away from the involved judges’ posi- 
tions, even for very favorable items. For these 
reasons a motivational interpretation does not 
seem to be appropriate for the present data. 

A more appropriate interpretation of the 
present results is a cognitive psychophysical 
one. Such an interpretation is suggested by 
a special application of Helson’s adaptation- 
level model, as presented by Segall (1959). 
In this model the location of the adaptation 
level, or point of neutrality, is assumed to be 
the basis by which the placement of items into 
lower or higher categories on the continuum 
is determined. Since the judge’s attitude is 
assumed to be a determinant in the location 
of the adaptation level, the adaptation level 
is displaced towards the judge’s attitude, The 
items being judged consequently are displaced 
toward the end of the scale opposite that 
of the judge’s own position. Added to this 
basic model is the further assumption that 
the more salient the judge’s attitude, the 
more the adaptation level is displaced towards 
it. Consequently, the greater the saliency of 
the judge’s attitude, the further he displaces 
the items being judged toward the opposite 
end of the scale. 

In the present experiment all subjects had 
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equally extreme attitudes, but only the 
pickets had sought out the picketing group 
for membership. Presumably the attitudes of 
the pickets were more salient and important 
to them than were the attitudes of the non- 
pickets. Among the pickets attitude saliency 
was further varied by means of the Charters 
and Newcomb (1952) technique. As a result 
of the differential application of this technique 
the anchor of own attitude was made even 
more salient for pickets in the pickets-salient 
condition than for pickets in the pickets- 
nonsalient condition, 

In order to produce a salient anchor in a 
psychophysical experiment, the experimenter 
emphasizes the anchor’s importance, stresses 
its relevance, and in general draws the sub- 
ject’s attention to it. In producing involve- 
ment the Charters and Newcomb technique 
employs the same general procedures. From 
a psychophysical standpoint it thus appears 
possible that the involvement observed in the 
present experiment is simply a special case of 
anchoring, in which the anchors are unusually 
salient attitudes. Anchoring phenomena have 
long been observed in psychophysical experi- 
ments. If the present psychophysical in- 
terpretation is correct, many instances of 
involvement-influenced judgment may be 
simply special cases of such commonly 
observed anchoring phenomena. 
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MOTIVATIONAL DETERMINANTS OF ACADEMIC 
TASK PERSISTENCE* : 


a ESTHER S. BATTLE? 
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Task persistence at a difficult mathematics problem was examined among 74 
junior high-school students. Minimal goal level, expectancy, and attainment 
value were the main independent variables. Other predictors were: the dis- 


crepancy between minimal goal level and expectancy, the certainty one holds of 
attaining his minimal goal, “inner-other direction” and social class membership. 


Expectancy related positively to persistence. Neither minimal goal level nor at- 


tainment value was predictive. Minimal goal certainty and the discrepancy 


scores related positively to persistence. It was impossible, however, to isolate 
the effect of these scores from the expectancy level since both related to ex- 
pectancy and to persistence. “Inner-directed” students were more persistent 
than “other-directed.” Social class related to persistence for boys, Neither “so- 


This is a study of why some children 
persist in their attempts to solve a difficult 
«academic task, while others give up. Attitudes 
toward academic achievement which had been 
built up in school were thought to influence 
the amount of task persistence children dis- 
play when they encounter such difficulty. For 
example, one might expect that the child 
. who holds high standards for himself and is 
confident of reaching them would be more 
` persistent than the uncertain, easily satisfied 
individual. Rotter’s (1954) social learning 
theory, with some modifications, provides the 
theoretical context in which this question is 
approached, 
While early explanations of task persistence 
relied on the concepts of will (Webb, 1915) 
4 and character (Stone, 1922), these explana- 
tions were replaced by the interest hypothe- 
sis: the subject persists at a task because he 
has interest in it and his experience or 


7 This report is based upon a doctoral dissertation 
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cial desirability” nor IQ was associated with persistence. 


learning affects the amount of interest 
(Hartshorne, May, & Maller, 1939). The im- 
portance of this development is that it repre- 
sents a transition from a trait to a learning 
theory approach. Contemporary learning ex- 
planations of task persistence use the concepts 
of motivation to achieve success (Atkinson, 
1958; Feather, 1961, 1963), valence (Lewin, 
1935), need value (Rotter, 1954), or the at- 
tainment value construct (Crandall, Katkov- 
sky, & Preston, 1960) which was used in the 
present study. Attainment value is the impor- 
tance to the individual of achievement in a 
given task and should determine the length of 
his persistence in working at it. Two kinds of 
importance are investigated in this study. 
The first, absolute attainment value, is the 
importance of doing well in mathematics, re- 
gardless of the importance attached to doing 
well in other academic subjects. The first ex- 
planation of task persistence proposed in the 
present study, then, is that an individual per- 
sists because it is important to him to be good 
in mathematics, per se. 

The second type of importance, relative 
attainment value, is the importance of com- 
petence in mathematics, compared with the 
desire to be proficient in other school sub- 
jects. Robinson (1940) and Feather (1961) 
stress that one should consider the nature 
and desirability of alternative activities which 
will lead to the same end (in this case posi- 
tive reinforcement for academic competence) 
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if one is to predict how long an individual 
will persist with the task at hand. Here, it 
would be expected that students who felt it 
was more important to be good in English 
than in mathematics, for example, would be 
less persistent in math than those who rated 
mathematics highest in importance. Thus, the 
second explanation of task-persistent behavior 
is that the individual persists because it is 
more important for him to do well in this 
subject than to develop skills in other avail- 
able courses. 

The ability of the experimental subject was 
believed to be a necessary component of 
task-persistent behavior by Nelson (1931), 
Kremer (1942), Ryans (1939), Thornton 
(1941), and Rethlingshafer (1942). Never- 
theless, experimental findings revealed only a 
low relationship between intelligence and task 
persistence. When the individual’s own assess- 
ment of his ability was used as a predictor 
it was found to relate more directly to per- 
sistence, for example, Thornton’s (1939) 
feelings of adequacy. Rotter’s (1954) con- 
struct of expectancy, used here, represents 
one such approach. Expectancy is defined as 
the probability held by the individual that a 
particular reinforcement (in this case, suc- 
cessful solution of the problem) will occur 
as a function of a specific behavior on his 
part (here, persistence) in a specific situation. 
Todd, Terrell, and Frank (1962) found that 
students with a greater expectancy for aca- 
demic success were more likely to be normal 
achievers than underachievers. The third 
explanation of persistence used in this study, 
then, is that an individual continues to work 
at a problem if he expects that he will be 
successful, 

The importance of achievement standards, 
their source and effect on achievement be- 
havior, have been stressed by Wolf (1938), 
Gould (1939), and more recently by Crandall 
et al. (1960) who discuss Rotter’s construct 
of minimal goals. The minimal goal level is 
the lowest goal in a continuum of potential 
reinforcements which will be perceived as 
satisfaction. It was proposed that the highly 
persistent individual would be likely to hold 
high minimal standards of success for himself. 

Not only was it expected that the height 
of a child’s minimal goal would affect the 
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amount of his task persistence, but it was 


also thought that the subjectively held proba- . 


bility of reaching these minimal goals, that is 


f 


his minimal goal certainty, would also be 


a factor. Thus, the more certain an individual _ 


Ay 


was that he would be able to reach his mini- | 


mal goal, the more persistent he would be. 


It seemed evident that the child’s certainty "p 


of reaching his minimal goal level would be 
affected by his feeling of competence in that. 
activity. If an individual expects to be able 
to perform at a level much higher than that, 
of his minimal goals he should have high 
minimal goal certainty; 


the person with ~ 
standards much higher than his self-assessed 
level of competence will be more uncertain — 


of obtaining the grade which gives him mini- | © 


mal satisfaction. Four kinds of relationships 
between the level of expectancy and the mini- 


mal goal level were examined as they affect — 


both the amount of minimal goal certainty 
and the length of task persistence. 


1, Students whose minimal goals were 
roughly consonant with the level of perform-_ 
ance they expected to be able to achieve were 
felt to have a positively motivating pattern 
of academic aspiration and would persist in 
an achievement task because they felt they 
had a fairly good chance of achieving at least 
minimal satisfaction. They would be likely to 
have moderate minimal goal certainty. 

2. Students whose minimal goals were 
much higher than their level of expectancy 
would have little possibility of satisfaction 
because the goals they had set for themselves 
would seem out of reach. These individuals 
would be foredoomed to frustration and the 
eventual demoralization described by Wendell 
Johnson’s (1946) IFD pattern, One would 
expect these children to give up soon after 
they encountered difficulty since they would : 
feel there was little chance of reaching even 
their minimal goal. Their nonpersistencé 
would be due to their low minimal goal 
certainty. j 

3. If a child states a minimal goal which 
is appreciably lower than his level of expect- 


ancy, it is suggested that the low minimal = 


goal statement may be defensive-against- 
failure or is being used as a face-saving 


mechanism, By stating that he would be — 


| 


‘were he to receive a grade which was 
that he would be absolutely certain 
ing it, he may be trying to insure 
in his own or in others’ eyes. It was 
that the child who attaches a great 
f importance to doing well (high at- 
t value) would be most likely to be 
erized by a pattern which is designed 
ct him from failure. It became ap- 
that it would be necessary to intro- 
ome sort of control to differentiate the 
| who was stating a low goal just to save 
from the student who was stating a 
imal goal simply because competence 
formance was not important to him. 
s reason, the level of attainment value 
so taken into account for children whose 
expectancy greatly exceeded the grade 
ibed as a satisfactory one (Patterns 3 
4). One might expect that students who 
a high attainment value with this mini- 
oal-expectancy pattern would be task 
stent for several reasons: if they were 
a low goal just to save face in the 
of poor performance, it might be as- 
hat they actually held higher minimal 
5 which would mediate their behavior, but 
fraid to state them; other children in 
toup may be applying a successful de- 
‘mechanism and would be running little 


was easily within their reach. However, 
children, would also persist even after 
Nave reached their minimal goal level 
$ the rewards for greater success are 
7 important to them, that is, they have a 
ttainment value. In other words, suc- 
hay not be an all-or-none matter with 
oup. Crandall et al. (1960) discuss the 

of standards as a characteristic which 
| be differentiated in the understanding 
i levement motivation. Thus, the minimal 
Statement may be a measure of the low- 
of their scale of satisfaction, and they 
y certain that they will reach this, but 
they attach great importance to 
matics achievement, they persist, striv- 


feels it is relatively unimportant to 
(low attainment value) and states 
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that he is satisfied if he attains a very low 
level of performance (low minimal goal) 
which he is sure to reach (high minimal goal 
certainty) because of his judgment that he is 
highly capable (high expectancy). This child 
would be nonpersistent because he is satisfied 
with very minimal performance and achieving 
in mathematics at any higher level is of 
little importance to him. 

A child’s achievement attitudes and behay- 
ior are believed to be affected by the relevant 
values perpetuated within his social class. 
The job aspirations of a child will, in part, 
determine the differential importance of any 
given academic pursuit. In addition, the 
academic achievement behavior of the par- 
ents, their own standards and those they hold 
for the child’s performance should have some 
effect on his minimal goals and level of 
expectancy. Furthermore, placing a high value 
on persistence in academic pursuits and re- 
warding a child for such behavior would be 
more likely to be associated with some social 
class orientations than with others. Thus, 
socioeconomic status was believed to bear a 
predictive relationship to task persistence, 
Previous research has demonstrated that it 
is the middle class that lays the most stress 
on achievement; upper-upper and lower-lower 
class individuals tend to discount the impor- 
tance of academic achievement as the means 
to success. The curvilinear relationship be- 
tween social class and achievement, usually 
expected because of the foregoing reasons, 
was not expected in the present study since 
individuals holding true upper-class values 
were not represented in this sample. 

An additional influence on the length of 
persistence at a task in which one is experi- 
encing difficulty is the student’s desire to 
appear socially acceptable. If the child felt 
that persistence was the socially acceptable 
behavior and it was important to him that 
he meet standards of social acceptability, he 
might persist for these reasons rather than 
because of his own attitudes toward achieve- 
ment, Bronner (1914) found this propensity 
to be a factor in relation to physical persist- 
ence and the work of Crowne and Marlowe 
(1960) and of Crandall, Crandall, and Kat- 
kovsky (1965) illustrates the necessity of con- 
trolling this factor if other predictors are to 
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be studied, since it frequently serves as a 
powerful source of motivation for the subject. 

Finally, the desire to obtain approval from 
others was felt to be a factor in the length 
of time an individual persisted. Keister 
(1937) found that actual training in persist- 
ence was possible. Children who received 
approval for persisting and reproof for quit- 
ting were more persistent in later testing with 
different tasks than children who were not 
trained in this way. The extent to which a 
child independently valued academic com- 
petence was felt to be an important factor 
in whether or not he would persist. An opera- 
tion was devised for Reisman’s concepts 
of the inner- and other-directed individual. 
It was anticipated that the child who stated 
that he worked in school to please himself and 
meet his own standards might be expected 
to persist in this situation in which every 
attempt was made to provide no reward from 
the examiner. Other-directed students, who 
worked to obtain approval from others, would 
be less persistent in this situation in which 
approval was not forthcoming from the 
examiner, 

In summary, this study tested the follow- 
ing hypotheses: 


I. The length of time which a subject will 
persist at a mathematics problem is posi- 
tively related to the expectancy, minimal goal 
level, and attainment values he holds for 
mathematics. 

IIA. The persistence time of individuals 
is predictable from the relationships between 
their minimal goal level, expectancy, and at- 
tainment values. Children whose standards are 
consonant with their expectancies and/or 
those with the defensive-against-failure pat- 
tern described above (1 and 3), are more 
persistent than children whose goals are truly 
lower than their expectancies or those who 
hold unreasonably high goals (2 and 4). 
These relationships are expected to be pre- 
dictive of task persistence because of the 
amount of minimal goal certainty associated 
with each pattern and/or the value of 
attaining academic success for the individual. 

IIB. There is a direct, positive relation- 
ship between the amount of minimal goal 
certainty and the amount of task persistence. 
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III. Social class values for achievement and 


rewards for persistence are predicted to gen- , 


eralize to the experimental situation so that 


there is a positive relationship between. © 
socioeconomic status and the length of task 


persistence. 


IV. Inner-directed students are predicted. | 


to be more task persistent than other-directed 
individuals. 


Two additional relationships were exam- 
ined: a test was made to determine if the 
IQ measure related significantly to task per- 
sistence; an effort was made to ascertain 
whether the child’s responses to the question- 
naires or his behavior in the experimental 


situation was related in any systematic way _ 


to his need to appear socially desirable. 


METHOD 
Subjects 


Over 500 seventh-, eighth-, and ninth-grade chil- 


dren were tested from a community populated 
mainly by families who derive their income from 
work at a large military establishment on the 
outskirts of the same city. 


Procedure 


A 1-page questionnaire was presented to the 
children in each classroom by a female examiner 
who identified herself with the Fels Research 
Institutet Among other questions, the student was 
asked to give his minimal goal and minimal goal 
certainty levels for mathematics and for English. 
The minimal goal question stated: 


Circle the grade below which shows 
the lowest grade that you could get in 
mathematics and still be satisfied. 


A A= Bt B B C+ C C— 
D+ D D- F 


The minimal goal certainty question asked: 


How certain are you that you could 
get at least the grade you circled 
above? 


Dea e So TE 8: 9.10 


uncertain certain 


®The author wishes to thank the teachers and 
students of the Fairborn Junior High School for 
their cooperation with the collection of the data) 
and Robert Martin, Principal, and Jaenice Middle- 
swart, Secretary, for arranging the schedule. aa 

4The author appreciates the help of Virginia 
Crandall, Suzanne Good, Elinor Waters, Shelley 
Wing, Beverly Collins, and Janet Robbins with this 
phase of the study. 
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On the same questionnaire the child was asked: 


Why do you try to do well in school? Or, if you 
feel you do not try, then why not? 


These responses were scored according to the inner- 
- other directed criteria developed for this study. Re- 
sponses scored as inner-directed were those in which 
the person stated that he did/did not work because 
of some intrinsic motivation, personal goals, likes or 
dislikes, or cases in which he has internalized external 
standards so that he works because of feelings of 
pride, shame, or satisfaction. Other-directed responses 
“ must have reference to the fact that the reinforce- 
ments which motivate performance are external’ and 
come from either parents or school. The person 
works either to please others and win approval for 
academic work or to avoid their disapproval. A 
scoring manual was developed with specific examples 
from pretest questionnaires. 

One week later, a new examiner appeared in 
the classroom to administer the expectancy and 
attainment value questionnaire as well as the Chil- 
dren’s Social Desirability Scale (Crandall et al, 
1965). The statement from which the expectancy 
level was obtained instructed the child: 


Circle the grade below which shows 
the grade you really expect to get in 
mathematics on your next report card 
for this grading period. 


A A= B+B; B=) CE CIC 
D+) Depa 


The absolute attainment value statement said: 


Circle the number below which shows 
how important it is to you to do well 
in mathematics, 


r E A Sones 


not very important 


8 9 10 


very important 


P On the same questionnaire, the student was asked 
+ to indicate his relative attainment value: 


Write on the lines below, the names 
of the following school subjects to 
show how important it is to you to 
be good in each of them. 

Social Studies (History or Geog- 
raphy); Mathematics; Art; Sci- 
ence; English; Physical Education 

List them from the most important 

to the least important. 


i R A the results from both the absolute and 
Bea attainment value scales were examined, the 
the aa attainment value score was used to test 
it aR hypotheses because it was believed that 
Gey be the importance of mathematics achieve- 
wie hee se which should predict persistence, not 
iene achievement in mathematics is more or less 
tüd nt than success in English, for example. 

Y of an absolute attainment value would repre- 
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sent a theoretical contribution since previous studies 
have relied on relative attainment value. 

From the full 48-item Children’s Social Desira- 
bility (CSD) scale, 25 items were selected as par- 
ticularly discriminative in previous studies. This is 
a true-false questionnaire in which the child must 
characterize himself in a highly improbable way in 
order to appear socially desirable. An example is: 
“I have never felt like saying unkind things to 
a person.” 

From the original 500 students, an attempt was 
made to select 20 subjects, whose minimal goals and 
expectancies differed by at least one grade level, for 
each of the four minimal goal-expectancy discrep- 
ancy groups described above. A discrepancy of this 
size was 1 SD from the mean discrepancy. This was 
accomplished for all but Group 4 for which there 
were only 14 subjects, a fact which attests to the 
rarity of a combination in which the student 
states that he cares little for mathematics achieve- 
ment and holds minimal goals which are appreciably 
lower than his level of expectancy, 

The Henmon-Nelson IQ test score obtained by 
the student when he was in the sixth grade was 
used as the IQ measure. The mean IQ was 108.5, 
the standard deviation 11.6, and the range of scores 
was 81-136. 

The Hollingshead (1957) classification of occupa- 
tion was used to determine the socioeconomic status 
of each subject because it included the position of 
military ranks. However, observer experience in this 
community would indicate that members of the 
military assigned to Class 1 do not hold the attitudes 
of “the 400” usually associated with an upper-upper 
class position. It was for this reason that predictions 
from socioeconomic status to task persistence were 
linear rather than curvilinear. The Hollingshead sys- 
tem assigns occupations to 1 of 7 categories with a 
rating of 1 representing the highest class, 7, the 
lowest. The distribution of subjects by social class 
was trimodal with 20% of the cases in Class 1: 
that of “higher executives, proprietors of large 
concerns, and major professions, with commissioned 
military officers of major and above.” Twenty per- 
cent fell in Class 3; “administrative personnel, small 
independent businessmen, and minor professionals”; 
and 30% represented Class 5: “the skilled manual 
employee.” 

Each of the 74 students in the final sample was 
seen individually by one of two unfamiliar female 
examiners.5 The child was presented with a com- 
pleted 3X3 mathematics magic square (Kraitchik, 
1942) as an example of the kind of task he would 
be asked to do and the rules of solution were 
explained. He was told that he would be given a 
similar math problem (4 X 4) in which he would 
have to ‘arrange the numbers to yield the correct 
sum for each column, row, and diagonal. He was 
given the problem when the directions were under- 
stood. A general information questionnaire was 


5 Beverly Collins’ assistance is greatly appreciated. 
She and the author served as examiners for the task 
persistence measure, 
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placed face-down next to the student. It was ex- 
planied that this “second task” had nothing to do 
with mathematics and was available whenever the 
student wished to turn to it but that once he began 
it he could not return to the mathematics problem. 
The examiner withdrew, still within the subject’s 
view, to read and surreptitiously record the length 
of time the. child persisted with the mathematics 
puzzle. Because of scheduling problems, no child was 
allowed to persist more than 45 minutes. None of the 
subjects was able to complete the problem during 
this time. At the end of each session, the subject 
was reassured that although the problem was soluble, 
it was indeed difficult. He was briefly shown the 
solution to demonstrate that he had not been 
deceived. 


Statistical Analysis, 


Because the data did not meet the requirement of 
normal distribution, McCall T scores were assigned 
for all variables, All correlation coefficients were 
Pearson rs and p values reported resulted from 
two-tailed tests of significance, except where specified. 


RESULTS AND DISCUSSION 
Hypothesis I 


It was predicted that the higher the child’s 
expectancy for success, his minimal goal level, 
and his attainment value for mathematics, 
the more he would persist in the face of 
difficulty. Expectancy values, obtained in the 
form of letter grades, were transformed into 
numerical scores ranging from 1 to 12 
with high scores representing high grades; 
thus, F = 1, D— = 2, D = 3, etc. The level 

` of expectancy was found to relate positively 
and significantly to the length of task per- 
sistence (r= 47, p<.001) for the total 
group. These correlations for the boys and for 
the girls separately reveal that the relation- 
ship appears to be somewhat stronger for the 
boys (r = .52, p < .001, n = 46) than for the 
girls (r = 38, p < .05, n = 28). 


TABLE 1 


MEANS AND STANDARD DEVIATIONS OF 
Task PERSISTENCE TIMES 


Group | Discrepancy patterns | N M b SD 
1 |MG=Ex 20 | 30.05 10.71 
2 |MG> Ex 20 | 22.85 11.64 
3 |MG < Ex, AVhigh| 20 | 33.45 10.04 
4 |MG <Ex,AVlow | 14 | 30.50 14.36 


Note.—Data presented in minutes. MG = mini 
Ex = expectancy, AV = attainment value, re et a 
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Neither the minimal goal level nor the ab- 
solute attainment value scores were found to 
relate significantly to the length of task 
persistence. 


Hypothesis ITA 


It was predicted that differences in the 
persistence times for individuals would occur 
depending on the relationship between their 
minimal goal levels and their expectancies, 
The means and SDs of persistence time, in 
minutes, are given in Table 1 for each of the 
four minimal goal-expectancy discrepancy 
groups. The range of task persistence was 3-45 
minutes. Fifteen children continued working 
at the problem until they were stopped at the 
end of 45 minutes. 

With a one-way analysis of variance for 
unequal Ws (Lindquist, 1953) a significant F 
ratio of 2.80, p < .05, was found between 
groups. A comparison between the groups by 
t test revealed the source of this difference. 
Group 1 (with consonant minimal goals and 
expectancies) was found to be significantly 
more persistent than Group 2 (that character- 
ized by minimal goals which are higher than 
expectancies), ¢ = 2.04, p< .05. The only 
other significant comparison was between 
Groups 2 and 3 indicating that Group 2 is 
significantly less persistent than Group 3 
(those with defensive minimal goal state- 
ments), t= 3.19, p< .01. Since Group 1 
does not differ significantly from Group 3, it 
would appear that it is only the children in 
Group 2 who behaved differently from the 
others. Group 4 does not differ significantly 
from any of the other groups (perhaps be- 
cause of its small N and large variance). 

Since there was a significant correlation 
between expectancy and task persistence for 
the total group (r=.47), and between the 
discrepancy scores and task persistence, it was 
necessary to examine the relationship between 
the discrepancy scores and the level of ex- 
pectancy. It may have been that Discrepancy 
Group 2 was a subsample with low expectancy 
scores while Group 3 and Group 1 had high 
expectations, This possibility was examined 
with a one-way analysis of variance procedure 
on the four discrepancy groups using the 
subjects’ expectancy scores, rather than theif 
task persistence scores, A significant F ratio 
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of 12.87, p < .01, was found and subsequent 
_-# tests were computed. The results of this 

_ series of comparisons revealed that significant 
. differences existed between the low expectancy 
scores of Group 2 and the high expectancies 
of Group 3 (ż = 9.36, p < .001). The expec- 
` tancies of Group 2 were also significantly 
lower than those of Group 1 (¢= 5.34, p< 
001). Thus, the analysis of variance using 
_ expectancy scores produced results which are 
` essentially similar to that of the analysis of 
variance with persistence data. 

From these analyses it was clear that there 
is a confounding of the two variables, expec- 
tancy and discrepancy scores, so that a clear 
test of Hypothesis II was not accomplished. 
One cannot say whether the discrepancy be- 
tween minimal goal and expectancy is an ad- 
ditional and independent predictor of task 
persistence without obtaining a representative 
sampling of expectancy for each type of dis- 
crepancy score in a future study. 


Hypothesis IIB 


The amount of minimal goal certainty as- 
sociated with discrepancy patterns was pre- 
dicted to bear a direct, positive relationship 
to the amount of task persistence. Predictions 
based on the discrepancy scores rely on the 
intervening variable of minimal goal cer- 
tainty; that is, students do/do not persist 
depending on how certain they are that they 
. will be able to attain at least minimal satis- 
faction. The minimal goal certainty score was 
found to relate negatively and significantly 
to the minimal goal-expectancy discrepancy 
Score (rho = —.27, p< .02). That is, the 
‘higher the minimal goal certainty, the less 
does the minimal goal level exceed the level 
of expectancy. Minimal goal certainty was 
also found to relate positively and signifi- 
‘cantly to task persistence for the total group 
_ &= 42, p< .001), for boys (r= .36, p < 
. 02), and for girls (r = .54, p < .01). These 
- Tesults support the interpretation that it is 
the child who feels that there is a good chance 
of obtaining at least minimal satisfaction 
(high minimal goal certainty) who persists. 

more one’s minimal goals exceed the level 
of expectancy, the lower the chances of ob- 


taining satisfaction and the less persistent the 
Individual, 
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The minimal goal certainty score has been 
conceptualized as a kind of expectancy score, 
but it is an expectation of reaching one’s 
minimal goals, Since those with minimal goals 
lower than their expectancy level and high 
minimal goal certainty all turned out to have 
expectancies above the median, while those 
with higher minimal goals than expectancies 
and low minimal goal certainty all had low 
expectancy levels, it is not surprising that 
there was a direct correlation of minimal 
goal certainty and expectancy (r = .46, p < 
.001). Once again, studying the independent 
effects of minimal goal certainty and expec- 
tancy is possible only if there is representa- 
tive sampling for all levels of expectancy with 
each discrepancy group. } 


Hypothesis III 


Social class was predicted to bear a direct, 
positive relationship to task persistence, The 
upper end of the current sample was felt to 
represent children whose parents accept the 
middle-class belief in the importance of edu- 
cation as a means to social and vocational 
advancement, so that the children would have 
experienced consistent reinforcement for 
achievement efforts such as persistent be- 
havior, and the curvilinear relationship often 
found between social class and achievement 
variables was not expected. The lower-class 
child probably sees little need for academic 
success in his future nonacademic job, This 
fact, coupled with the lower-class deemphasis 
on achievement, would contribute to less per- 
sistence on his part. The absolute attainment 
value for mathematics was, indeed, found to 
bear a low but significant positive relation- 
ship to social class (r = .28, p < .05). In line 
with the prediction, social class was found to 
bear a significant positive relationship to task 
persistence, but only for the boys (r= .42, 
p< .01). 

It was confirmed that the other-directed 
child was, in fact, less task persistent than 
the one who was inner-directed (7point piserial 
= .23, p<.02). Sensitivity to social rein- 
forcements, which are absent in the experi- 
mental situation, would appear to be a con- 
tributing factor to the lack of motivation for _ 
the nonpersistent, other-directed child. 

There was no significant relationship be- 
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tween IQ and task persistence (7 = .24). In 
this study too, then, it has been found that IQ 
alone is not sufficient to account for task 
persistence; rather, it is the child’s perception 
of his capabilities (i.e., his level of expec- 
tancy) which will determine his behavior in 
situations in which success is largely depend- 
ent on persistence. 

The CSD scale did not correlate signifi- 
cantly with either the independent or depend- 
ent variables. Thus, those who persisted ap- 
parently did not do so merely to meet the 
perceived demands of the experimenter. 

The problem of measuring subjectively held 
beliefs, values, and expectancies is central to 
this study. There is some evidence by which 
we may examine the extent to which expec- 
tancy statements are tied to reality. If one 
examines the correlation of the grade the 
student says he expects to get in math and the 
grade he actually receives (r = .63 for boys 
and .84 for girls, p < 001 for both), it be- 
comes apparent that the child is able to report 
realistically the grade which he may expect 
to receive and that he will be willing to state 
this if requested, as in the present study. 

The lack of prediction from the minimal 
goal statement may be due, in part, to the 
fact that the student has only a hazy idea of 
the amount of work required to meet his 
minimal goal level. Perhaps differences in 
the predictive utility of the expectancy and 
minimal goal statements are due to differences 
in the amount of feedback which the child 
receives and uses to adjust his expectations 
and minimal goals, His expectations may be 
constantly tested with each examination and 
teport-card grade, In contrast, the minimal 
goal statement asks him for the lowest grade 
he could get and still feel satisfied. This may 
be a grade which is discrepant from his ex- 
pectancy, that is, one which he has always 
exceeded or one which he has never been able 
to attain, Thus, the amount of work required 
to achieve a given minimal goal may be poorly 
defined in classroom work and could -not be 
expected to relate to the length of task per- 
sistence in the experimental situation. If the 
subject were to receive continuous feedback 
as to the level of performance he had achieved 
on the magic square, one would be able to 
| examine the degree of correspondence between 
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his minimal goal level and the length of his 
persistence time. Other reasons, as previously 
stated, may be that the child is working 
toward something higher than his minimal 
goal, the minimal goal statement may be more 
influenced by defenses against public failure, 
by an attempt to conform with public stand- 
ards, or by an unwillingness to admit failure 
to oneself, 

Two measures of the child’s attainment 
value were obtained. The information from 
the 10-point, absolute scale was used to test 
the main hypotheses, It becomes clear when 
one examines the high median attainment 
value score (Md=9.4) that although the 
range is wide (1-10), the distribution is 
skewed so that there is poor differentiation 
of individuals at the upper end of the scale, 
Because of the present cultural stress on 
achievement, it is quite likely. that most 
children have incorporated the achievement 
values of their culture and hold a high attain- 
ment value for competence in math. It is 
clear that performance in mathematics (re- 
port-card grades) is fairly representative* of 
academic performance in general (grade-point 
average). For boys, there was a correlation 
between these two of .90 and for girls, a cor- 
relation of .77, both significant at less than 
the .001 level. The median attainment value 
score indicated by both boys and girls was 9 
in a scale of 1-10. The importance of mathe- 
matics achievement does not seem to be as 
sex differentiated as is commonly thought; 
nor does performance in mathematics appeat 
to differ from performance in other academic 
subject matter, 

The relative attainment value score did not 
relate significantly to anything other than the 
absolute attainment value score (r = .38, 2 < 
01). The fact that no larger proportion o0. 
the variance is held in common by the two 
measures, however, suggests that they are 
getting at different aspects of attainment 
value. It is possible that the crudeness of the 
questions used to measure the two constructs 
is responsible for the lack of a stronger rela- 
tionship between them. It is also possible that 
this same crudity is responsible for the fact 
that attainment value was not found to relate 
significantly to the length of task persistence 
in this study. A comparison of absolute and 
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relative attainment value using more refined 

„measures appears warranted. 
If Hypothesis II held up, after representa- 
. tive sampling was completed to obtain all 
-Jevels of expectancy with each type of mini- 
* mal goal-expectancy discrepancy score, one 
- would be able to explore the antecedents and 
current status of the defense-against-failure 
process which is believed to operate with 
those children who have a high attainment 
* value, high level of expectancy, and low mini- 
, mal goal level. Since task persistence is neces- 
sary in many academic situations, study of 
this most persistent group might yield prac- 
tical implications for the mediation of non- 
persistent, academically incompetent behavior. 
Direct modification of the child’s minimal 
goal-expectancy pattern might result in 
changes in the amount of task persistence he 
displayed. 
' An additional question is why there are 
certain types of discrepancy patterns more 
naturally associated with given levels of ex- 
pectancy, It is suggested that subjects with 
low expectancies and high minimal goals may 
have achieved such discrepancies through 
either of two sequential processes. It may be 
that they start out with a low expectancy and 
obtain their only satisfaction by holding re- 
spectably high minimal goals (i., goals 
commensurate with the standards of an 
achievement-oriented culture). Or, it may be 
. that they start with such unobtainable goals 
that they become disillusioned by their fail- 
ure to reach them and eventually feel incapa- 
ble ôf achieving at even the most menial 
level, that is, they hold low expectations. 
‘These two processes are not mutually exclus- 
lve and may, in fact, work interactively in the 
Same individual, 

Students with the pattern of high expec- 
‘tancies, low minimal goals, and a high attain- 
Ment value may feel confident of doing well 
because they have defined success at an easily 
` obtainable level. It may also be that these 
Students hold broader standards and that 
their behavior is mediated by the upper end 
of these goals; that is, they are working for 
maximal rather than minimal satisfaction. 
The foregoing suggested interpretations bear 
Systematic examination. 
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in support of the above hypotheses. 


to this approach, one basis of attitude 
ge—reality testing—may be found in the 
ic gestalt tenet that an individual always 


ronment (see Sarnoff & Katz, 1954, and 
ith, Bruner, & White, 1956, for the original 
ulation of this functional view). Cohen, 
otland, and Wolfe (1955) devised an as- 
ent procedure to measure the strength 
this postulated need for cognitive clarity. 
|a later study, Cohen (1957) utilized his 
cognition scale to investigate the effects 
Mm attitude change of the interaction between 
ength of a cognitive need and variations in 
ambiguity of the communication situa- 
Cohler and Kelman (1959) provided an- 
her useful extension of this functional ap- 
Oach by demonstrating that the amount of 


i The present study is a portion of a dissertation 
‘Baron, 1963) presented to the Graduate School of 
$ and Science in partial fulfillment of the require- 
for the degree of Doctor of Philosophy at New 
University. The author wishes to express his 
appreciation to Irwin Katz and the late Bob 
n without whose guidance and encouragement 
tudy could not have been undertaken. The con- 
ictive criticisms of Herbert C. Kelman are also 
lully acknowledged. 

Ow at Wayne State University. 


COGNITIVE BASIS OF ATTITUDE CHANGE AS A 
FUNCTION OF MOTIVATIONAL, STYLISTIC, 
AND STIMULUS FACTORS* L 


REUBEN M. BARON? 


University of Michigan 


A sample of 110 male and 89 female undergraduates was used to test some 
derivations from a functional model of the cognitive basis of attitude change. 
The results for the males offered strong confirmation of the major hypotheses 
regarding attitude change. Immediate attitude change was found to involve a 
significant interaction between the level of cognitive need, differences in infor- 
mation storing style, and the variations in the ambiguity of a persuasive com- 
munication. Also, as predicted, a significant interaction effect of need clarity 
and the information storing variable occurred on a delayed measure of attitude 
change. These effects were assumed to be mediated by differences in emotional 
tension generated by imbalances between cognitive needs, cognitive capacity, 
and the demands of the stimulus situation. This assumption was supported by 
a significant triple interaction among all 3 independent variables on a measure 
of perceived frustration. The results for the females offered no reliable evidence 


positive attitude change produced by a per- 
suasive communication was a function of the 
interaction between the strength of the need 
for cognitive clarity and variations in cog- 
nitive style. 

It should be noted, however, that the above 
line of research is characterized by certain 
sins of omission and commission. Cohen et al. 
(1955) and Cohler and Kelman (1959) never 
attempted to manipulate experimentally this 
postulated need for cognitive clarity. Instead, 
they restricted themselves to pencil-and-paper 
assessment procedures having low internal 
consistency.? In addition, because Cohen’s 
(1957) communication topic involved student 
grades, his manipulation of ambiguity may 
have varied not only ambiguity but the de- 
gree of general anxiety produced in the ex- 
perimental situation. The Cohler and Kelman 
study neglected to investigate the stimulus 
side of the communication situation though 
this variable has important implications for 
the activation of the need for cognitive clarity 
and for the choice of a need reduction mecha- 

3A pilot study by the present investigator using 
Cohen’s and Kelman’s cognitive need scales yielded 
values for Cronbach’s alpha coefficient of .29 and 
.23, respectively. This indicates that these scales are 
probably not tapping a single common factor, but 
may involve a confounding of cognitive style with 
cognitive need items. 


219 


220 


More important, however, than any short- 
comings in the above functionally oriented in- 
vestigations is the fact that they call atten- 
tion to the possibility that the cognitive basis 
of attitude change may involve a set of per- 
sonality and situational factors more complex 
than current consistency-type models of atti- 
tude change would suggest. In order to in- 
vestigate this proposition in some depth, it 
was decided to focus on the interactive effects 
on attitude change of variations in the need 
for cognitive clarity, cognitive styles, and the 
ambiguity of a persuasive communication. 
The general conception underlying this kind 
of interaction approach is that while varia- 
tions in the strength of the need for cognitive 
clarity should create differences in the degree 
to which subjects are attentive to and chal- 
lenged by a persuasive communication, knowl- 
edge of stylistic and of stimulus factors is 
necessary before a relatively precise predic- 
tion can be made as to the amount and direc- 
tion of attitude change. It should also be men- 
tioned that the above concatenation of cog- 
nitive factors should be most representative 
of the processes underlying attitude change in 
a situation where the subject conforms to the 
influencing person or group primarily because 
he views them as a source of valid knowledge. 
This kind of stricture would seem to rule out 
face-to-face interaction situations where in- 
fluence acceptance is primarily normative. 
Thus, the present model is deemed most ap- 
propriate for the kind of quasi-interaction 
situation which often characterizes the dis- 
semination of propaganda through the mass 
media, 

Before considering any complex interac- 
tions, it may be helpful to take a separate 
look at the conceptual properties of each of 
the major independent variables. 


Need for Cognitive Clarity 


According to Cohen et al. (1955), the need 
for cognitive clarity “is a need to understand 
and make reasonable the experiential. world 
[p. 291].” Cohler and Kelman define this 
need as “the extent to which a person is made 
uncomfortable by ambiguity and incongruity 
and is motivated to restore a state of clarity.” 
The mode of achieving cognitive clarity seems 
to be dependent on two classes of variables: 
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Variations in the structure of the stimulus 
situations and individual differences in the 
processing of stimulus information. They will 
be discussed in turn. 


Structure of the Stimulus Situation 


The notion that stimuli may differ along an 
ambiguity dimension is one which all psy- 
chologists accept but few stop to analyze, 
The type of ambiguity to be dealt with here 
is characterized by an inversion of the ex- 
pected order of sequentially dependent stimu- 
lus elements. An example of this kind of ap- 
proach to ambiguity can be found in the work 
of Cohen (1957). In this experiment Cohen 
created ambiguity by presenting a solution 
to a problem before the problem itself was 
enumerated. Similarly, information about the 


effects of an event may be presented before ” 


the event itself has been described, or the 
conclusions of a syllogism may be given be- 
fore the underlying premises have been stated. 
In each case, the individual stimulus elements 
are themselves perfectly clear and compre- 
hensible—it is the lack of the expected se- 
quential arrangement of these elements that 
creates the ambiguity. 

The present model assumes that the greater 
the stimulus ambiguity and/or attitudinal in- 
consistency present, the greater the arousal of 
the need for cognitive clarity. The structure 
of the stimulus situation may also affect the 
mode of need reduction. Thus, people may be 
unwilling or unable to fit contradictory stimuli 
into their existing cognitive system if these 
stimuli are also ambiguous. It is assumed that 
a person’s ability to handle such stimuli is 
largely determined by his characteristic cog- 
nitive style. 


Information-Processing Style 


The concept of cognitive control, as used 
by Klein (1958), refers to the various in- 
formation-processing strategies that an indi- 
vidual utilizes in attempting to satisfy mo- 
tives. The leveling-sharpening control, based 
on Cohler and Kelman’s (1959) study, seems 
to be particularly well suited for investiga- 
tions of possible stylistic determinants of atti- 
tude change. However, because the leveling- 


tA notable exception is Luchins’ (1950) excellent 
conceptual analysis of ambiguity. 


Se 
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sharpening construct as used by Klein and 
co-workers has broad personality connota- 
_ tions in addition to its specific cognitive com- 
ponents, this construct will not be used per se. 
stead, the present focus will be on the more 
“limited notion that there are individual dif- 
‘ferences in memory organization along an 
_ assimilation-differentiation dimension. Thus, 
Twill speak of two strategies of information 
storing: assimilating and differentiating. 
I Assimilators are assumed to have poorly 
_ differentiated memory schemata character- 
“ized by a relatively high degree of interac- 
tion between memories and present percepts. 
Differentiators, on the other hand, are as- 
Sumed to possess a memory organization 
» which allows stimulus elements to retain their 
| individuality. Furthermore, persons with 
poorly differentiated memory schemata may 
be expected to have acquired fewer cognitive 
“Categories than those with well-articulated 
memory schemata (see Klein, 1958). This 
difference in cognitive complexity should also 
be reflected in their respective attitudinal sys- 
tems, That is, the evaluative dispositions of a 
differentiator should be embedded in a more 
complex set of beliefs and expectations than 
lose of an assimilator. 
The conceptual basis of a possible interac- 
tion effect on immediate attitude change of 
all three independent variables may be stated 
as follows. When persons who lack the requi- 
Site number of cognitive categories (i.e., as- 
similators) are strongly motivated to impose 
Meaning on an ambiguous communication, 
] , they are likely to be frustrated as a result of 
their inability to organize the content ele- 
ments of the communication to the point 
Mere their meaning is clear enough to be 
| Waluated adequately. This frustration should 
_ accompanied by a strong rejection of the 
‘evaluative position advocated by the per- 
ceived source of their frustration. However, 
a these Same persons encounter a clearly 
{ae communication that is incongruent 
: An attitude they hold, they should be 
fad Prone to persuasion. This propo- 
R Tests on the assumption that a strong 
4 usal of a need for cognitive clarity forces 
; ee to view closely a challenging com- 
ka he which they are ill-prepared to re- 
© “hat is, impoverished cognitive struc- 
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tures are assumed to lead to a deficiency in 
“prior belief defenses,” to borrow McGuire’s 
(1962) term. 

Differentiators, because of their abundant 
supply of cognitive categories, should be more 
successful in restructuring an ambiguous com- 
munication to the point where the ideas are 
sufficiently clear to have a persuasive impact. 
Because of their greater cognitive capacity 
differentiators should be able to see even the 
information contained in the ambiguous com- 
munication as relevant for the satisfaction of 
their strongly aroused need for cognitive clar- 
ity. It is assumed that the ideas expressed in 
a persuasive communication are only relevant 
to the extent that they can be incorporated 
into existing cognitive categories. This suc- 
cessful restructuring should leave differentia- 
tors relatively free of the frustration effects 
predicted for assimilators. Indeed, one could 
argue that there might be sufficient reinforce- 
ment inherent in overcoming ambiguity to 
create a situation where an ambiguous com- 
munication would have a greater persuasive 
impact than a clearly structured one. In any 
case, the combination of a strong motivation 
to achieve cognitive clarity and an ample sup- 
ply of cognitive categories is assumed to coun- 
teract the usual decrement in positive attitude 
change produced by an ambiguous communi- 
cation. 

Under low need for clarity it may be ex- 
pected that increasing ambiguity will be less 
likely to produce consistent differences in 
frustration and attitude change between as- 
similators and differentiators. The differen- 
tiator when no longer motivated to make 
sense out of ambiguous stimuli may either 
ignore them or even become somewhat an- 
noyed. The assimilator, who ordinarily does 
not feel a need for “close listening,” may be 
expected to be relatively unaware of differ- 
ences in ambiguity. 

The above model may be extended to the 
problem of attitude change through time if it 
is assumed that once a person is removed from 
the communication situation the amount of 
attitude change he will continue to manifest 
will primarily be a function of how many 
well-articulated incongruent cognitions he has 
stored; and the strength of his motivation 
to make use of such discrepant information. 
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This effect should occur if the effect of stimu- 
lus disorganization on attitude change is pri- 
marily a matter of a short-term increment in 
arousal rather than an interference with the 
learning process. That is, subjects will prob- 
ably quickly forget any temporary annoyance 
occasioned by a confusing organization of the 
communication. 


METHOD 


Experimental Materials and Procedure 


The subjects were 110 male and 89 female students, 
all freshmen and sophomores, recruited from intro- 
ductory psychology and social science classes at New 
York University College and at Brooklyn College. 
The subjects volunteered for the experiment and 
were not paid for their participation. 

The initial measure of the subjects’ attitude toward 
advertising was obtained in the subjects’ classroom, 
The attitude questionnaires were administered under 
the guise of a study of the relationships between atti- 
tudes and interests, and the evaluation of the read- 
ability of various kinds of written material. Each 
subject rated 22 Likert items on a 6-point scale from 
“I disagree very much with the statement” to “I 
agree very much with the statement.” Sixteen of the 
items were “fillers” concerned with various persons 
and issues of contemporary importance. Six items 
were designed specifically to tap the subject’s atti- 
tude toward advertising. Three of the items were 
positively worded and three were negatively phrased. 
For example, “A lot of advertising is very educa- 
tional” versus “You usually can’t believe what ad- 
vertisements say.” 

About 2 weeks after the subjects were premeas- 
ured they reported to the experimental room. Nine to 
12 randomly assigned subjects were scheduled in each 
group to insure the presence of at least 8—the num- 
ber necessary to represent each of the experimental 
conditions, Procedures involving staggered scheduling 
were undertaken to prevent excessive contamination 
of subjects. When they arrived subjects were seated 
and handed the first experimental forms which in- 
cluded the manipulation of the need for cognitive 
clarity as well as the persuasive communication 
whose structural arrangement constituted the stimu- 
lus disorganization manipulation. Since pilot work 


5 These six items were the most discriminating 
ones on a 20-item attitude scale on advertising de- 
veloped by Robert Lee (1958 unpublished). Adver- 
tising was chosen as the attitude topic because: it is 
frequently written about in the mass. media; it 
elicited a relatively homogeneous negative reaction 
from the liberal arts undergraduates used in the 
pilot sample; and a previous study by Glass, Baron, 
and Giniger (1960 unpublished—see Footnote 6), 
using this topic, demonstrated that attitudes toward 
advertising could be shifted by giving subjects a rela- 
tively short-term exposure to a written persuasive 
communication, 
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had indicated that the experimental sample would 


initially have predominantly unfavorable attitudes 


on advertising, a proadvertising communication was 
used. The communication was presented as a con- 


densed version of an article which appeared ina | 


contemporary periodical of high prestige, written by 
a leading expert in advertising. The article was actu- 
ally an adaptation by the present investigator of a 
persuasive communication originally used in a study 
by Glass et al. (1960 unpublished) .° 


Manipulation of the Need for Cognitive 
Clarity 


Confrontation with the proadvertising communi- 
cation occurred under conditions of high and low 
need for cognitive clarity. The need for clarity op- 
eration itself rested on two different sets of instruc- 
tions about how to evaluate the readability of the 
advertising article. All the subjects read the following 
general instructions: 


You are to evaluate the reading ease of the ma- 
terial. Before I give specific instructions on what 
Td like you to do I want you to read through this 
article quickly. Later you will have a chance to 
read it again more carefully. 


All subjects were allowed a brief general perusal of 
the article. Then, the instructions diverged. In the 
high need clarity condition, the instructions read: 


You are to evaluate the readability of this kind of 
material. The measure of readability which „will be 
used for this group is a reading comprehension test 


on which you will be asked questions dealing with 


the following: 


1. What is the general point of view of the at- 
thor? 


2. What are the implications and consequences of 
holding a point of view like the author's? 


3. An evaluation of the clarity and organization of 
the arguments and conclusions presented by the 
author. 


This reading comprehension test is designed to tell 
us if readers like yourself are able to grasp the 
essential meaning and significance of this kind % 
material, We are interested in whether you ate 
able to follow the logical implications of mait- 
taining a position like the author's, as well 25 
whether you are able to understand the specifie 
content of the article. 


Remember, in reading this material you are t0 
concentrate on the total or overall impact aM 
meaning rather than on the separate points matt 
by the author. 


In the low need clarity condition the instructions 
read: 


We want you to evaluate the difficulty of languag? £ 


in this article, using as your standard a level 0 


€ Unpublished study entitled “An Experimentil 
Study of Some Interpersonal Determinants of Guilt 
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ability that you think an average 12-year-old 
should have reached. In evaluating the reading 
ease of the material you are to take account of 
the difficulty of the words, and the complexity of 
the sentences. 


First, you are to list all the words that you think 
an average 12-year-old would find too hard. By 
hard words we mean words that would be un- 
familiar because they are: technical, foreign in 
origin, or polysyllabic (i.e. have a large number 
of syllables). 


Secondly, you are to list the sentences that you 
think are too complex. A sentence is defined«for 
our purposes as a group of words that is gram- 
matically independent, having its end marked by 
a period or question mark, You are to judge the 
complexity of a sentence by its length (ie. the 
number of words in it) and by its grammatical 
structure (i.e. whether it is a simple or a com- 
pound sentence having a number of clauses). 


Remember, in reading this material you are to 
concentrate on the difficulty of the individual 
words and sentences rather than on the total or 
overall meaning. 


The basic difference between the two instructional 
sets is the extent to which the subject’s attention 
and thinking is being focused on the total meaning 
and jmplications of the communication. No attempt 
was made to assess the strength of the person’s pre- 
€xperimental need for cognitive clarity. 


Manipulation of Degree of Stimulus Organi- 


-zation 


The organization of the content of the persuasive 
communique was varied so as to create ambiguity 
by interfering with the expected sequential arrange- 
ment of the material. This was accomplished by 


‘Placing Proadvertising conclusions before the appro- 


priate premises had been stated, and by describing 
the effects of events which suggested the indispen- 
Sable value of advertising before information about 
the occurrence of the event had been presented. An 
example of this latter technique is given below. Un- 
der the low stimulus disorganization condition, one 
Passage read as follows: 


The American people like advertising and depend 
on it for information. If it were abolished, the 
demand for its return would be overwhelming. 
For eleven days in the fall of 1953, most of the 
. Newspapers in New York were out of business 
ecause of a general strike. The effect on the busi- 
hess and social life of the city was cataclysmic. 
enone store sales, for example, dropped as 
z uch as 25 percent. On the day the strike ended, 
RT organization asked people, “What did 
Eh miss most in not having your paper?” Eighty- 
O percent answered, “The advertisements.” 


Under high 


stim i somes o 
graph read: ulus disorganization the same para 
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The American people like advertising and depend 
on it for information. If it were abolished, the de- 
mand for its return would be overwhelming. The 
effect on the business and social life of the city was 
cataclysmic. Department store sales, for example, 
dropped as much as 25 percent. For eleven days 
in the fall of 1953, most of the newspapers in New 
York were out of business because of a general 
strike. On the day the strike ended a survey or- 
ganization asked people, “What did you miss most 
in not having your paper?” Eighty-two percent 
answered, “The advertisements.” 


Following the article, the “meaning”-oriented sub- 
jects were required to do the following: summarize 
the main point that the author was trying to make; 
evaluate whether the conclusions the author reached 
followed logically from the facts he presented; and 
list at least three major behavioral implications of 
holding a point of view like the author’s, The “gram- 
mar”-oriented subjects were told to list all the words 
and sentences that a 12-year-old would find too diffi- 
cult or complex. The operation for arousing the need 
for cognitive clarity was assumed to be a joint func- 
tion of induced set and the nature of the evaluative 
task. A maximum of 20 minutes was allowed for 
filling out all of the aforementioned materials, 

A second questionnaire was then administered 
without a time limit. (All subsequent questionnaires 
also had no time limit.) The first part of this ques- 
tionnaire contained items designed to measure vari- 
ous aspects of frustration. The subjects rated on a 
7-point scale the extent to which they were “ex- 
asperated” by the experimental task, their willing- 
ness to volunteer for a similar task on another oc- 
casion, how interesting they found the author's pres- 
entation, and their liking for the way the article was 
written. The questionnaire also included an 8-item 
multiple-choice information quiz on the content of 
the article that had just been read. This quiz was 
to serve as a measure of learning. When the subjects 
completed the second questionnaire, they were given 
fresh copies of the attitude questionnaire which they 
had originally filled out. The readministration of the 
attitude questionnaire was justified to them on the 
grounds that the investigator was interested in the 
reliability of the scale. 

Upon completion of the attitude questionnaire, 
subjects received a questionnaire primarily designed 
to check on the effectiveness of the experimental 
manipulations. This was accomplished by including 
items whose content referred to various assumed ef- 
fects of arousing a need for cognitive clarity and 
various assumed effects of producing stimulus dis- 
organization. For example, the subjects were to rate 
themselves on 7-point scales with regard to the de- 
gree to which they “deliberated upon the implica- 
tions and consequences of what the article said,” 
and the degree to which “the article was confusing 
and ambiguous.” This questionnaire also contained 
scale items designed to measure the subject’s enjoy- 
ment of the entire experiment and the success of the 


deceptions. 
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Procedure for Assessing the Information Stor- 
ing Variable 


The extent of each subject’s tendency to assimi- 
late memories to one another, or to differentiate 
them, was ascertained by the use of a paper-and- 
pencil measure designed originally by Irvin Child 
(1960)7 to measure Klein’s (1958) leveling-sharpen- 
ing control principle. This scale was selected for use 
rather than the Klein-Holzman schematizing task be- 
cause it specifically focused on differences in memory 
organization, as opposed to Klein’s measure which is 
largely perceptual in character. Subjects were asked 
to indicate on a 4-point scale ranging from “de- 
cidedly true” to “decidedly not true,” how repre- 
sentative a series of 20 descriptive statements were 
of their own behavior or philosophy. These involved 
10 statements worded so that a “true of myself” re- 
sponse would indicate a differentiating orientation, 
and 10 statements phrased so that a “not true of 
myself” response would indicate a differentiating 
style of memory organization. The following are 
illustrative items: “I have a clear mental picture of 
the passage of time since I was born, with the ma- 
jor events of my life and of world events pretty 
accurately placed” versus “My memories of early 
childhood are of a general feeling rather than of a 
detailed experience”; or “My life has included a 
number of highly distinctive experiences which have 
meant a great deal to me” versus “As I look back 
on it, my relation to teachers in secondary school 
seems to have been much as it was in primary 
school.” 

„Although the original Child scale consisted of 20 
items, subsequent analyses of the results were con- 
fined to the 16 most discriminating items. These were 
selected on the basis of an extensive item-analysis 
study using 103 undergraduates at the Washington 
Square College of Arts and Science at New York 
University. The 16-item scale had an internal con- 
sistency alpha coefficient of .66, as compared to an 
alpha of .44 for the original 20-item scale. 

The questionnaire containing the assimilating-dif- 
ferentiating scale was administered after the subjects 
had experienced the experimental manipulations and 
completed all other questionnaires. Subjects were 
classified as assimilators and differentiators on the 
basis of a median-split procedure. The administra- 
tion of this questionnaire was justified to the sub- 
jects in the following manner: 


It is important for us to obtain information about 
certain personal characteristics that might be ex- 
pected to affect one’s evaluation of the reading 
material, 


“Delayed After” Measure of Attitude 


The experimenter returned to the original class- 
rooms 2 or 3 weeks after the experimental session 
and administered the attitude questionnaire a third 
time. The subjects were told that it was necessary 


* Unpublished study entitled “Measure of Level- 
ing-Sharpening.” 
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to do so “for the purpose of a reliability study de- 
signed to standardize the scale for a college popula- 
tion.” After all the attitude forms had been collected, 
the subjects were told, in very general terms, the. 
purpose of the investigation. 


RESULTS 


Effectiveness of the Experimental Manipula- 
tions 


Five items on the postexperimental ques- 
tionnaire were used to measure the effective- 
ness of the manipulation of the need for cog- — 
nitive clarity. Subjects rated themselves on’ 
7-point scales with regard to the degree to 
which they engaged in the following behav- 
iors: f 

1—deliberated upon the implications and | 
consequences of what the article said 

2—were aware that “the point of view at- 
tributed to the author was opposed to mine” 

3—were interested in evaluating “the sound- — 
ness of the author’s arguments” 

4—were made to “think about the sound- 
ness of my own point of view” 

5—were “stimulated” to think up counter- 


and 89 females, Items 1, 3, 4, and 5 yielded | 
significant mean differences in the expected 
direction between the subjects in the high and 
low need for cognitive clarity conditions. In 
addition, when the mean scores of the 5 items 
were pooled to make a total score index, 4 
reliable difference was obtained between high 
and low need clarity subjects (¢ = 2.77). 

There is also evidence that degree of arousal | 
of the need for cognitive clarity was effec: 
tively varied for both males and females, 
when the data for the two sexes are analyzed 
separately. It should be noted, however, that 
the manipulation was somewhat more effec- 
tive for the males. 

The postexperimental questionnaire als? 
contained four 7-point items requiring the 
subject to evaluate the ambiguity of the pê 
suasive communication. Two of them dealt 
with the extent to which the subjects found 


8 All ¢ tests in this report are two-tailed ui 
otherwise indicated. Values of p at or below the © 
level will be considered statistically significant. Values 
of p between the .10 and the .05 level will be 1° 
garded as trends. 


arguments. 
For the total sample consisting of 110 males 


: 


i 


A 
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the article “confusing and ambiguous” and 

_ “poorly organized.” Two items were supposed 
to tap the subject’s perception of the amount 

„of effort that would be required for any “in- 
telligent person” to understand the articles 

“and the degree of difficulty such a person 
would have in following the article. 

. For the total sample and for males, sub- 
jects in the high stimulus disorganization con- 
dition found the article more “confusing and 
“ambiguous” (t= 2.70, p < .01 and ¢ = 2.64, 

. p=.01) and more “poorly organized” (¢ = 
4.31, p < .001 and £ = 3.15, p < .01 for the 
total sample and for males, respectively). In 
addition, when the scores on the above items 
were pooled and ¢ tests done on the combined 

* score, reliable differences were found between 

_ high and low stimulus disorganization sub- 
jects for the total sample and for males (p’s 
<.01). For females, there were significant 

differences between conditions on Item 2 (t 
= 3.30, p < .01) and on the combined item 
scores (t= 2.01, p < .05). However, there 
was no reliable difference between high and 
low stimulus disorganization females in their 
perceptions of how “confusing and ambigu- 

. ous” the article was. (Female subjects in both 
conditions rated the article as being com- 
paratively low on this dimension.) 

The items designed to tap perceived “ef- 
fort” and “difficulty” provided only weak 
evidence as to the effectiveness of the stimu- 
lus disorganization induction for the total 
Sample and for males.” That is, while there 
Was a tendency for those in the high stimulus 

l „disorganization condition to perceive that the 
article required more “effort for understand- 
Ing” than for low stimulus disorganization 
Subjects (¢ = 1.84, p < .10 and #=1.65, p 
S -10 for the total sample and for males, re- 
spectively), there was no indication of a re- 
Jiable difference in their perceptions of the 


ont failure to obtain more than trends on these 
RT may be a function of their wording. That is, 
Rae a confusing and ambiguous” and “poor or- 

À telio pena referred only to the article, the 
* jectiv and “difficulty” items also contained a sub- 
telli i lereni they were prefaced by “any in- 
keete aron would.” This difference may have 
jects n he a social desirability effect such that sub- 
Sons, elused to admit that they, as intelligent per- 

», Would have to put out a lot of effort to under- 


Stand th P ni a P 
the aaen A or have any difficulty in following 
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degree of “difficulty in following.” No signifi- 
cant differences were found for the females 
on either of these items. 

One last finding should be mentioned. Be- 
cause the administration of Child’s scale fol- 
lowed all other questionnaires, as well as the 
experimental manipulations, there existed the 
possibility that these scale scores had been 
influenced by the subject’s previous experi- 
ences in the experimental situation. In order 
to explore the possibility that the experimen- 
tal variables had affected scores on Child’s 
scale, ¢ tests were performed separately for 
males and females. No significant differences 
were obtained for either of the two experi- 
mental variations (high versus low need for 
clarity and high versus low stimulus disor- 
ganization). 


Effects of the Independent Variables on Im- 
mediate and Delayed Attitude Change” 


The immediate attitude change measure in- 
volved a comparison of the subjects’ scores on 
six advertising items administered 2 weeks 
before and immediately following the pres- 
entation of a persuasive communication.» A 
change score was computed for each subject 
taking the difference between algebraically 
summed scores on these items. 

Table 1 presents the analysis of variance 
results for the total sample and for males and 
females separately. For the total sample, the 
data revealed no statistically significant ef- 
fects for the independent variables either 
singly or in combination with one another. 
An examination of the distribution of means 
for the males and for the females, as shown 
in Table 2, indicates that the cells showing 
the maximal amount of positive attitude 
change differ for the sexes. Therefore, the 
combined sample data reflect a canceling out 


10 Because there were significant sex differences in 
the variability of the assimilating-differentiating 
scores (SD =2.05 for females, SD = 7.39 for males, 
F =2.35, p < 01) and in the effectiveness of the ex- 
perimental manipulations, it was decided that for 
each of the main dependent variables, separate analy- 
ses of variance should be done for the total sample, 
males, and females. 

11A comparison of the cell means on the initial 
attitude measure revealed no significant preexperi- 
mental differences between conditions for the total 
sample and for the males and females taken sepa- 


rately. 
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TABLE 1 
ANALYSES OF VARIANCE OF IMMEDIATE ATTITUDE CHANGE SCORES FOR THE TOTAL SAMPLE, 
Mates, AND FEMALES 
Total sample Males Females 
(N = 199) (N = 110) (N = 89) 
Source 
af MS F af MS F af MS PF 
Stimulus disorganization (A) 1 88 1.08 1 1.25 <1 1 19 <1 
Need for cognitive clarity (B) 1 05 <1 1 27 <1 1 2.36 | 1.51 
Assimilating-differentiating (C) 1 .03 <i 1 50 <1 1 -09 <1 
AXB 1 .01 <i 1 4.03 2.69 1 10.74 | 6.88 
CXB 1 1.09 TORE A 3.81 2.54 1 .36 <1 
CXA 1 1.56 1.95 1 54 <i 1 2.03 | 1.28 
AXBXC 1 1.40 1.75 1 6.06 4.04** 1 2.89 | 1.85 
Error 191 .80 102 1.50 81 1.56 


Note.— Because of the unequal cell frequencies an analysis of variance technique with a harmonic correction factor was 
utilized (according to Walker & Lev, 1953). In addition to requiring an MS, based on a correction factor, the technique involves | 


doing the analysis of variance on the cell means, 
ver > < 01. 


TABLE 2 


MEAN Amount or ĪMMEDIATE CHANGE OF MALES 
AND FEMALES BY EXPERIMENTAL CONDITIONS 


Stimulus disorganization 


High Low 
Males 
High need clarity 
Differentiators 2.83 (N= a 2.78 (N=14) 
Assimilators —.31 (V=16) | 4.16 (W=12) 
Low need clarity 
Differentiators 1.50 (N =14) | 2.09 (V=11) 
Assimilators 4.60 (V=15) | 2.75 (N=16) 
Females 
High need clarity 
Differentiators 4.63 (N =16) | 2.73 (N=15) 
Assimilators 5.00 (V=7) |2.89 (N=9) 
Low need clarity 
Differentiators 2.82 (N=11)|3,33 (N =12) 
Assimilators 0.00 (V=11) }4.25 (V=8) 


Combined means for each variable across 
both other variables 


Males Females 

Need clarity 

High 2.37 3.81 

Low 2.74 2.73 
‘Stimulus disorganization 

High 2.16 3.11 

Low 2.95 3.43 
Differentiators- 

assimilators 
Differentiators 2.30 3.38 
Assimilators 2.80 3.04 


Note.—The analyses of variance were performed not on 
these raw score means but on corrected means based on dis- 
tributions of the raw change scores that had a constant of 12 
added to each score in order to eliminate negative scores. 


of effects due to differences in male and fe 
male reactions to the independent variables. | 
Table 1 reveals that there was only one 
significant effect of the independent variables 
on the immediate attitude change score of fe- 
males, a Stimulus Disorganization x Need for 
Cognitive Clarity interaction (F = 6.88, p < | 
-01). The results for males, however, as shown” 
in the same table, provided statistically reli- 
able support for the general hypothesis of a | 
triple interaction on immediate attitude scores | 
(F = 4.04, p< .05). Further, the findings” 
for the males, as presented in Table 2, were 
also consistent with the suggested pattern of | 
specific effects. That is, among male subjects 
with a strong need for cognitive clarity, only 
the assimilators showed a significant, decre- | 
ment in positive attitude change with increas j 
ing stimulus disorganization, as indicated by 
Duncan’s multiple range test (MRT). Un 
der conditions of low arousal, assimilators and f 
differentiators showed no consistent differences 
in amount of positive attitude change as the 
degree of stimulus disorganization increased. 
Table 3 presents the analysis of variance 
results on the delayed measure of attitude { 
change for the total sample and for males and 
females separately. Table 3 shows that for the 
total sample, the hypothesized interaction ef- 


12 All comparisons of cell means were based on 4 | 
version of Duncan’s multiple range technique (mock 
fied for unequal Ns) as described in Kramer (1956) 
and Duncan (1957). The multiple range test is tw 
tailed. 
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TABLE 3 
X ANALYSES OF VARIANCE OF DELAYED ATTITUDE CHANGE SCORES FOR THE TOTAL SAMPLE, 
5 MALES, AND FEMALES 
ia Total sampl M: 
N = 16h N zn ets) 
5 Source 
ag df MS F df MS F df MS F 
* Stimulus disorganization (A) 1 18 <1 1 02 <1 q; 12 <1 
Need for cognitive clarity (B) 1 ll <i 1 02 <i 1 1.48 <1 
~ Assimilating-differentiating (C) 1 1.49 | 1.96 1 -56 <1 1 1.67 <1 
AXB 1 32 <i 1 4.13 3.44* 1 2.63 1.28 
CXB 1 5.59 i Vagadi 1 16.68 | 13.90*** 1 06 <1 
_CXA 1 2.97 .90** 1 1.87 1.56 1 3.14 1.53 
"“AXBXC 1 17 <1 1 38 <1 1 — = 
Error 159 -76 89 1.20 62 2.05 
ee subjects who were present for the immediate test of attitude change were absent for the delayed test. 
Wp S 105. 
a <.01 


fect of Assimilating-Differentiating X Need 
for Cognitive Clarity on the delayed measure 
.of attitude change was significant (F = 7.35, 
P< .01). Under high need for clarity, differ- 
entiators manifested significantly more posi- 
tive attitude change than assimilators (p < 
01 for Duncan’s MRT), while there was no 
teliable difference between them under low 
_ need arousal. Further, Table 3 reveals that 
a significant interaction occurred between the 
information storing and stimulus disorganiza- 
„tion variables (F = 3.90, p < .05). The main 
trends to be observed are the following: dif- 
ferentiators changed significantly more than 
_ assimilators under high stimulus disorganiza- 
‘tion (p < 01 for Duncan’s MRT); and no 
Significant difference was obtained under the 

_ low stimulus disorganization condition. 

The delayed attitude change results, as pre- 
sented in Table 3, also show that the hypothe- 
Sized significant Assimilating-Differentiating 
F Need Cognitive Clarity interaction occurred 

or the males (F = 13.90, p < .001). There 
aes no evidence of this interaction with the 
emales. Table 4 presents the distributions of 
ae and female means on delayed attitude 
“hange. When the means of males were com- 

: Aae two effects were evident: among per- 
` E with a strongly aroused need for cog- 
- Mtive clarity, differentiators showed signifi- 
ja many more attitude change in the induced 
i aion than assimilators (p < .01 for Dun- 
E J MRT); among persons with a weak 
Susal assimilators changed significantly more 


TABLE 4 


MEAN AMOUNT OF DELAYED ÅTTITUDE CHANGE 
OF MALES AND FEMALES BY EXPERIMENTAL 
CONDITIONS 


Stimulus disorganization 


High Low 
Males 
High need clarity 
Differentiators 3.90 (N =10) | 4.00 (N =13) 
Assimilators —.93 (N =15) | 2.00 (N=12) 
Low need clarity 
Differentiators 2.05 (N =12)| .20 ia 10) 
Assimilators 3.92 (N=13)|3.08 (N =12) 
Females 
High need clarity 
Differentiators 4,08 (N =12) | 2.00 (V=9) 
Assimilators 2.17 (N=6) | 2.43 (V=7) 
Low need clarity 
Differentiators 2.33 (N=9) |2.38 (V=8) 
Assimilators. —.09 (N=11) | 2.63 (V=8) 


Combined means across all conditions 


Males Females 

Need clarit; 

High if 2.24 2.67 

Low 2.32 1,56 
Stimulus disorganization 

High 2.24 2.12 

Low 2.32 2.33 
Differentiators- 

assimilators 
Differentiators 2.55 2.70 
Assimilators 2.02 1.79 


Note.—The analyses of variance were performed not on 
these raw score means but on corrected means based on dis- 
tributions of the raw change scores that had a constant of 12 
added to each score in order to eliminate negative scores, 
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TABLE 5 


ANALYSES OF VARIANCE OF SELF-RATINGS OF 


MALES, AND FEMALES 


Reusen M. BARON 


“EXASPERATION” FOR THE TOTAL SAMPLE, 


‘al Femal 
TRER are to) (v= 89) 
Source 
af MS F df MS F df MS F 
Stimulus disorganization (A) 1 -28 3.50* 1 .12 5.50** 1 -03 <1 
Need for cognitive clarity (B) 1 -001 <1 1 00 .00 1 03 <1 
Assimilating-differentiating (C) 1 .03 <1 1 01 <1 1 21 <1 
AXB 1 -10 1.25 1 13 1.00 1 06 <1 
CXB 1 .03 <1 1 .02 <1 1 10 <1 
CXA 1 .28 C Eoia RANE | .02 <1 1 -18 3.12% 
AXBXC 1 15 1.88 1 61 4,09"* 1 1.00 <i 
Error 191 -08 102 .13 81 .25 
> «10. 
T 


than differentiators (p < .05 for Duncan’s 
MRT). 


Emotional Effects of the Independent Vari- 
ables 


The degree of frustration experienced by 
differentially motivated subjects following ex- 
posure to a persuasive communication of 
varying degrees of ambiguity was most di- 
rectly tapped by a 7-point questionnaire item 
involving a rating of how exasperating the 
task of reading the article was. 

Table 5 shows that for the total sample 
and for the males, subjects in the high stimu- 
lus disorganization condition tended to be 
more “exasperated” than those in the low 
stimulus disorganization condition (F = 3.50, 
p< .10 and F = 5.50, p < .05 for the total 
sample and the males, respectively). More 
important, however, was the significant triple 
interaction obtained for males on this item 
(F = 4.69, p <.05). Table 6 presents the 


TABLE 6 


Mean RATINGS or “ExAspERATION” BY Mates 
IN DIFFERENT EXPERIMENTAL CONDITIONS 


Stimulus disorganization 


High Low 
High need clarity 
1. Differentiators | 2.00 (N = 12) | 2.33 (N = 14) 
2. Assimilators 2.81 (N = 16) | 1.80 (V = 12) 
Low need clarity 
3. Differentiators | 2.90 (N = 14) | 1.60 (N = 11) 
4. Assimilators 2.40 (N = 15) | 2.00 (N = 16) 


q 
distribution of means for males. A comparison |) 
of cells by means of Duncan’s MRT revealed | 
that while increasing the level of stimulus dis- 
organization resulted in a marked increment . 
in frustration for the assimilators only under | 
high need (Table 6, Row 2, high versus low, 
$ < .10), it resulted in a marked increment 
in frustration for differentiators only under 
low need (Table 6, Row 3, high versus‘low, | 
Pp < .05). The results for the females on this 
item failed to yield any statistically signifi- 
cant results. 


Discussion 


Although the present study employed both 
male and female subjects, support for the hy- 
potheses is provided almost exclusively by the 
results for males. The data suggest two fac- 
tors that may account for the lack of the pre- 
dicted effects in the female subjects. First 
the manipulation of stimulus disorganization 
was not successful with females. Second, the 
females showed a relative lack of dispersion 
on the assimilating-differentiating measure: 
These findings suggest that the females may, 
have been more “source” than “content” of | 
ented; more concerned about presenting 4 fa) 
vorable image to a male experimenter that 
objectively evaluating themselves or the con- i 
tent of a persuasive communication. Further 
more, advertising may represent an attitude 


18 Pettigrew (1958) found that males had a large $ 
standard deviation than females on a measure K 
another information processing strategy, equivalent 
range. 


ATTITUDE CHANGE 


topic that is differentially involving for males 
_-and females. 


Results for Males 


The results on immediate attitude change 
* of males support the major derivation from 
' the present cognitive model. That is, scores 
on immediate attitude change reflect a signifi- 
cant interaction effect of the need for cogni- 
tive clarity, assimilating-differentiating strate- 
' gies, and stimulus disorganization. As pre- 
. dicted, among subjects with a strong need for 
cognitive clarity, assimilators, but not dif- 
ferentiators, showed less change in the direc- 
tion of the communication as it became more 
ambiguous. 

The assumptions about corresponding dif- 
ferences in the amount of frustration experi- 
enced by subjects in the various conditions 
were tested. As hypothesized, a significant 

` triple interaction was obtained on the sub- 
jects’ ratings of how “exasperated” they were 
during the experimental task. Assimilators 
with a strong need for cognitive clarity tended 
to become more “exasperated” with increas- 
ing stimulus disorganization, while differentia- 
. tors did not manifest an increase in frustra- 
tion under comparable conditions. It had also 
` been expected that there would be no consist- 
ent difference in “exasperation” under the low 
arousal condition. Contrary to this expecta- 
tion, it was found that among persons with a 
„Weak need for cognitive clarity, only differ- 
entiators experienced a significant increment 
in “exasperation” when the degree of stimu- 
__ lus disorganization increased. This latter find- 
ing becomes understandable if it is assumed 
that differentiators because of a greater sen- 
Sitivity to variations in stimulus structure be- 
Come more annoyed by ambiguous stimuli 
than assimilators in a situation where the goal 
a Structure does not clearly require a toleration 
of such stimuli, 
gi When the frustration means are compared 
With the attitude change means for the com- 
ites Conditions, a strong, if complex, re- 
6 ti, ip between frustration and attitude 
cae Is Suggested. Thus, under high need 
Es al, assimilators, when faced with an am- 
eae as opposed to a relatively well-struc- 
fie „communication, manifested an incre- 
nt in frustration and a decrement in atti- 
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tude change. High need differentiators, on the 
other hand, were unaffected by increasing am- 
biguity both in their emotional and attitudinal 
reactions. These reciprocal responses to am- 
biguity suggest that variations in emotional 
tension play an important mediating role in 
the process of attitude change. 

On the delayed measure of attitude change 
the hypothesized interaction between need 
clarity and cognitive style occurred. When the 
need for cognitive clarity was strong, differ- 
entiators showed significantly more positive 
attitude change than did assimilators. Under 
low need arousal assimilators changed signifi- 
cantly more than differentiators. A possible 
explanation may be that when the prepotent 
motivation of differentiators and assimilators 
is not the need for cognitive clarity, their be- 
havior becomes determined by other strong 
motives that are naturally aroused in any in- 
fluence situation (e.g., the need for autonomy, 
the need for social approval, etc.). Thus, as- 
similators and differentiators may differ in 
the strength of their desire to either please the 
experimenter or assert their autonomy. This 
difference may be masked in the immediate 
attitude change condition where the need for 
cognitive clarity is strongly aroused. In any 
case, further research is required before the 
basis of the interaction is clear. 


Alternate Interpretations of the Male Results 


A possible alternative interpretation of the 
present results is suggested by Hovland, 
Janis, and Kelley’s (1953) modified learning 
theory interpretation of the effects of per- 
suasive communications on attitudes. Accord- 
ing to this kind of view, differences in attitude 
change may, among other things, be a func- 
tion of differences in the ability to learn the 
relevant content of the persuasive communi- 
cation: the better the learning, the more the 
attitude change. To test this interpretation 
of the present results, -test comparisons of 
amount of learning were made between the 
levels of each of the independent variables. 
It was found that there were no significant 
differences in amount of learning between dif- 
ferentiators and assimilators, nor between 
subjects under high and low stimulus disor- 
ganization. Although there was a tendency for 
subjects under high need clarity to show bet- 
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ter learning than low need subjects, this ef- 
fect was not accompanied by a significant 
difference in attitude change between high 
need subjects and low need subjects. Further- 
more, the average correlation for all eight 
cells of the design between amount of attitude 
change and amount of learning was not sig- 
nificant. 

In addition to the possible differences in 
learning discussed above it is possible that 
the instructions used to manipulate the need 
for cognitive clarity simultaneously varied 
other important factors involving the sub- 
jects’ attitudes toward the communication. 
For example, it may be argued that the per- 
suasive intent of the article is more disguised 
in the low need clarity condition, thereby pre- 
disposing a less critical reaction to the dis- 
organized message, than in the high need 
clarity condition. If the above line of reason- 
ing is correct we would expect that within the 
high stimulus disorganization condition there 
will be greater positive attitude change under 
low than high need arousal. A look at the 
relevant data reveals that this is only true 
for assimilators. Differentiators tend to show 
more change in the induced direction under 
high rather than low need arousal. Further- 
more, low need clarity differentiators tend to 
be more annoyed with the disorganized mes- 
sage than their high need counterparts, a find- 
ing clearly contrary to the “high need clarity 
suspicion” hypothesis. These findings are par- 
ticularly striking in light of the fact that 
Klein (1951, 1958) typically found sharpen- 
ers more suspicious and critical than levelers. 

Certain unclarities notwithstanding, the ex- 
perimental demonstration (at least for males) 
that the immediate impact of a persuasive 
communication can be regarded as a complex 
function of motivational, stylistic, and stimu- 
lus factors provides us with a more detailed 
view of the microstructure of the cognitive 
basis of attitude change than had hitherto 
been accomplished with a functional approach. 
It is hoped that this approach may offer a 
rather useful supplement to the consistency 
type of cognitive model, The time is long 
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overdue for us to recognize that the cognitive 
basis of attitude change may involve some- 
thing more complex than an automatic bal- 
ancing out of forces within a relatively closed 
cognitive system. 
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COGNITIVE REORGANIZATION FOLLOWING 
E A DISCONFIRMED EXPECTANCY: 
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The present study tested some derivations from dissonance theory regarding the 
cognitive effects of undergoing an unpleasant experience to prepare oneself for 
( an unpleasant treatment only to discover that it would not be necessary to re- 
a ceive that treatment. Pleasantness of preparation for the expected treatment and 
choice in undergoing this preparation were experimentally varied. The different 
s conditions of pleasantness failed, in general, to produce the cognitive changes 


expected. Choice consistently affected the dependent variables in a direction op- 


posite to that predicted, so that Ss having a high degree of choice in undergoing 
the preparation thought the experiment was less valuable (p < 05), were less 
willing to recruit other Ss (p<.01), and perceived the preparation as some- 
what more unpleasant (.10 < p < .20) than their counterparts who were given 


no choice in the matter. 


The present study was designed to investi- 

gate the cognitive effects of the disconfirma- 
tion of an unpleasant expectancy for which 
preparatory action had been taken. One would 
expect a person in such circumstances to feel 

a certain amount of discomfort upon learning 
that the event for which he had been prepar- 
ing would not occur, in spite of its undesir- 

_ able nature, if one reasoned from Festinger’s 
(1957) theory of cognitive dissonance. Ac- 
cording to this theory, the disconfirmation 
would be dissonance arousing due to the si- 
tultaneous cognitions of having prepared for 
the event, both psychologically and through 

_ Preparatory action, and the new cognition 
that it would not occur. Indeed, previous re- 
Seatch on expectancy has demonstrated that 
Subjects will strive to avoid disconfirmation 
(Aronson & Carlsmith, 1962; Aronson, Carl- 
_ Smith, & Darley, 1963), or if disconfirmation 
does occur will experience a negative affect 
(Carlsmith & Aronson, 1963) even when the 
expectation is of an unpleasant event. Al- 

ı though these experiments did not manipulate 
ot even allow preparation for the expectancy, 
‘other than perhaps some covert psychological 
Set, it is clear that if cognitive dissonance is 
ane Paper is based on a dissertation submitted 
Palen A of the requirements of the PhD 

M debte a als University. The author is greatly 
tacy Adams il i J. McGuire and also ee J. 
valuable Aaa Stanley Schachter for their 

a ce throughout the study. Thanks are 


So extend 
for her see Ann Haeberle of Brooklyn College 


securing subjects for the experiment. 


the process involved, preparation should mag- 
nify this effect. 

Furthermore, the amount of effort or un- 
pleasantness involved in the preparation 
should be directly related to dissonance 
arousal, This is clearly derived from Festing- 
er’s original formulation of the theory as it 
increases the ratio of dissonant to consonant 
elements. Although it has not been examined 
experimentally in this context, a number of 
studies report a positive relationship between 
effort or unpleasantness and dissonance 
arousal in human subjects (Aronson, 1961; 
Aronson & Mills, 1959; Raven & Fishbein, 
1961; Yaryan & Festinger, 1961). Two field 
studies (Festinger, Riecken, & Schachter, 
1956; Hardyck & Braden, 1962) bear more 
directly upon this problem (expecting and 
preparing for an event that does not material- 
ize). The first reported indications of dis- 
sonance reduction (mass proselyting) while 
the latter did not. 

We would also expect the magnitude of 
dissonance aroused to be greater if the prep- 
aration is undertaken of the subject’s own 
volition. Although Festinger’s (1957) original 
statement of cognitive dissonance did not 
clearly specify the role of choice, much sub- 
sequent research has concentrated on it. These 
studies generally support the notion that the 
dissonance aroused by a behavior is positively 
related to the extent the person feels he per- 
formed the behavior of his own volition 
(Brock, 1962; Brock & Buss, 1962; Cohen, 
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Terry, & Jones, 1959; Davis & Jones, 1960). 

Dissonance theory, therefore, predicts a 
strong interaction effect between unpleasant- 
ness of preparation and choice in undergoing 
this preparation. The direction of this inter- 
action should be that choice has more of an 
effect in the unpleasant-preparatory condition 
with the greatest amount of dissonance being 
aroused in the high-choice, unpleasant-prepa- 
ration condition. Such interaction effects have 
been demonstrated between choice and magni- 
tude of shock a subject administered to a 
“victim” (Brock & Buss, 1962); choice and 
confrontation or meaning (Brock, 1962); and 
choice and discrepancy in opinion (Cohen et 
al., 1959). 

In the present study we attempted to bring 
together these various aspects of disconfir- 
mation by inducing the belief in all subjects 
that they would receive a painful electrical 
stimulation of the tongue during an experi- 
ment presumably studying the electrophysiol- 
ogy of taste. Preparation for this event was 
manipulated by providing a purported anes- 
thetic taken orally to mitigate the pain of the 

* electric shock. This “anesthetic,” pleasant 
tasting in one condition but quite unpleasant 
in the other, was made optional for half of 
the subjects and mandatory for the other half. 
The disconfirmation consisted of informing 
the subject, after having made this prepara- 
tion, that due to an unexpected event, he 
would not be able to complete the experiment 
so would not have his tongue painfully stimu- 
lated. 

We measured a number of possible modes 
of dissonance reduction that were available to 
the subject and which had been either dem- 
onstrated in previous research or selected on 
the basis of theoretical interest. These in- 
cluded changes in the perceived value of the 
experiment, willingness to recruit other sub- 
jects for the experiment, liking for the ex- 
perimenters, volunteering to complete the 
experiment at a later date, anticipation of 
painfulness and fear of the expected experi- 
ence, and the perceived taste of the anesthetic 
and choice in taking it (which can themselves 
reflect dissonance reduction as indicated by 
Brock & Buss, 1962). 

Each of these modes can readily be seen to 
provide the subjects a means of dissonance 
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reduction by supplying cognitions consonant 
with the unpleasant preparation they had | 
undergone. Positive changes in evaluation or. 
belief have been demonstrated as a function 
of pain endured (Raven & Fishbein, 1961) _ 
and deprivation (Brehm & Cohen, 1959), 

Recruiting other people as a means of dis- ` 
sonance reduction has been observed in a dif- 
ferent context (mass proselyting) by Fest- 
inger et al, (1956), Tangential evidence oi 
changes in liking of a person as a mode 0 


dissonance reduction can be found in Cohen’s ' 


study of Princeton students becoming engaged 
(see Brehm & Cohen, 1962, pp. 78 ff.). Vol- 
unteering to perform an unpleasant task that 
could be avoided, presumably to reduce the _ 
dissonance resulting from a disconfirmed ex- «i 
pectancy, has been demonstrated by Aronson 
et al. (1963). 

That a subject should increase his anticipa- . 
tion of pain and fear of the disconfirmed event 
may not be so obvious a mode of dissonance 
reduction as the others. Nevertheless, if after 
the realization that the event will not occur, 
a residual of fear persists, this is dissonant 
with the cognition that there is nothing to be 
afraid of (Festinger, 1957). The knowledge of ` 
having undergone an unpleasant preparation , 
of one’s own volition, due to fear of the elec- 
tric shock, provides further dissonant ele- 
ments. Convincing oneself that it would in- 
deed have been dreadful supplies cognitions 


consonant with the fear and preparation f 


taken, Festinger cites as anecdotal evidence of 
this phenomenon two studies of rumors (Pra 
sad, 1950; Sinha, 1952) and Murray’s (1933) ' 
study of the effect of a frightening experience 
on children’s ratings of photographs. 

Another theoretical reason, outside of the 
dissonance framework, for expecting a relation 
among the unpleasantness of the preparation 
and the experienced pain and fear lies "M 
Schachter’s work in the physiological and 
cognitive determinants of emotion. Schachte! 
(1959) suggested that emotion led to “eval 
uative needs,” after Festinger’s (1954) theory 
of social comparison, or pressures to under- 
stand and label bodily feelings. Subsequent | 
research by Schachter and Wheeler (1962) 
and Schachter and Singer (1962) demon- 
strated that inducing in an individual a state 
of physiological arousal for which he had 1° 
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immediate explanation led to seeking an ap- 
ropriate cognition in the situation by which 
’ to label it. There was strong indication that 
‘the vile taste of the unpleasant liquid in our 
' experiment caused an emotional arousal. To 
. the extent that this arousal was not clearly 
„differentiated from the general arousal result- 
ing from the frightening aspects of the experi- 
ment itself, it was ambiguous; and we suggest 
~ .that the subject will label this ambiguous re- 
‘sponse physical fear determined by situa- 
_ tional factors in the experiment and, herice- 
‘forth, show an increased anticipation of the 
painfulness and fear of the electric shock. 
Therefore, on this basis also, we might expect 
greater anticipation of pain and fear for sub- 
+ jects in the unpleasant-taste condition than 
’ those in the pleasant-taste condition. 
The subjects in this experiment can pre- 
sumably reduce their dissonance or justify 
their actions by any combination of these 
available modes; but it is not to be expected 
that each will receive equal usage or, indeed, 
any usage at all (Steiner & Johnson, 1964; 
Steiner & Rogers, 1963). Rather, it is ex- 
pected that there will prove to be preferred 
. Modes of dissonance reduction (Brock, 1962; 
Brock & Buss, 1962; Pilisuk, 1962). While 
this problem of the relative likelihood of 
usage of the various modes of dissonance re- 
duction is not the focus of the present study, 
we have endeavored to tap a wide variety of 
_ Such alternative modes as our dependent 
variables, 


A METHOD 
Initial Instructions to All Subjects 


‘The experiment was represented to the subjects as 

à study of the psychological factors underlying taste 
Pretererice! Upon entering the experimental room 
id Were told, in a standardized introduction, that 
Were examining this relationship by means of two 

S re: actually having the person taste 
Methods, iquids and through  electrophysiological 
ee Was explained that the latter was ideal for our 

J ee Since it had been found in previous re- 
. tongy that an electric stimulus applied to the 
some ¢ Produced artificial tastes which varied to 
iain from one person to another. Hence, the 

at Scho stimulus producing different tastes in dif- 
i look pos viduals provided the ideal situation to 

s: 3 
for the differences psychological factors accounting 


© subject was told that my associate in this 


` 
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research project, who was a physician, Dr. Bowman, 
would stimulate different areas of the subject’s 
tongue with electrodes to determine the taste sensa- 
tions that resulted, and these data would later be 
related to his response on a personality inventory. 
The subject’s attention was directed to an electrical 
apparatus on the table which purportedly produced 
a 14-volt direct current. He was assured that this 
was absolutely harmless. A physician’s coat and 
stethoscope were casually draped near the instru- 
ment along with a supply of tongue depressors. Pre- 
study showed that these props were quite effective 
in lending credence to the experimental induction. 

. The experimenter then remarked that although the 
electrical stimulation is absolutely harmless, un- 
fortunately, the tongue is such a sensitive organ that 
the stimulation is quite painful. Therefore, we pro- 
vide a local anesthetic which is simply rinsed around 
the tongue and somewhat desensitizes it. It does not 
completely eliminate the pain, but it does help to 
reduce it. At this point, the subject was asked to 
fill out a personal data sheet which included the 
premanipulation measures of the dependent variables. 


Manipulation of Choice and Pleasantness 


After completing the preexperimental question- 
naire, the subject was told that Dr. Bowman should 
be there at any moment and that, in the meantime, 
they could have the anesthetic, thus utilizing this 
time for it to take effect. It was explained that the 
anesthetic is taken orally in two small doses spaced 
about 1 minute apart. It is important to rinse it 
thoroughly around the tongue holding it in the 
mouth for some time before swallowing as it is this 
local contact which has the effect. Half of the sub- 
jects (unpleasant-preparation condition) were told 
the anesthetic itself had a rather unpleasant taste, 
and attention was again called to the fact that it 
did not completely eliminate the pain. These subjects 
were given a very bitter and extremely unpleasant 
tasting solution of iron and quinine. The other half 
of the subjects (pleasant-preparation condition) were 
told the solution had a rather pleasant taste and 
were given mint-flavored colored water. Subjects 
were assigned randomly to these two taste conditions. 

Choice in taking this preparatory “anesthetic” was 
manipulated by the following instructions in the 
choice condition: 


We provide this anesthetic because the electrical 
stimulation is quite painful, but it is entirely up to 
the individual whether he takes it or not. You are 
completely free either to take the anesthetic or to 
go directly ahead with the experiment—whichever 
you choose. Would you prefer the anesthetic or 
not? 


Actually, all but 1 of the 41 subjects assigned to 
this “choice” condition chose to take the anesthetic. 
The 1 who refused was dropped from the experi- 
ment. Subjects in the no-choice condition were given 
identical instructions up to the point of choice, Here 
they were simply given the anesthetic without any 
mention of their having a choice in the matter. 
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Subjects were assigned randomly, within each taste 
condition, to one or the other of the two choice 
conditions, 

Following the anesthetic, several minutes elapsed 
without the appearance of the physician, During this 
time the experimenter glanced at his watch in frus- 
tration and commented, “I’m surprised Dr. Bowman 
isn’t here yet—he is usually very punctual.” About 1 
minute later the experimenter said he would look in 
the Psychology Department office to see if Dr. Bow- 
man might be there and left the room for a brief 
period. He reentered and announced in a frustrated 
and agitated voice that Dr. Bowman had been un- 
expectedly called out on a sudden emergency and 
had left a message that he would not be back 
for approximately an hour-and-a-half, Therefore, it 
would be impossible to complete the experiment. The 
experimenter expressed his sincere apologies for hav- 
ing inconvenienced the subject and not being able to 
complete the experiment, but assured him that this 
completely fulfilled any commitment he may have 
felt from volunteering. If the subject spontaneously 
volunteered to return at a later date to complete 
the experiment, he was told that it probably would 
be impossible to use his data since he had already 
been through part of the experiment and this might 
affect his responses the second time, particularly on 
the attitude and belief items. We would, of course, 
be glad to have him come back just to satisfy his 
own curiosity. The experimenter later recorded those 
subjects who volunteered and persevered in their 
desire to return. 

Before leaving, the subject was asked to fill out 
another brief form which (he was told) subjects 
usually completed after going through the entire 
experiment. Although he was unable to complete the 
experiment we were asking him to fill it out for the 
sake of consistency and to provide us with some 
control data as well. The subject then filled out the 
second form of the questionnaire. The experimenter 
questioned the subject concerning possible suspicions 
about the experiment, explained the true purpose 
of the experiment, and asked him not to discuss it 
with anyone. The evidence indicates that the sub- 
jects complied with this request. 


Subjects and Design 


All subjects were recruited from introductory 
psychology courses at Brooklyn College. The total of 
80 subjects who participated in the experiment were 
predominantly freshmen and sophomores and con- 
sisted of 28 males and 52 females. Sixty-nine percent 
of the subjects had no previous experience in psy- 
chological experiments. Two subjects became sus- 
Picious in the course of the experiment, indicating 
disbelief about having their tongue stimulated. The 
data for these subjects were discarded. One subject 
in the high-choice, pleasant-taste condition claimed 
that pain did not bother him and chose not to take 
the anesthetic. He was excused at this point from 
completing the experiment. 

Since we wished to evaluate the effects of the two 
levels of taste of the anesthetic (pleasant-unpleasant) 
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and the two levels of choice in taking the anesthetic 
(high and low), as well as the interaction between 
these two variables, we employed a 2 X 2 factorial 
design. Before and after measures of the major de- 
pendent variables were obtained so that each subject 
served as his own control, thus achieving greater ` 
statistical sensitivity. 

Subjects were randomly assigned to the experi- 
mental conditions and “run” individually. The’ 
amount of time required for each subject to go | 
through the experiment was approximately 25 min- 
utes. 


Materials 


Two questionnaires were used to provide a check 
on the manipulations of the independent variables 
and to measure the dependent variables. The “before” 
questionnaire consisted of a series of questions de- 
signed to measure the subject’s anticipation of pain 
and fear of the electrical stimulation, his belief in } 
the value of psychological research, his willingness to f 
recruit other subjects to take part in the experiment | 
and, finally, his liking of the experimenters. The 
“after” questionnaire contained the same questions 
with rewording to past tense when appropriate. In 
addition, questions were asked concerning the effec- 
tiveness of the manipulation of the independent 
variables and perceived effectiveness of the “anes- 
thetic.” (All questions are included in the Results 
section according to the variable they were designed 
to measure.) Subjects responded to each of these 
questions on the basis of a 15-point scale labeled 
each third point. In all cases, 1 indicated a low score 
on the item and 15 a high score. The scores on each 
of the questions relating to a particular dependent 
variable were summed to provide an overall score 
for that variable. For all variables measured, both 
prior to and following the experimental manipula- 
tions, the statistical analyses were performed on the 
difference scores between these two measures. 

The liquids used in the experiment to create the i 
conditions of unpleasant or pleasant taste welt 
respectively, a tonic containing 60 minims of tincture 
citro-chloride of iron, 8-60 grains of strychnine, 
sulphate, and .25 gram of quinine hydrochloride pet 
fluid ounce and a solution of water flavored wi 
mint and colored with food dyes to be approximately 
the same color as the unpleasant solution. An elec- 
trical converter with electrodes, a white physicians | 
coat, a stethoscope, and a generous supply of tongue 
depressors were included in the “set design.” i 


RESULTS 
Effectiveness of the Manipulations y 


The effectiveness of manipulating pleasant | 
ness of taste of the preparatory solution W# 
checked by a postexperimental question which 
asked, “How unpleasant did the local anes- 
thetic taster” Subjects responded to this g 
question on a 15-point scale ranging fro" 
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“not at all unpleasant? at one extreme to 
“yery unpleasant” at the other extreme. The 
' results in Column 1 of Table 1 indicate that 
‘the manipulation was highly successful. The 
main effect for pleasantness of taste is signifi- 
+ cant well beyond the .001 level (F = 125.62).? 
+. The second independent variable was a 
manipulation of choice in taking the “anes- 
thetic.” Its effectiveness was evaluated by a 
postexperimental question which asked di- 
‘rectly “How much choice did you have in 
taking the local anesthetic?” Subjects re- 
“sponded on a 15-point scale with “no choice” 
at one end and “complete choice” at the other. 
It can be seen in Column 2 of Table 1 that 
this manipulation was also highly successful 


< with subjects clearly perceiving high choice or 


low choice in their respective conditions (F = 
155.65, p < .001). 
_ These data regarding the effectiveness of 
' the experimental manipulations also provide a 
test of one possible mode of dissonance reduc- 
tion; namely, minimizing in memory the 
unpleasantness of the liquid or the amount of 
choite. It can be predicted on the basis of 
dissonance theory that subjects taking the 
' very unpleasant “anesthetic” of their own 
choice should feel more dissonance and tend 
to reduce it, in one way, by “finding” the 
liquid less unpleasant than those subjects hav- 
ing no choice, consequently lessening their 
otherwise high dissonance. In the pleasant 
Condition, the choice manipulation should 
have little, if any, effect on perceived taste. 
Table 1 shows that not only do the data not 
 SUppoft this prediction, but there is a dis- 
Cernible trend in the opposite direction. In 
both the pleasant and unpleasant conditions 
the high-choice subjects actually find the 
liquid tastes more unpleasant than subjects in 
the low-choice condition. This main effect in 
the Opposite direction to the prediction from 
_“ssonance theory does not reach conven- 
a levels of statistical significance (F = 
ait, 10 <p < .20). 
© support was obtained for the interaction 
vig predicted by dissonance theory (F = 
oe ), namely, that the effect of the choice 
manipulation would be greater in the condi- 


2 es s 
S Si Statistical tests in this report are two-tailed 
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tion involving drinking the unpleasant liquid 
than in that involving the pleasant. liquid. 

The data on the choice manipulation also 
provide a check on another possible mode of 
dissonance reduction. To the extent that choice 
is critical in arousing cognitive dissonance, 
one means of dissonance reduction is per- 
ceiving that one really had little choice in 
undergoing the unpleasant preparation (Brock 
& Buss, 1962). The interaction effect involv- 
ing choice and taste is in the predicted direc- 
tion (Column 2 of Table 1), that is, less 
difference in perceived choice between high- 
and low-choice conditions in the unpleasant as 
compared with the pleasant-taste condition, 
but the magnitude of this interaction is very 
small (F = .88). 

Clearly, then, these analyses of the “check 
on the manipulations” data offer no support 
for the utilization of distorted perceptions of 
these manipulations of choice or unpleasant- 
ness as a mode of dissonance reduction. 


Effects of the Taste and Choice Manipulations ` 


Effects on subjects’ evaluation of the ex- 
periment. Assuming the subject is in a dis- 
sonant state after having needlessly taken the 
very unpleasant liquid, cognitive balance could 
be regained by overevaluating the worth of 
the experiment; convincing himself that the 
experiment is of great scientific value would 
help justify his previous actions. Value attrib- 
uted to the experiment was measured by three 
items: “How valuable do you think this par- 
ticular experiment is?” “Do you think that it 


TABLE 1 


Mean RATINGS OF UNPLEASANTNEsS OF TASTE OF 
AND CHOICE IN TAKING THE “ANESTHETIC” 


Dependent variables 


Experimental 
conditions Unpleasantness Choice in taking 
of taste the “anesthetic” 
a) 2 
Pleasant taste $ 
High choice 3.25 (n = 20) 13.85 
Low choice 50 (n = 20) 4.20 
Unpleasant taste 
High choice 11.75 (n = 20) 13.10 
Low choice 10.35 (n = 20) 4.80 


‘ote.—All nonmarginal cell entries in this table and following 
To are 20 unless otherwise specified. A different set of 20 sub- 
jects furnished the scores for each ex] mental condition. 

High carnes, indicate high perceived unpleasantness and 
high perceived choice. 
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TABLE 2 


INITIAL MEANS AND MEAN CHANGES FROM THIS 
INITIAL Position IN RATING OF VALUE OF 
THE EXPERIMENT FROM PRE- TO Post- 
EXPERIMENTAL MEASURES 


WILLIAM A. WATTS 


rows margin in Table 2, is significant at the 
.05 level and is the only significant effect in 
the table. The trend remains about the same 
in both the pleasant and unpleasant condi- . 


tions (F = .30) for the columns margin (F = 


Actual taste 
cig N Pleasant | Unpleasant! M 
taking the liqui 

High choice i 

Premanipulation 

mean p 30.00 29.60 | 29.80 

Mean change —0.85 —0.25 |—0.55 
Low choice 

Premanipulation 

mean 25.60 30.05 27.82 

Mean change +0.75 +0.90 | +0.82 
Overall 

Premanipulation 

mean 27.80 29.82 28.81 

Mean change —0.05 +0.32 |+0.14 


Note.—Cell entries based on composite of three items. 


would be a good idea to require all students in 
the social sciences to serve as subjects in at 
least one experiment?” “How rewarding an 
experience do you find it to serve as a subject 
in an experiment?” è The scores presented in 
the cells of Table 2 are the mean differences 
of this composite between the before and after 
measures, 

The results, at least as far as choice is con- 
cerned, are contrary to the dissonance predic- 
tion, Instead of the presumably dissonant 
subjects in the high-choice condition enhanc- 
ing the value of the experiment, they tend to 
discredit it showing a mean decrease in value 
from the before-to-after measures. The sub- 
jects having no choice, on the other hand, 
show the increase that one would expect in 
the dissonant condition, This difference in 
trend between high-choice and low-choice, 


8 An intended fourth item was deleted post hoc on 
the basis of an item analysis. Two other questions 
included in the after-only measures completely failed 
to discriminate among experimental conditions, They 
are mentioned here for convenience since they were 
isolated questions not intended to measure any of 
the aforementioned dependent variables. One item 
asked: ~ 


Tf you were to make this choice in a future experi- 
ment whether to take the local anesthetic or to 
take your chances on the full painfulness of the 
electrical stimulation, how would you decide? 


the other: 


Did you feel that you had freedom to stop at any 
point in the experiment and not go on with it? 


-11 for interaction), { 
Effects on willingness to recruit other sub-. 
jects. A second mode of dissonance reduction | 
provided for the subject was an opportunity to 
recruit other subjects to take part in the ex- 
periment. Persuading others that the experi-’ 
ment is valuable and worth participating in | 
provides cognitions consonant with having | 
undergone the unpleasant preparation. Table 
3 presents the data on willingness to recruit, 
the entries in each cell being the mean change | 
from pre- to postexperimental measures based | 
on a composite of two questions. These ques- 
tions were: “Would you be willing to help us 
by recruiting three of your friends to take 
part in this experiment?” and “Would you be 
willing to write a strong convincing argument 
urging other students to take part in the ex- 
periment which we can use to help recruit 
subjects?” The significant difference in» row 
means (F = 7.88, p< .01) indicates that, | 
irrespective of the taste condition, subjects, 
having low choice in taking the liquid are | 
more willing to recruit other subjects than“ 
those in the high-choice condition. The trend, 
similar to that of evaluation of the experi- | 
ment, is contrary to what dissonance theory | 
would predict. In fact, the theoretically most 


TABLE 3 


INITIAL MEANS AND MEAN CHANGES FROM THIS — 
INITIAL POSITION IN WILLINGNESS TO RECRUIT 
OTHER SUBJECTS TO TAKE PART 
IN THE EXPERIMENT 


y a taste M 
Choice in Pleasant | Unpleasant 
taking the liquid 
_ e 
High choice j 
Premanipulation 45 | 
mean 18.90 20.00 | 19% 
Mean change —0.75 =1,35 10 i 
Low choice 
Premanipulation i o 
mean 18.45 | 18.75 | 180 
Mean change —0.30 +0.10 —0. 
Overall N 
Premanipulation o 3 
mean 18.68 | 19.38 | 1% 
Mean change —0.52 | —0.62 |—-0 


Note.—Cell entries based on composite of two items. 
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dissonant cell (unpleasant taste, high choice) 
shows the greatest decrease in willingness to 
‘recruit. There may be some statistical regres- 
sion effect underlying the large decline in this 
‘cell as the initial willingness score happened 
‘to be slightly higher (see Table 3) in this con- 
dition than in the other three cells. However, 
_ such an artifact could hardly have been an 
important factor since the prescore in this cell 
` did not differ significantly (t = 1.02) from the 
tombined mean of the other three cells, nor 
_ did the differences between the four cells ap- 
proach significance (F = .36, df = 3/76). 

Taste alone has no effect, the marginal 
means for pleasant and unpleasant conditions 
being virtually identical (F = .09), but it can 

be seen that the variable of choice is more 

potent in the unpleasant condition. This inter- 
action effect, which is exactly opposite in di- 
tection to the choice-taste interaction that 
‘dissonance theory predicts, falls short of con- 
ventional levels of statistical significance (F = 
2.18, 10 < p < .20). 

Descriptively, the trend for recruiting ap- 
Pearse quite similar to that of evaluation of 
the experiment. An analysis of the relationship 
between them on the intraindividual level 
yields a positive within-class correlation of 
r= 36, p< .01, between change scores in 
value of the experiment and willingness to 
recruit, This consistent relationship suggests 
that, in this particular case, willingness to 

i Tecruit others is a function of greater value 
attributed to the experiment. 

_ Effects on liking the experimenters. Sub- 

» Jects responded to three questions concerning 

how friendly and competent they found the 

experimenters to be. These were: “How 

‘ friendly and courteous do you find the experi- 

mnenters?” “How competent do you feel the 

experimenters are?” and “How pleasant and 

Sooperative are the experimenters?” These 

ata are presented in Table 4. While disso- 

| Rance theory would lead to the prediction that 

ae subjects experiencing more dissonance 

i fee show greater liking for the experi- 

a ers, no such trend occurred. Neither the 

tne nor the taste manipulations, nor their 

EA ad had any significant effect; all 

K a F's < 1.00. The only cell that shows 

nctease in liking for the experimenters is 

© Presumably nondissonant, pleasant-taste, 
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TABLE 4 


INITIAL MEANS AND MEAN CHANGES FROM THIS 
INITIAL Position IN LIKING FOR THE 


EXPERIMENTERS 
Actual taste 
A Ciera Pleasant | Unpleasant] M 

taking the liquid 
High choice 

Premanipulation 

mean 37.15 38.15 | 37.65 

Mean change +0.35 —0.45 |—0.05 
Low choice 

Premanipulation 

mean 36.85 38.55 | 37.70 

Mean change —0.60 —0.45 |—0.52 
Overall 

Premanipulation 

mean 37.00 38.35 | 37.68 

Mean change —0.12 —0.45 |—0.29 


Note.—Cell entries based on a composite of three items, 


high-choice condition, and this increase is 
quite small. 

Effects on volunteering to return and com- 
plete the experiment at a later time. One of 
the most interesting tests of dissonance theory 
was whether a larger proportion of the sub- 
jects in the high-dissonance cell (the high- 
choice, unpleasant-taste group) would volun- 
teer to return and go through the entire ex- 
periment at a later time. As would be expected 
by dissonance theory, a larger proportion of 
the subjects in the unpleasant-taste condition 
wanted to return and complete the painful 
experiment. However, this difference in per- 
centage (50 versus 65)* is significant only at 
the .27 level (Z = 1.10). The effects of choice 
(54% in the high-choice condition compared 
to 62% in the low-choice) and of the inter- 
action are even less significant. Hence, we 
find no evidence to confirm the prediction 
that the different conditions of preparation 
for the event contributed to the willingness to 
return, 

4 These percentages are based upon 52 subjects (13 
in each cell) rather than the total sample of 80 sub- 
jects due to a revision in classifying volunteers. Ini- 
tially, subjects were asked if they wished to return 
and complete the experiment at a later time. After 
a number of subjects had been “run,” it became 
apparent that such a preponderance of- subjects 
agreed to return (approximately 90%) so as to make 
comparisons extremely insensitive. Therefore, the 
more stringent requirement was introduced that sub= 
jects must themselves spontaneously volunteer and 
persevere in the desire even though their data would 
not be used. 
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Effects on anticipated pain, fear, and pre- 
sumed effectiveness of the “anesthetic.” These 
three dependent variables are discussed to- 
gether because of a complex interrelationship 
revealed in the analysis, It may be profitable, 
however, first to consider them separately be- 
fore turning to the complexity of the interrela- 
tionships. 

It was predicted that subjects who had 
undergone the unpleasant preparation would 
exaggerate the painfulness of electric shock to 
a greater degree than those in the pleasant- 
preparation condition and that this effect 
would be more pronounced for those subjects 
taking the preparation of their own volition. 

Anticipated pain and fear were measured 
by five items, three of which pertained to 
anticipation of pain (“How painful do you 
expect the electrical stimulation to be?” “How 
painful have you found electric shock to be in 
the past—that is, accidental shock through 
faulty wiring, etc.?” and “How sensitive are 
you to painful experiences?”) and two items 
which were more directly related to fear of 
the electric shock (“How worried or fearful 
are you about receiving the electrical stimu- 
lation?” and “How dangerous do you think 
the use of electric shock is?”). It was as- 
sumed that anticipation of pain and fear 
would be highly correlated. The data in Table 


TABLE 5 


Sunyects’ RATINGS OF ANTICIPATED PAIN, FEAR OF THE ELECTRIC SHOCK, AND PERCEIVED 
EFFECTIVENESS OP THE “ANESTHETIC” ror EACH EXPERIMENTAL CONDITION 


5 do not follow this predicted pattern, There 
is an overall decrease in anticipated painful- 
ness (¢ = 2.99, p < .01) from the pre- to post- 
manipulation measures which may be due to 
an adaptation effect. The differences between 
experimental conditions as regards anticipated 
pain are trivial, the main effects of taste and 
choice yield F’s < 1.00 and the interaction F 
is only 1.31. f 
Fear of the coming shock, in contrast, shows 
a definite increase during the course of the 
experiment (¢= 3.83, p < .01); but, as 
the case of pain, the experimental conditions 
have no significant effect (see Table 5). The 
F ratio for taste is 2.28 (.10 < p < .20) whil 
for the effects of the choice manipulation and 
of the choice-taste interaction F’s < 1.00. 
It can be seen in Table 5 that, in general, 
fear of the shock and anticipated painfulness 
of the shock are changing rather paradoxically 
in opposite directions during the course of the 
experiment, The unpleasant conditions which 
show the greatest increase in fear descrip: 
tively show a slightly greater decrease in pain. 
This interaction effect between the » meal , 
change scores from the pre- to postmanipula- 
tion measures was evaluated by an analysis of 
variance for independent and correlated data 
which uses the Residual ‘Treatments X Sub: 
ject’s interaction in the error term to test this 


Dependent variables 
_ 
Experimental conditions Anticipated pain Fear Bia 
= phoi = 2 
meee | ce |e | ae rat 
Pleasant taste 
High choice 23.25 1.25 12.40 +0.65 ei, 
Low choice 21.50 ~0.40 11.20 +0.65 Saa 
Unpleasant taste Wr n 
igh choice 20.70 —095 10.85 +1.45 11.94 
“ee (n = 16) 
Low choice 24.15 —1.30 12.15 +1.55 11.76 
(n =17) 


Notes.—Cell entries for anticipated pain are based on a composite of three items; those for fear and perceived effectiveness 


each based on a composite of two items. 
s Perceived effectiveness was measured only in the 


ental questionnaire. High score indicates high perceived eff 


ness. Reduced cell Ne in this column are due to the addition of an item after the experiment had commenced. 


action effect and the correlated pain-fear 
r (Lindquist, 1953). The resulting F = 
approaches significance (.05 < p < .10) 
sting that perhaps as subjects become 
re fearful they try to reduce their level of 
f by belittling the painfulness of the ex- 
ent, the converse of the original disso- 
prediction. The overall difference in 
an change scores for pain and fear is highly 
mificant (F = 32.35, p < .001), the antici- 
ed pain decreasing and the fear of shock 
reasing during the experiment. A within- 
s correlation between pain and fear of 
33 (p < .01) indicates that this negative 
tionship is due to the experimental treat- 
nts rather than being a personality charac- 
fistic, 
The subjects responded to two items in the 
texperimental questionnaire concerning 
Ow effective they thought the anesthetic to 
i “How effective do you think the local 
lesthetic was in reducing the painfulness of 
ihe electrical stimulation?” and “How de- 
ensitized or numb does your tongue feel from 
-anesthetic?” These data, presented in 
le 5, show a significant tendency (F = 
, b < .02) for subjects in the unpleasant 
Condition to think the anesthetic is more ef- 
ive than those subjects in the pleasant 
dition, Although this agrees with a disso- 
ce prediction, the absence of any choice 
t or interaction effect (both F’s < 1.00) 
ites an alternative interpretation: that the 
leasant solution tasted more like medicines 
ey had previously experienced. It may even 
that the bitter solution did make their 
les feel numb. 
This dependent variable was related to fear 
electric shock and anticipation of pain 
4n interesting manner. While the overall 
-class correlation between fear and ex- 
Cted effectiveness is a negligible —.14, an 
MSpection of the individual correlations within 


of the correlation reverses from positive 
two low-choice conditions to negative 
two high-choice conditions thus largely 
ling out each other. Since there was no 
eciable difference between the taste con- 
$, Within-class correlations between rat- 
of fear and perceived effectiveness were 
Nputed for the low-choice subjects (+.22) 
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and for the high-choice subjects (—.37), This 
difference between 7’s is significant at the .01 
level. The relationship between anticipated 
pain and effectiveness is somewhat less but 
exhibits a similar trend between choice con- 
ditions, +.25 for low-choice subjects and 
—.04 for those subjects given a high degree 
of choice in taking the anesthetic, the differ- 
ence between these two 7’s being significant 
at the .20 level. 

Our interpretation of these data is in terms 
of the more general attitude toward the ex- 
periment reflected in the subjects’ ratings of 
the experiment’s value and their willingness 
to recruit others, The fact that those subjects 
in the low-choice condition who were more 
fearful and/or expected the shock to be more 
painful showed a correspondingly stronger be- 
lief in the effectiveness of the anesthetic sug- 
gests a protective feeling congruent with their 
greater expressed confidence in the experi- 
ment’s value. In contrast, the high-choice sub- 
jects who showed more annoyance with the 
experiment and lower evaluation of it might 
well be inclined to think the anesthetic, like 
the experiment itself, was of little value lead- 
ing to the inverse relationships obtained. 


Discussion 


The data in the present study bear on three 
aspects of the person’s response to the infor- 
mation that an unpleasant event, for which he 
has been preparing, will not occur. The three 
aspects are: the effect of degree of unpleasant- 
ness of the preparation, the effect of choice in 
taking this preparation, and the interaction 
between these two, They will be discussed in 
turn to see how well the data fit the predic- 
tions derived from dissonance theory. 

Effects of unpleasantness of the prepara- 
tory action. Although the attempted manipu- 
lation of the pleasantness of the preparatory 
action was highly effective (p < .001), this 
variable did not prove to be very powerful in 
producing the cognitive changes predicted on 
the basis of dissonance theory. It may be that 
unpleasant tasting as the “anesthetic” was, it 
did not arouse much dissonance in view of the 
greater threat of unmitigated shock. 

Effects of choice in taking the preparatory 
action. The highly significant choice effect as 
regards value attributed to the experiment (p 
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< .05) and willingness to recruit other sub- 
jects ( < .01), as well as the less significant 
(.10 < p < .20) trend in perceived taste of 
the anesthetic, are opposite to the dissonance- 
theory prediction. The low-choice subjects 
think the experiment is more valuable, are 
more willing to recruit, and find the anesthetic 
less unpleasant. This consistent reversal, al- 
though perplexing, is not without precedent. 
Brehm and Cohen (1959) in a somewhat 
analogous fait-accompli experiment report 
just such a relationship in their low-relative- 
deprivation condition. Subjects who had high 
choice in participating in the task were less 
satisfied than those who had low choice. This 
reversal occurred in all five of the replications 
(p = .06 by sign test). These data and the 
present results in the same direction suggest 
that choice is more complex than most previ- 
ous studies conducted in the framework of 
dissonance theory would lead one to suspect; 
a fact recognized by Deutsch, Krauss, and 
Rosenau (1962) who report data indicating 
that not all decisions are dissonance arous- 
ing. Festinger and his co-workers have al- 
ready reformulated the theory (Festinger, 
1964) in terms of a brief period of post- 
decision regret during which time the dis- 
sonance is very salient and the person thinks 
that perhaps he did the wrong thing. Since 
our measures were taken shortly after the 
subjects learned that the preparation they 
chose to take was of no avail, it is not impos- 
sible that we caught our subjects in a moment 
of postdecision regret. This interpretation, 
however, is not compelling because the choice 
effect occurred in the pleasant as well as the 
unpleasant-preparation condition and it is 
not clear why the pleasant preparation should 
arouse any appreciable dissonance. 

There are several possible interpretations 
for the unexpected outcome, none of which 
deserve more than brief mention considering 
that they cannot be verified by the present 
data. One concerns the commitment effect of 
decision (Bennett, 1955; Lewin, 1951). If 
making a choice as to whether to take the 
“anesthetic,” which was an important part of 
this experiment, served to increase the sub- 
jects’ involvement with the experiment, or 
their commitment to finish it, we would ex- 
pect those subjects in the high-choice condi- 
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tion to feel more frustrated and annoyed with 
its abrupt and premature termination. ‘ 
The obtained effects of choice might also 
be explained by the somewhat greater degree 
of subject-experimenter interaction in the 
high-choice condition which may have re-' 
sulted in the subject feeling greater freedom 
to speak out and express his feelings. If the | 
general feeling was one of frustration and re- 
sentment, those subjects in the high-choice 
condition would then presumably feel more" 
liberty to air these feelings. The fact that | 
such unintended effects often vary concomi- 
tantly with the experimental treatments in 
psychological research has been emphasized 
by Orne (1962). i; 
The possibility cannot be ruled out that * 
being called upon to make a choice, particu- ` 
larly in an already stressful situation, is itself 
unpleasant. Hence, a subject might enhance 
the chosen alternative or in some other man- 
ner justify his decision yet like the experi- 
ment less than one who was arbitrarily as- 
signed the alternative and, henceforth, experi- 
ence it as less satisfying and worthwhiles 
It is also possible that the results obtained 
are due to “overmanipulation” or a com- 
pounding of unpleasant events. Being asked 
to make a choice in effect involves an added* — 
manipulation not experienced in the low- — 
choice condition and could conceivably have — 
resulted in this group rejecting the whole ex- 
periment at the time of disconfirmation. Í 
If the dominant feeling, at least among 
subjects in the high-choice condition, was one 
of frustration and annoyance, it might be, 
asked why this was not reflected in the ratings © 
of liking for the experimenters. It will be 
recalled that subjects believed two experi- 
menters to be involved and, of course, never 
saw the imaginary physician who was respon- 
sible for their frustration, Remarks made by. 
the subjects indicated that they were rating 
the experimenters only on the basis of the one’ 
experimenter they had seen who was in no 
way responsible for their predicament. In ‘ 
addition, although steps were taken to ensure | 
the subjects of autonomy in their ratings of 
the experimenters, it is likely they felt social ` 
Pressures against giving a derogatory rating 
to an experimenter sitting in the same room. 


Interaction between choice and unpleasant- 

_ ness, Dissonance theory predicted an inter- 

action effect of these two variables on the 

„various cognitive-change measures. None of 

these predicted interaction effects were found, 

“nor was any significant effect between these 
two variables found in any other direction. 
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The effect of the importance of deviant 


CONFORMING AND 


or conforming group members on S’s 


evaluation of these group members was investigated. The importance of other 


group members to the Ss was varied in 


terms of the interaction distance be- 


tween Ss and conforming and deviant group members. Interaction distance is 

considered as a description of certain aspects of spatial closeness and temporal ? 
length of interpersonal association. It was found that (a) attitudes toward a 

deviant group member become more unfavorable as interaction distance de- 

creases and (b) attitudes toward a conforming group member become more 

favorable as interaction distance decreases. 


A large number of writers have been con- 
cerned with the effect of social support or 
nonsupport on attitude formation and atti- 
tude change. Attitudes and attitude change 
have been shown to be a function of mem- 
bership and reference groups (e.g., Siegel & 
Siegel, 1957), group cohesion and control 
(e.g., Argyle, 1957; Festinger, Schachter, & 
Back, 1950), membership status and valua- 
tion (e.g, Kelley & Volkart, 1952; Sherif, 
1951), importance of communicators and 
communications (e.g., Schachter, 1951), 
power of the individual over the group or 
the group over the individual (e.g., Collins 
& Guetzkow, 1964), and other factors related 
to the individual in his social environment. 

A number of experimenters have varied 
group composition or group tasks and meas- 
ured resulting interpersonal attitudes, Schach- 
ter (1951) in particular, has been concerned 
with the rejection of deviants by other 
group members. Schachter finds that deviant 
group members are rejected when the group 
task is relevant or when group cohesiveness 
is high. In the present experiment we will 
focus on the importance of a deviant or a con- 
forming group member. Will a subject, who 
is himself supported by the group, reject a 
deviant group member more if that group 
member is important to the subject? A similar 
question may be posed for the subject’s atti- 
tude toward a conforming group member. 
Let us consider a subject who is supported 


1 Support by the Rutgers University Research 
Council is gratefully acknowledged. 
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by a conforming group member against a " 


deviant group member. Will this subject hold 
a more favorable attitude toward the con- 
forming group member if he is important,’ 
to the subject? This experiment represents an 
attempt to investigate these questions. 


METHOD 
Subject Population 


Subjects used for all parts of this experiment were , 
paid undergraduate volunteers at an Eastern uni- 
versity. They were drawn from male sophomore and , 
junior classes. 


Interaction Distance 


Interaction distance is defined as a major com- 
ponent of the importance of an interaction to a sub- 
ject, in terms of the temporal and spatial charac- 
teristics of that interaction.? Three descriptions of 
interaction situations were selected to represent an 
increasing series of interaction distances: ` 


Description 1. Another student just introduced to 
the subject as one of the roommates during the 
coming college year. 

Description 2. Another student taking part with 
the subject in a decision-making task lasting 2-3 
hours, sitting at the other end of a conference room, 
discussing issues only occasionally directly with the’ 
subject. 

Description 3. Another student sitting with two 
or three others of his age and sex in a remote area’ 
of the cafeteria of another college which the subject’ 
is visiting for a few minutes, 


Forty students from the same population from 
which experimental subjects were later drawn were 


asked to rate the: above-described situations on a : 


? The term “interaction distance” is used only as 
a working description of certain components of 
social communications. 


| 
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continuous rating scale. The instructions for this 
task read: rate these situations in terms of how 
important a person as described would be to you. 
The extreme points on the scale were labeled “very 
important” and “very unimportant.” For analysis, 
the scale was divided into 10 equal parts. A score 
of 1 represents the closest possible interaction; a 
score of 10 represents the most distant possible 
interaction. 

Description 1 (minimal interaction distance) re- 
ceived a mean score of 1.90 (SD = 1.14); Descrip- 
tion 2 (intermediate interaction distance), a score 


r of 5.45 (SD=1.28); and Description 3 (maximal 


interaction distance), a score of 8.88 (SD = 133). 
A significant difference (p < .01) was found between 
all pairs of means. These results justify the use of 
the interaction distance descriptions as representing 
interaction situations varying in space and time 
characteristics. 


Data Collection 


Undergraduate male volunteers, numbering 212, 
were asked to indicate their attitudes toward rock 
and roll. Attitudes were measured on six evaluative 
scales of the semantic differential (Osgood, Suci, & 
Tannenbaum, 1957). The scales used were good-bad, 
true-false, bright-dark, kind-cruel, fair-unfair, and 
nice-awful. A score of 1 was assigned to the most 
unfavorable position on each scale, and a score of 7 
was assigned to the most favorable position on each 
scale. Summed scores over six scales permitted a 
total score range from 6 (highly unfavorable) to 
42 (highly favorable). Sixty-two subjects were 
selected on the basis of favorable scores (range: 
36-42). All 62 subjects were placed into a modified 
triad situation. Each subject was taken into the 
experimental rooms individually. No other subjects 
were present during the experiment. A set of stereo- 
phonic earphones was placed on the head of each 
subject. Subjects were told by a voice heard over 
both stereo channels that they would hear statements 
about rock and roll made by two undergraduates of 
another Eastern university. First, however, they 
would be able to listen to the instructions given 
to those undergraduates, After a switching noise, the 
subjects then listened to instructions given to those 
who would present statements on rock and roll to 
them. These instructions emphasized that the state- 
ments should be as clear and concise as possible, 
since they would be played back to another under- 
graduate who would have somewhat different 
opinions, The instructions also indicated that the 


` Speakers would be informed of the success they 


Would have in persuading the listener to accept the 
Presented point of view. The speakers were then 
informed in which order they should present their 
respective points of view. 

The subjects then listened to two 3-minute state- 
ments, one containing eight basic reasons for reject- 
—__ 

$ Previous experimentation has shown that taped 
Voices can be used successfully to simulate a group 
atmosphere (Blake & Brehm, 1954). 
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ing rock and roll, the other containing eight basic 
reasons for liking rock and roll. Arguments included 
in the statement favoring rock and roll were: popular 
music has often become acceptable later, for instance 
the waltz; many other people like rock and roll; 
people get good exercise dancing to rock-and-roll 
music; many rock-and-roll melodies played in a 
different style are liked by people who do not like 
rock and roll itself; rock-and-roll dancing uses 
up energy which might otherwise be used in destruc- 
tive ways by some teen-agers; rock and roll permits 
young people, who are more sensitive to the needs 
of other young people than adults would be, to 
perform; the common interest in the music gives 
young people a feeling of belonging; rock and roll 
has recreated interest among young people in playing 
musical instruments themselves, for example, the 
guitar. 

Arguments included in the statement rejecting 
rock and roll were: rock and roll represents noise 
and not music—it will, like the boogie-woogie, soon 
be forgotten; the strong interest in rock and roll 
has kept young people from better and more profit- 
able interests; rock-and-roll performers give a bad 
example; people who really appreciate good art 
do not like rock and roll; on a number of occasions 
rock-and-roll music has resulted in riots after the 
show was over; the movements of dancers in rock 
and roll look unpleasant; teen-agers and college 
students often go to dance to rock and roll in 
places where the social influences are unpleasant; 
rock and roll has used many aspects of other popular 
music but has itself contributed comparatively 
little. In addition, each of the statements contained 
two further arguments supporting the view opposed 
by the speaker and a refutation of these arguments, 
(It appeared to the experimenter to be of value to 
give the impression that the persons arguing for and 
against rock and roll were somewhat informed about 
opposing viewpoints.) Arguments included in the 
statement favoring rock and roll were: (a) Some 
persons have said the rock-and-roll issue has sharply 
divided young people, However, it has served to 
unite many in a common interest. This is of greater 
value. (b) Rock and roll, according to some, is all 
the same, but in reality only the rhythm is compara- 
tively constant. Melody and interpretations vary 
widely, permitting much creativity. 

Arguments included in the statements rejecting 
rock and roll were: (a) Supposedly rock and roll 
permits young people to express themselves but in 
reality they are only conforming to another rigid 
code. (b) Some people say that rock-and-roll singers 
are artists in their own right, but in reality they 
only scream and make noise. 

All subjects heard the same statements made by 
the same voices. Taped statements were recorded 
by two persons of college age using typical language 
used by students in the university from which the 
subject population was drawn. Each of the two 
statements was presented to one ear only, each 
statement to a different ear. Order of presentation 
and presentation of first or second statement over 
right or left channel was varied at random. 
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TABLE 1 


EVALUATIVE ATTITUDES TOWARD CONFIRMING AND REFUTING GROUP MEMBERS AS A 
FUNCTION OF CHANGES IN INTERACTION DISTANCE 


Attitudes toward confirming and refuting group members 


Interaction distance 


Minimum Intermediate Maximum 
Confirmation Refutation Confirmation Refutation Confirmation Refutation 
Mean evaluative 
attitude score 37.27 11.06 32.47 15.10 28.19 20,02 
SD 5.16 3.78 4.50 3.86 4.90 5.49 


Comparison of interaction distances 


Minimum versus intermediate 


Intermediate versus maximum 


Minimum versus maximum 


Confirmation Refutation Confirmation Refutation Confirmation Refutation 
Difference be- 
tween means 4.80 4.04 4.28 4.92 9.08 8.96 
<.01 <.01 <.01 <.01 <.01 <.01 


After the presentation of the two voices, subjects 
were given seven semantic differential forms (same 
as used for selection) including, among others, six 
evaluative scales, They were then asked to rate the 
concept “rock and roll” and to rate the persons 
whose voices had been heard in certain imaginary 
situations varying in interaction distance. Subjects 
had to rate: 


1. Rock and roll. 

2. The person who argued against rock and roll 
as you would if he had just been introduced to 
you as one of your roommates during the coming 
college year. 

3. The person who argued against rock and roll 
as you would if he would take part with you and 
some others in a decision-making task, sitting on 
the other end of a conference table, discussing 
issues only occasionally directly with you. 

4. The person who argued against rock and roll 
as you would if you would see him sitting with 
two or three others of his age and sex in a remote 
area of the cafeteria in another college where 
you are visiting for a few minutes. 

5. The person who argued for rock and roll as 
you would if he had just been introduced to you 
as one of your roommates during the coming 
college year. 

6. The person who argued for rock and roll as 
you would if he would take part with you and 
some others in a decision-making task, sitting on 
the other end of a conference table, discussing 
issues only occasionally directly with you. 

7. The person who argued for rock and roll 
as you would if you would see him sitting with 

two or three others of his age and sex in a remote 


area of the cafeteria in another college where you 
are visiting for a few minutes. 


The order of presentation of semantic differential 
rating sheets 2 through 7 was varied at random. 


RESULTS AND DISCUSSION 


Interaction distance, as described, did result 
in differing perceptions of closeness of spatial 
and temporal association with others (see 
above). It was assumed that a large inter- 
action distance would represent lower levels 
of importance attached to a co-communicator 
and that a smaller interaction distance would 
represent a higher level of importance 
attached to the co-communicator. 

It appears necessary to analyze for poten- 
tial attitude change of the subject himself, 
which might have precluded any possible 
rejection of the deviant. The original ratings 
of rock and roll (used for subject selection) 
were compared with the ratings produced by 
the subjects after exposure to the arguments 
pro and con rock and roll. A ż test for cor- 
related samples indicated no differences be- 
tween the two ratings. We may conclude that 
attitude change in the direction of the 
opinions presented by the deviant did not 
occur. 

The results of the experiment are sum- 
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marized in Table 1. A ¢ test for correlated 

samples was used to arrive at the reported 
< significance levels. The experimental results 
_‘would lead us to conclude that the impor- 

tance of another group member to a subject 
. greatly influences the subject’s evaluative 
„attitudes toward him. 

In this experiment, the subject was exposed 
to both one deviant and one conforming group 
member. One may speculate that an increase 

rin the number of attitudinally conforming 
group members could result in yet gredter 
` «differences in evaluative attitudes toward 
deviant group members. For instance, Strick- 
land, Jones, and Smith (1960) have shown 
that group support increases unfavorable atti- 
, tudes toward deviant group members. The 
~ more unfavorable an attitude would be under 
minimal interaction distance conditions, the 
greater the score range within which differ- 
», ences in ratings of a deviant group member 
could occur with changes in interaction dis- 
tance. Lesser derogation of a deviant group 
member, on the other hand, as might be pro- 
‘duced in a social dyad, may reduce variance 
in the subject’s ratings of the deviant group 
member. 
` Similarly, attitudinal support without the 
» Presence of a deviant (as would occur in a 
dyad) may not result in as favorable ratings 
of the supporting group member. 

Rejection of the deviant group member as 
found in this experiment may be related 
to positions of cognitive balance theory 
(Festinger, 1957; Heider, 1958; Newcomb, 
1960; , Osgood & Tannenbaum, 1955). Gen- 

` erally, these theories propose that dissonance 
Or incongruence would occur when an im- 
balanced three-way relationship is established 
between two persons and a cognition. In 
this experiment, two persons (subject and 
„deviant group member) have a divergent 
Point of view about rock and roll. In the 
. minimal interaction distance condition the 
‘deviant group member should be important 
to the subject. Consequently, the three-way 
relationship is established. Dissonance should 
Occur and require reduction behavior. Since 
. dissonance reduction cannot occur by per- 
Suading the deviant to change his point of 
view (he is not available), and since dis- 
Sonance reduction did not occur through atti- 
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tude change in the subject (see above), the 
reduction should occur through rejection of 
the deviant. 

On the other hand, the maximal interaction 
distance situation should represent a lesser 
degree of importance of the deviant group 
member to the subject. The three-way rela- 
tionship is less complete, and less dissonance 
resolution would occur (if any). We can 
therefore expect less (if any) rejection of 
the deviant. 

The intermediate interaction distance situ- 
ation represents somewhat of a midposition 
in the range of communicator importance. 
The results show that some rejection of the 
deviant did occur, however, not as much as 
in the minimal interaction distance situation, 
and more than in the maximal interaction dis- 
tance situation. If dissonance reduction is in- 
volved in the differential rejection of a deviant 
due to his varying importance to the subject, 
then we may be able to speak of variations 
in the degree of dissonance aroused, Such 
variations in aroused dissonance would be due 
to differential magnitude of relations in the 
person-person-cognition relationships. 

Sixteen percent of the subjects maintained 
the same rating of the deviant group member 
or of the conforming group member for all 
three interaction distances. This group ap- 
pears highly different from all other subjects 
who produced minimal differences of 4 points 
on the summed scales between ratings. for at 
least two of the interaction distances, These 
subjects did not differ from the rest of the 
sample in their rock-and-roll attitudes as 
measured for subject selection. They do, 
however, show a strong but insignificant 
trend toward greater constancy of attitude 
toward rock and roll, if one compares pre- 
and postexperimental ratings. Potential effects 
of personality variables, such as dogmatism 
(Rokeach, 1960), rigidity (Gough & Sanford, 
1952), or simplicity of conceptual structure 
(Schroder, Driver, & Streufert, 1965) may 
account for this “lack of sensitivity” to 
changes in interaction distance. 
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A series of experiments was undertaken to test the conditions that affect the 
yielding of a powerful person to the dependence of his powerless partner. In 
accordance with prediction it was shown that a powerful person will yield 
more when the cost of yielding is low than when it is high. Contrary to initial 
expectations, in 3 separate studies, an interaction effect between sex of the S 
and extent of partner’s dependence on yielding was found. Females yielded 
more when their partner was highly dependent than when he was less de- 
pendent, while the reverse was true of males. The interaction effect was inter- 
preted as representing a difference between men and women in the kind of cues 
instigating conformity to the norm of social responsibility. 


It is a well-established assumption of social 
psychologists that to have higher power than 
one’s partner in an interaction is advan- 
tageous. The person of higher power can steer 
the course of the interaction in the direction 
that he desires and can be confident of re- 
ceiving a constant supply of good outcomes. 
Conversely the person of lower power experi- 
ences a rather lower level of outcomes. When- 
ever a conflict arises he has to submit to the 
wishes of his more powerful partner. Thus, 
he tends to be at a relative disadvantage in 
the interaction. 

Yet many observations in real life contest 
this assumption. A beggar often seems to be 
able to steer the course of an interaction 
with a rich man into channels that profit 
him and not his partner. And children, who 
are conventionally held to be less powerful 
than adults, frequently see conflicts with their 
parents resolved in their favor. Philanthropy, 
altruism, generosity, charity—such common 
words in the English language as these de- 
Scribe situations in which typically a power- 
ful person at some sacrifice to himself helps 
Someone else who is relatively weak. ” 

A formulation of Thibaut and Kelley 
(1959) suggests, however, that the power dif- 


, ferential may be apparent rather than real. 
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They define Person A’s power over Person B 
(and Person B’s dependence upon Person A) 
as the range of outcomes through which A 
can move B, where an outcome is defined as 
an affective experience of a certain utility. 
In a dyadic relationship one person’s degree 
of power defines the extent of his partner’s 
dependence. 

Using this formulation, let us consider 
again the case of the rich man and the beggar. 
The rich man can presumably impose a wide 
variation of outcomes upon the beggar—he 
can throw him a purse of gold or he can 
have him arrested for vagrancy. The range 
of outcomes that the beggar can directly 
administer to the rich man is much narrower 
—he can thank him and flatter him or he can 
curse him. Apparently, therefore, the power 
of the rich man is vastly superior to that of 
the beggar; yet a beggar frequently has his 
request for money granted. One way of ex- 
plaining such a successful strategy is to 
consider the request as a cue for arousing 
the pervasive norm of social responsibility. 
Berkowitz and Daniels (1963, 1964) and 
Daniels and Berkowitz (1963) have recently 
conducted a series of experiments on depend- 
ence in which they make this interpretation. 

Thibaut and Kelley have suggested that a 
norm can be seen as having power. In this 
instance the social responsibility norm has 
power over the rich man because its existence 
creates a range of outcomes through which 
he can be moved. Certain of his behaviors, 
such as refusing the beggar’s request, would 
be against the norm. They would incur the 
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costs associated with deviating from an im- 
portant norm, in such forms as experiencing 
feelings of guilt or anxiety. Other behaviors, 
such as acceding to the beggar’s request, 
would be normative and would be rewarded 
by feelings of well-being. Thus, the existence 
of the norm adds another outcome dimension, 
other than that of money, which will influ- 
ence the rich man’s behavior. In one sense, 
the beggar, by arousing the norm, forms in 
effect a coalition with it, and the power that 
it can exert over the rich man is added to 
the meager power that the beggar wields 
directly. 

It is important to note that in this hypo- 
thetical example of a successful request it is 
impossible to separate the contributions made 
by the norm of social responsibility from 
those of the beggar’s power, however small it 
might be. For instance, the rich man may 
accede to the beggar’s plea out of fear that 
refusal will lead to an unpleasant public 
vilification. Insofar as the beggar can poten- 
tially affect the outcomes of the rich man, a 
range of outcomes exists through which he 
can be moved by the beggar’s behavior. The 
extent of this range defines the beggar’s de- 
gree of power, In order to study the effects 
of dependence as an influence strategy it is 
critical to block other sources of potential 
power by rendering the dependent person 
completely powerless to affect directly the 
outcomes of his partner, 

A series of experiments was undertaken on 
the conditions that affect the yielding of a 
powerful person to the influence of his de- 
pendent and powerless partner. A first step 
in this direction was an attempt to demon- 
strate that amount of yielding to a partner’s 
request would be positively associated with 


TABLE 1 


PERCENTAGE OF SUBJECTS WHO YIELDED TO 
PETITIONER, DIVIDED ACCORDING TO SEX 
OF THE SUBJECT AND DEPENDENCE OF 
PETITIONER 


Males Females 
Dependence 
Percentage | TSt#! | Percentage | Total 
Small 7 24 25 24 
Large 50 20 40 30 
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the degree of dependence of the partner. This 
effect had been reported by Berkowitz and 
Daniels, who defined A as dependent upon B 
to the extent that B mediated A’s goals. We 
used a broader definition of A’s dependence 
upon B as the range of outcomes through 
which B can move A. 


EXPERIMENT I 


The first attempt was a simple experiment 
involving a psychology class and a sociology 
class during a June summer-school session. 
The students in these classes were handed a 
written request stemming ostensibly from one 
George Gruber, psychology PhD candidate. It 
informed them that they would have to be 
subjects in an experiment that was to form 
part of his doctoral dissertation. This experi- 
ment was to have an experimental condition, 
in which subjects were to spend 30 minutes 
in a chamber at 125 degrees Fahrenheit, and 
a control condition, in which subjects were to 
spend the same period in a pleasant tempera- 
ture of 75 degrees. Gruber asked “as many 
students as possible” to volunteer for the 
experimental condition. Half the students 
in the class received a request in which 
Gruber explained that he was desperate to 
finish his dissertation before August so that 
he could take up a position that had been 
offered him: this was the large-dependence 
condition. In the small-dependence condition, 
Gruber’s request stated that he was under 
no time pressure—he had a whole year in 
which to finish.his dissertation. 

Our prediction was simply that a greater 
proportion of people would comply with his 
request by volunteering for the 125-degree 
Fahrenheit chamber in the large-dependence 
condition than in the small. Our reasoning 
was that the students’ perception of Gruber’s 
dependence would serve as a cue to elicit the 
norm of social responsibility. The overall 
results in fact showed no such trend, but when 
broken down according to the sex of the 
subject a trend did emerge: females pro- 
duced the predicted behavior, while males 
showed a precise reversal, yielding to Gruber’s 
request more when his dependence upon 
them was small than when it was large. The 
percentage of subjects who yielded by agree- 
ing to suffer in the high heat, divided ac- 
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cording to sex of the subjects and dependence 
of Gruber the petitioner, is presented in Table 
1. The chi-square test for multiple classi- 
fication designs (Winer, 1962) showed a sig- 
nificant main effect for sex (p < .01)—males 
yielding more than females. The Sex x De- 
pendence interaction resulted in a x’ of 2,97 
(p < .10). 

The main effect for sex is of no particular 
interest. It indicates that males, in compari- 
son to females, are more likely to volunteer 
for an unpleasant experience regardless of 
the petitioners amount of dependence. It 
may be that in this instance the experience 
was more often seen as a challenge by the 
males. The trend for a Sex X Dependence 
interaction effect, however, is independent of 
such a task-linked interpretation. More males 
conform to a low-dependent petitioner than to 
one who is highly dependent whereas more 
females conform to a high-dependent peti- 
tioner than a low-dependent one. 

Our next step was to see whether the trend 
for an interaction effect found in this simple 
experiment would hold up under more con- 
trolled experimental manipulations, including 
the variable of cost to the subject of yielding. 
The prediction was that as the cost of 
yielding increased, the powerful person would 
be less likely to yield regardless of the 
partner’s dependence. 


EXPERIMENT II 
Procedure 


Subjects were recruited from an introductory psy- 
chology course in which participation in research 
was part of the course requirement. Forty-eight 
women and 40 men undergraduates were used in the 
experiment. They came in groups of four; each 
group was composed of subjects of the same sex. 
On arrival each subject was placed in a cubicle from 
which it was practically impossible to see any of 
the other subjects. The experimenter then read aloud 
a set of instructions. He told the subjects that 
they were to be formed into pairs to take part in a 


“decision-making task.” Each pair would consist of 


a supervisor and a subordinate. These roles were to 
be assigned by a purely chance process. At the 
eee 

3 Occasionally one or two subjects would fail to 
arrive at the appointed time. On these occasions 
Confederates of the experimenter would take the 
Place of the missing subjects, thus giving the illusion 
to the subjects present of a genuine four-person 
group. 


beginning of each trial the subordinate would write 
a message to the supervisor, expressing his prefer- 
ence for the alternative he would like to have 
selected. In fact, all subjects were placed in the 
supervisor’s role, and the messages they received 
had been prepared beforehand by the experimenter. 
The experimenter would deliver this message, and the 
supervisor would then make a decision, which set 
the outcome both he and his partner received for 
that trial. This procedure would be repeated 26 
times, after which all subjects would be asked to 
fill out a questionnaire; the subordinates would then 
be able to leave the experiment, but the supervisors 
would be asked to remain behind for a few minutes, 

Elaborate precautions were taken by the experi- 
menter to give the subject the feeling that he 
would never be able to identify his partner after 
the experiment. These precautions were necessary in 
order to minimize the subjects’ perception of any 
potential counterpower their partners might possess, 
in such forms as possible disapproval, recriminations, 
side-payments, etc. They included: scheduling the 
four subjects from two different courses; keeping 
the subjects separated while they were waiting for 
the experiment to begin; telling the subjects that 
they were not to speak out loud during the experi- 
ment; telling the subjects that they were to be 
paired by a chance process, thus making it equally 
probable that any of the other subjects would be 
one’s partner; and instructing subordinates to leave 
the room before the supervisors. 


Independent Variable Manipulation 


Two independent variables were manipulated. One 
of these was the cost to the subject of being influ- 
enced by his partner, while the second variable was 
the extent of the partner’s dependence. 

High and low cost-of-yielding conditions were 
created by the information the subjects received at 
the beginning of each trial. At the onset of the 
experiment the experimenter placed a booklet on each 
subject’s desk. On each trial the subject began by 
tearing off the top sheet from the booklet. The sheet 
had four boxes portrayed on it, each box containing 
some numbers, and the subject was instructed to 
select one of these boxes, The experimenter an- 
nounced that at the end of each trial he would draw 
1 lucky number from an urn containing 16 numbers 
(1-16). If the number drawn was included in the 
numbers contained in the box the subject had 
chosen, the subject would win $1, but he would not 
be told how many times he had won until the end 
of the experiment. No number appeared more than 
once in any one box, and the numbers used always 
ranged from 1 to 16. Therefore, the chance of 
winning $1 with any one box was equal to the 
number of numbers in that box out of 16. The 
subjects in the high-cost treatment found that in 18 
of the 26 trials, their four boxes contained 14 
numbers, 10 numbers, 6 numbers, and 2 numbers, 
The subjects in the low-cost treatment found that 
in 18 of the 26 trials, their four boxes contained 10 
numbers, 9 numbers, 7 numbers, and 6 numbers. 
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Cost was therefore defined by the size of the decre- 
ment of the probability of winning from each of the 
four alternatives. The alternatives in the high-cost 
condition represented a greater proportional loss 
than those in the low-cost condition. The average 
probability of winning from the four alternatives 
was equal for both conditions and was set at 5. 

Through the message the experimenter delivered 
to the subjects, ostensibly from their partner, he 
was able to vary the partner’s degree of dependence 
on the subject. The partners reported to the sub- 
jects how much they stood to lose or gain from each 
of the four alternatives. In the high-dependence 
condition the partner could win $1, win $.50, lose 
$.50, or lose $1, depending on the choice the subject 
made. In the low-partner dependence treatment the 
sums were +$.10, +$.05, —$.05, and —$.10. The 
subjects were also told that in fact no money would 
be won or lost during the experiment, but they 
were asked to act as if real money were involved. 

The combination of two values of cost of yielding 
and two values of partner’s dependence, with male 
and female subjects produced a 2 X 2X 2 design. 
In each of the conditions the subject found on 18 
trials that there was a total “noncorrespondence of 
outcomes” with the partner. That is, partner’s most 
preferred alternative was the subject’s least pre- 
ferred, the partner’s second most preferred was the 
subject’s second least preferred, and so on for all 
four alternatives. These were the “critical trials” in 
which the subject's responses to partner’s depend- 
ence was assessed. On the remaining 8 trials there 
was a total “correspondence of outcomes” with 2 
trials being devoted to each of the four possible 
combinations. These trials were used to check on 
whether the subject understood the task. Because on 
these trials the subject could gain the best chances 
for himself, while at the same time optimizing his 
partner’s outcomes, there was no rational reason 
why the best choice for both should not have been 
made. If a subject deviated from the strategy in 
more than 2 trials it was taken as prima facie 
evidence that he did not understand the instructions, 
Five females and four males were discarded on the 
basis of this criterion, 


TABLE 2 


ANALYSIS Or VARIANCE OF TOTAL-YTELDING SCORES 
FOR EXPERIMENT IT 


Source af SS F 
Cost of yielding (A) 1 1,306 | 6.13** 
Partner’s dependence (B) 1 4 — 
Sex of subject (C) 1 7 — 
AXB 1 12 — 
AXC 1 119 = 
BXC 1 220 = 
AXBXC 1 712 3.34* 
Within 80 | 17,037 
Total 87 | 191487 


The major dependent variable measure was the 
degree to which the subject “yielded” (i.e., selected 
an alternative that favored the subordinate) on the 
18 noncorrespondent trials. Because the alternatives 
on each trial were scaled to represent proportionally 
greater yielding, weights were assigned to the sub- 
ject’s choice. A particular response could be weighted 
from 0 to 3 depending on how much of a sacrifice 
it represented. The total-yielding score represented a 
summation of these weights over the 18 trials. In 
addition, after the last trial, each subject filled out 
some rating scales. These included ratings of his 
partner on the evaluative and potency dimensions 
of the semantic differential, estimates of his own and 
his subordinate’s financial outcomes, ratings of the 
amount of influence his subordinate had exerted on 
him, the degree of difficulty he had experienced in 
making decisions, and his subordinate’s competence. 


Results 


Two items from the postexperimental ques- 
tionnaire directly checked the success of the 
inductions of cost of yielding and of the sub- 
ordinate’s degree of dependence. The first item 
assessed the subjects’ perception of their own 
costs for yielding by asking them to recall 
“the number of trials where the alternatives 
ranged from a very good to a very poor chance 
of winning.” Their responses to this item were 
divided at the median number of trials esti- 
mated and were arranged according to the 
cost-of-yielding condition. The estimates of 
subjects in the high-cost condition should be 
above the median, while those in the low-cost 
condition should be below the median. The 
resultant chi-squares of 8.12 (p< .01) and 
8.39 (p < .01) for males and females, respec- 
tively, indicate that the cost-of-yielding ma- 
nipulation succeeded equally well for ‘both 
sexes. A second item asked the subjects to 
recall their partner’s dependence by recalling 
the number of trials where his gain or loss 
could have been very large. A similar chi- 
square analysis resulted in values of 3.64 ($ 
< .10) for males and 16.33 (p < .001) for 
females. Although the dependence manipula- 
tion was highly successful for females it was 
only marginally effective for males. 

A variance analysis was performed on the 
total-yielding scores. This analysis is sum- 
marized in Table 2. It revealed a main effect 
for the variable of cost of yielding (F of 6.13, 


*p <.07, two-tailed. 
** p < .02, two-tailed. 


‘df = 1/80, p< .02), thus confirming our 
hypothesis that a person whose outcomes are 
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Experiment IT 
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Experiment III 
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Fic. 1. Mean yielding scores in two experiments as a function of cost to the subject of yielding 


(CY), sex of the subject, and 
considerably reduced by yielding will yield 
less than a person whose outcomes suffer only 
a slight reduction. It also provides an incre- 
ment of support for the validity of the experi- 
mental situation, 

The only other effect shown to approach 
significance by the variance analysis was a 
triple interaction between cost of yielding, 
sex of subject, and subordinate’s dependence 
(F of 3.34, df = 1/80, p < .07). The means 
of the total-yielding scores are presented in 
Figure 1. (For convenience, the means of the 
total-yielding scores obtained in Experiment 
II, to be described below, also are presented 
in Figure 1.) The triple interaction can be 
interpreted as showing that sex of subject and 
the partner’s amount of dependence are much 
more potent variables in the low cost-of- 
yielding condition than in the high-cost con- 


` . dition, It should also be noted that in the low- 


cost condition the direction of the means is 
consistent with the one found in Experiment 
I. However, the significance level of the 
triple interaction was again marginal and the 
variance of the total-yielding scores in all 
conditions was large. It was decided to repeat 
the experiment, using only the low cost-of- 


his subordinate’s dependence. 


yielding condition, in conjunction with some 
minor changes in the procedure which were 
aimed at reducing the variance of the total- 
yielding scores. 

Before turning to the next experiment one 
other finding related to the meaning of yield- 
ing in this situation should be reported, Our 
original reasoning had assumed that the 
powerful person could be influenced to give 
up optimal outcomes by an awareness of his 
partner’s degree of dependence. An indication 
that yielding to a dependent partner was ex- 
perienced as submission to his influence was 
provided by the postexperimental rating scale 
asking how much influence the subordinate 
had exerted on the subject’s decisions. The 
means for this item paralleled those for the 
total-yielding scores; the triple interaction was 
significant beyond the .05 level. In addition, a 
strong, positive association was found between 
the subjects’ ratings of having been influenced 
and their amount of yielding. When subjects 
were categorized according to their position 
relative to the median on both measures the 
resultant chi-square was 29.78 (p< .001). 
Yielding was indeed experienced as an exer- 
tion of influence. 
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EXPERIMENT IIT 
Procedure 


Thirty males and 26 females served as subjects. 
Members of each sex were randomly assigned to 
either the small or the large partner’s dependence 
condition, All subjects were in a low cost-of-yielding 
condition, that is, on each trial the outcome values 
of the four alternatives were 6 numbers, 7 numbers, 
9 numbers, and 10 numbers. Subjects were again told 
that 1 of 16 numbers would be drawn after each 
trial. If the number was contained in the alternative 
they had selected they would win on that trial. In 
an effort to increase amount of yielding and thereby 
obtain a better approximation of a normal distribu- 
tion for the total-yielding scores, the amount of 
money to be won on a trial was raised to $2. 
Furthermore, the subjects were told that their sub- 
ordinates had already completed their part of the 
task and were no longer in the building, thereby 
further decreasing, in the perception of the subjects, 
the possibility of their subordinates exerting any 
counterpower. Finally, communication booths were 
constructed which assured isolation and permitted 
the use of tape-recorded instructions. Because the 
manipulation of partner’s dependence was accom- 
plished by varying the values the subordinate could 
receive, all subjects heard an identical set of instruc- 
tions. The essential format remained the same as in 
the previous study. 


Results 


The dependence manipulation was clearly 
successful this time. The median split on the 
item asking the subjects to recall their part- 
her’s range of outcomes resulted in a chi- 
square of 8.47 (p < .01) and 7.58 (p < .01) 
for males and females, respectively. The 
means of the total-yielding scores have been 
presented in Figure 1. They replicate those of 
the previous studies. The variance analysis, 
presented in Table 3, showed no main effect 
for either sex of the subject or the partner’s 
degree of dependence, but the Sex x Depend- 
ence interaction was significant (F of 4.32, 
df = 1/52, p < .05). The ratings of amount 


TABLE 3 


ANALYSIS OF VARIANCE OF TOTAL-YIELDING SCORES 
FOR EXPERIMENT IIL 


Source df SS F 
Partner’s dependence (A) 1 87 — 
Sex of subject (B) 1 370 2.72 
AXB 1 586 4.32* 
Within 52 7,050 
Total 55 8,093 


*p <.05, two-tailed. 


of partner’s influence again paralleled the 
means of the total-yielding scores, again at a 
significant level. 

The results of the other postexperimental 
ratings showed no consistent effects of the 
dependence manipulation. Although a number 
of expected sex differences were obtained 
(e.g., males rated their partners as being more 
potent than females) these did not contribute 
to understanding the interaction effect and 
will not be presented. 


DISCUSSION 


The Sex X Dependence interaction effect 
has now been demonstrated in three experi- 
ments, when the cost of yielding is not large. 
Its successive appearance in three independent 
samples appears sufficient to regard it as a 
genuine effect. How then can it be explained? 
We would begin by proposing that the extent 
of a partner’s dependence instigates the norm- 
conforming process differently in males and 
females. It will be recalled that our initial 
position was that a partner’s dependence 
would serve as a cue for arousing the norm ‘of 
social responsibility. However, the psycho- 
logical process of conforming to an internal- 
ized norm requires not only that the norm 
should be aroused but also that the conform- 
ing response should not be incompatible with 
an alternative behavior that has also been 
instigated. In taking this approach we follow 
Campbell’s (1961) general analysis of con- 
formity behavior as representing final “com- 
posites” which are reached by assigning 
weights to discrepant information arising from 
different sources, 

In our situation the powerful person (the 
subject) is placed in a conflict between op- 
timizing his own outcomes and helping his 
partner, because the critical trials represent a 
noncorrespondence arrangement of outcomes. 
Furthermore, great care was taken to insure 
that the subject made choices solely on the 
basis of the information about his own and 
his partner’s outcomes. He had absolute con- 
trol in assigning outcomes both to himself 
and to a partner who remained anonymous 
and who had no counterpower. The response 
selected by the subject must therefore repre- 
sent a weighted composite of the value placed 
on conforming both to the social responsibil- 


y 


ity norm and to the norm of optimizing own 
outcomes. As the partner’s dependence in- 
creases, the outcomes potentially available to 
him must become better, by the definition of 
“dependence, and at some point must surpass 
those available to the high power person. It is 
evident that such a possibility can itself be- 
come a cue for arousing the norm of optimiz- 
ing own outcomes. We would suggest that 


`> there is a difference between the sexes in the 


desirability of maintaining a favorable out- 
„ come ratio between the powerful person and 
His partner. As the partner’s potential out- 
comes approach the level experienced by the 
powerful person, the latter may feel that his 
status is threatened. Our results indicate that 
¿as the partner’s dependence increases males 
‘`'are more ready than females to react against 
the threatening partner by refusing to yield 
to him, 
' Although the above interpretation cannot 
be tested within the limits of this research, it 
is in keeping with a very common thesis com- 
paring different social roles adopted by men 
and women. Men tend to be oriented toward a 
struggle with their physical and social environ- 
ment. They, therefore, tend to be competitive 
and highly conscious of their own standing 
relative to others in the struggle for survival. 
Women, on the other hand, tend to be oriented 
to the needs of others. Their socialization is 
likely to include a greater emphasis on the 


, Value of enacting helping and nurturant be- 


L, 


haviors, This emphasis appears quite func- 
tional in light of such typical female roles as 
“mother” or “nurse.” The potentially better 
Outcomes of a highly dependent partner are 
more likely to arouse the competitive orienta- 
tion and concomitant norm of optimizing own 
Outcomes in men than in women. 
, It now remains to attempt to integrate our 
interpretation of the Sex X Dependence inter- 
action with other relevant studies. Aside from 
, the series of studies conducted under Berko- 
: Witz’ direction, only one other study could be 
found. Walker and Heyns (1962) in a non- 
+ technical summary of a series of studies focus- 
ing on conformity behavior report one experi- 
“Ment testing the amount of conformity ob- 
tained by a partner’s dependence. Subjects 
Were required to bring a friend (of the same 
Sex) to the experiment who was ostensibly to 
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be their experimental partner. The task con- 
fronting the subjects was to encode as many 
words as possible from a set of word lists. 
These anagrams were then to be given to 
their partner, who would attempt to unscram- 
ble them. The task was presented as a contest 
to see which subject, in a group of 10, could 
scramble the most words on each of six trials. 
The subjects were led to believe that their 
partners were in competition with each other. 
Their contest was to be won by the person 
who correctly unscrambled the largest per- 
centage of scrambled words received. After 
the fourth trial each subject received a fake 
message, supposedly from the friend, request- 
ing the subject to “please slow down.” Walker 
and Heyns report that female subjects yielded 
to this request, but that males did not. The 
authors see this finding as fitting 

into a general pattern of sex differences in need for 
Achievement and need for Affiliation. Our culture 
places a high value on independence in men, and 


they tend to receive early and consistent independ- 
ence training [p. 63]. 


It should be noted that their findings fit our 
interpretation as well. Males will not conform 
to the social responsibility norm when such 
conformity threatens their status, in this in- 
stance, decreasing their chances of winning in 
order to help a partner to win, Females, how- 
ever, respond to the partner’s dependence by 
conforming to the norm. 

The results obtained by Berkowitz and 
Daniels (1963, 1964) and Daniels and Berko- 
witz (1963) are also consistent with our in- 
terpretation. In their studies dependence was 
manipulated by whether or not the subject 
thought his “supervisor” would be affected by 
the amount of work the subject performed. 
Subjects were led to believe they were partici- 
pating in an experiment evaluating super- 
visor skills. The dependence of the “super- 
visor” on the subject was manipulated by two 
sets of instructions. High dependence was de- 
fined by the subject believing that his work 
productivity was an important factor in de- 
termining the supervisor’s chances of obtain- 
ing $5, while low-dependence subjects were 
told their work productivity was in no way 
related to the partner’s chances of obtaining 
$5. Subjects were required to build boxes after 
receiving instructions from their supervisor, 
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who was in fact an accomplice. Although there 
were minor variations in procedure, in each 
experiment the subjects in the high-depend- 
ence condition produced significantly more 
work than subjects in the low-dependence 
condition. The effect was obtained with male 
subjects (Berkowitz & Daniels, 1963; Daniels 
& Berkowitz, 1963) and female subjects 
(Berkowitz & Daniels, 1964). 

When these manipulations are compared to 
those in the present research some clear dif- 
ferences emerge. The cost of yielding to the 
subject appears to be much less in the Berko- 
witz and Daniels studies. It is reasonable to 
assume that working hard is more costly than 
working at a leisurely pace, but the difference 
is probably not great when it is recalled that 
all subjects had to spend the same amount of 
time in the experiment and received an identi- 
cal amount of experimental credit. Their 
manipulation of “high” dependence appears to 
correspond to our definition of “low” depend- 
ence, In their high condition the subject could 
move his partner through a range of outcomes 
extending from $0 to being “eligible for $5.” 
Furthermore, the subject had very imperfect 
control of the outcomes his partner would 
receive; he had no useful standard for 
assessing how much work was required for 
the partner to be able to obtain the reward. 
In the low-dependence condition the subjects 
had no control—their partner’s outcomes were 
not contingent on work output. These sub- 
jects may well have felt that their work out- 
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owitz and Daniels’ results therefore seem to 
show that when the cost of yielding is ex- 
tremely low both males and females respond 
to low dependence, compared to no depend- 
ence, by yielding more. Low partner’s depend- 
ence would be expected to produce conformity 
to the social responsibility norm for males, 
especially since their role in the experiment 
was of lower status than the supervisor-part- 
ner. Their lower status could hardly be jeop- 
ardized by contributing to the partner’s 
chances of winning $5. 
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i , IMITATION, SELF-REINFORCEMENT, AND REINFORCEMENT 


OF ANOTHER PERSON 
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The effect of imitating a model upon the self-reinforcement (sr) rate of college 
Ss was investigated using a word-association task and a vocal reinforcer 
(“Good”). Ss gave associations and said “Good” if they evaluated their re- 
sponse as a “good popular response.” The experimental groups observed a tape- 
recorded model giving associations and saying “Good” at a low rate or a high 
rate. There were controls for observation of the model and for the degree of 
S’s participation in the task while observing. After a period of postmodel sr, 
all Ss had 20 trials of reinforcing another person on the same task and then 
took Cattell’s Goodness of Work Test. The findings indicated an imitation of 
sr rate that persisted through the postmodel trials and transferred to Ss’ 
evaluation of others (ro), both in the word-association task and the Cattell 
test. There was no significant effect of degree of S participation during the 
observation period. Exp. II replicated the relationship of model sr to observer 
sr and the relationship of sr and ro. In addition, 4 of Ss evaluated a tape 
containing low-frequency associations. Competence of the other person being 
reinforced was found to be related to ro; and this relationship was independent 
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of the effect of prior rate of sr. 


A number of recent studies (Kanfer & Mars- 
ton, 1963a, 1963b; Marston & Kanfer, 1963) 
have dealt with the phenomenon of self-reinforce- 
ment (sr). Self-reinforcement is defined as the 
administration of reinforcements by the subject 
to himself, without external controls. The con- 
cept was introduced to explain the reinforcing 
characteristics of a self-evaluation and to analyze 
the way in which self-evaluation enables the in- 
dividual to change or otherwise control his own 
behavior. The studies have dealt both with the 
identification of variables which influence the 
pattern of sr and with the effects of sr on the 
behaviof being self-evaluated. The variables found 
to affect administration of sr include: incentive 
connected with the reinforcement (Marston & 
Kanfer, 1963), level of learning of the behavior 
being evaluated (Kanfer & Marston, 1963b), 
Prior reinforcement for varying rates of sr 
(Kanfer & Marston, 1963a), and social imita- 
tte a model’s sr pattern (Bandura & Kupers, 

4). 
| Bandura and Kupers (1964) examined the ef- 
fect of a model’s criterion for administering sr 
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upon the self-reinforcing behavior of an observ- 
ing child. The models, who varied in age and sex, 
received a block of trials on a motor task while 
the subject observed. In all of the experimental 
groups the model took a reinforcement from an 
available bowl of candy on half of his trials. In 
half of the groups, the model reinforced himself 
after exceeding a low criterion of accuracy on 
the motor task, and in half took a candy only 
after exceeding a more stringent criterion. The 
experimenter controlled information concerning 
the model’s accuracy on the motor task and 
thereby manipulated the subject’s and the model’s 
apparent competence. The results indicated that 
the children imitated the model’s criterion of re- 
sponse accuracy for the administration of sr. 
There appear to be at least three variables in 
the model’s behavior which determine an ob- 
server's sr responses: the model’s rate of sr, the 
level of his criterion for administering sr, and 
his apparent competence in the behavior being 
evaluated. A problem similar to that met in 
studies of partial reinforcement is present in ef- 
forts to separate the effects of these three vari- 
ables, While they kept constant the rate of sr by 
the model in order to examine the effect of level 
of his criterion, Bandura and Kupers had to vary 
slightly the apparent accuracy of the model’s 
motor responses. This was done so that half of 
the model’s responses appeared to fall above and 
half below the given criterion for administering 


255 


256 


sr. Thus, the effects of model’s criterion level 
for sr could not be separated from the effects of 
model’s competence. In order to circumvent this 
problem of the relationship of criterion and 
competence in imitation of sr, the present study 
investigated the model’s rate of sr and eliminated 
systematic variation in model’s criterion and in 
his competence. 


EXPERIMENT I 


The experiment examined the effect of imita- 
tion upon the frequency of positive sr by college 
students’ performing a verbal task and self-ad- 
ministering a vocal, social reward. Model and 
subject gave word associations to different lists 
of stimuli and administered sr by saying “Good” 
whenever they judged their response to be “the 
most popular association.” Level of model’s ap- 
parent competence was equated across groups by 
having all models give the same associations 
while rate of model’s sr was varied. Since com- 
petence was not made explicit for the model or 
the subject (all sr was administered without feed- 
back as to the accuracy of the response being self- 
evaluated), the model’s criterion for sr was only 
indirectly implied by his rate of sr and not ex- 
perimentally separated from rate. 

A second purpose of the study was to examine 
the effect of the subject’s performance of the 
imitated responses during observation. Bandura 
and Walters (1963, p. 55) discuss the theoreti- 
cal implication for Miller and Dollard’s theory, 
of demonstrating imitation without actual per- 
formance during acquisition. In the Bandura and 
Kupers procedure, the subject performed the 
task only after the model had performed his 10 
responses and then left the experimental room. 
The present design compared three levels of the 
subject’s performance during observation: ob- 
servation without concurrent word association by 
the subject, observation with concurrent word 
association by the subject but no overt sr, and 
observation with both concurrent word associa- 
tion and overt sr. Especially where imitation in- 
volves inhibition or disinhibition of self-control 
responses (i.¢,, giving self-rewards), it appeared 
important to determine whether the effect of ob- 
servation would be evidenced without the ob- 
server’s responding overtly. 

The positive relationship between the degree 
of approval of others and the degree of self-ac- 
ceptance has often been noted in a clinical con- 
text (Rogers, 1951). A third purpose of the study 
was to relate the effects of changes in the sub- 
ject’s sr behavior upon his evaluation of another 
person’s responses. After a period of sr following 
observation of a model, the subject administered 
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the same rewards to another person performing 
the word-association task, It was predicted that 
not only would the subject imitate the rate of sr = 
of the model but that his rate of reinforcement | 


of another person performing the task would be, 
related to sr rate. In addition, at the end of the 
experiment, all subjects received Cattell’s Good- 
ness of Work Test (Cattell, 1955), a paper-and- 
pencil test involving the subject’s evaluation of _* 
a variety of work performances, It was predicted 

that the model’s effect on the subject’s reinforce- > 
ment of others would transfer to this new task: 


Method 


Subjects. The subjects were 135 male undergradu- 
ates enrolled in the introductory psychology course 
at the University of Wisconsin. 

Apparatus. The subject and experimenter were 
seated in separate cubicles with a plyboard partition 
between them. All instructions and stimulus words’’+ 
for the word-association task were tape-recorded 
and transmitted to the subject via earphones. 

Procedure. The subject was seated in his cubicle 
and told that he would receive all instructions over 
the earphones, The initial tape-recorded instructions 
over the earphones informed all subjects that they 
would be tested on their word-association ability. 
They were to try to give popular responses, which 
were described as words: given quickly, related ob- 
viously to the stimulus word, and common in usage 
(the example, cat to dog, was given). After each re- 
sponse, the subject was to say the word “Good” if 
he evaluated his response as “the best popular re- 
sponse.” The stimulus words were given at 8-second 
intervals with the word “ready” preceding each 
stimulus by 2 seconds. They were selected from the 
Connecticut Word Association Norms (Bousfield, i 
Cohen, Whitmarsh, & Kincaid, 1961) and consisted 
of words which have at least two associations with | 


= 


a frequency of occurrence greater than 20 for the 
150 subjects in the normative sample. The Thorn- 
dike-Lorge (1944) frequencies of the words were 
equated across blocks of 10 trials. 

After 10 trials to establish the preobservation rate 
of sr, the groups observing a model were told that 
they would then begin to hear a tape recording of 
another person performing the word-association task + 
with different stimulus words. The design consisted 
of three levels of observed sr rate, One set of groups * 
heard the model increase his rate of sr in a mono- \ 

+ 


tonic function over four blocks of 10 trials each, in- 
creasing from “Good” on 2/10 trials to 8/10 trials. 
A second set heard the model give the same word 
associations as the preceding groups (either the first 
or second highest association in the norms), but ’ 
with rate of sr remaining constant at 2/10 across , 
blocks. A set of control groups did not hear a model 
during this block of 40 trials. However, all groups 
received instructions after the first block of 10 trials; ` 
the groups without a model hearing only a summary =» 
review of the original instructions. 

The second variable in the 3 X 3 factorial design 
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, was degree to which the observer responded during 
.’ the modeling trials and had the opportunity to copy 
` overtly the model’s sr pattern. One-third of the sub- 
jects continued to give word associations and sr, al- 
ternating with the model; one-third gave associa- 
tions but were instructed not to say “Good”; the 
remaining third gave neither word associations nor 
sr during this period. Thus, one of the nine groups 
did not hear a model and gave neither word associa- 
J tions nor sr. This rest group was given a magazine 
to read for a length of time equivalent to that taken 
>to observe the model (5 minutes). Each group con- 
tained 15 subjects. 
Following the 40 observation trials (or rest pẹ- 
« riod), all groups received 40 trials without a model, 
giving word associations and overt sr as in the first 
10 trials, with a summary of the original instruc- 
tions preceding this block of trials. 
After the postobservation block of trials all sub- 
i) jects were told that they would be hearing a tape 
» recording of another person giving associations and 
+ ‘that they would no longer be responding themselves. 
Their task for the next 20 trials was to evaluate 
the tape-recorded word associations, saying “Good” 
„Whenever the response met their own criteria for the 
“best popular response.” Again, the tape-recorded 
voice gave either the first or second highest associa- 
tion listed in the norms. 
At the end of the experiment all subjects com- 
pleted Cattell’s Goodness of Work Test (G-8) (Cat- 
» tell, 1955) without a time limit and wrote the an- 
Swers to the following questions: 


1. What do you think was the purpose of the 
experiment ? 
a 2. How did you decide whether to say “Good” 
after your own responses? 
3, Did anything you heard on the tape affect 
your saying “Good” after your own responses? 
(If so, what ?) 
$ 4. How did you decide whether to say “Good” 
j to the tape-recorded subject? From two to four 
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subjects in each of the nine groups gave replies 
related to the effect of the tape-recorded model’s 
self-evaluation on their own sr behavior. 


Results 


The mean rate of administering positive sr 
during the preobservation block of trials was 
5.2/10. An analysis of variance on the rates (con- 
verted to arc sines) yielded no significant group 
differences (all Fs < 1.00). 

The analysis of the effect of the model on the 
subject’s sr rate was done in two parts: sr rate 
during the model’s responding and during the 
postobservation trials. For the groups that gave 
overt sr concurrently with the model (on alter- 
nate trials), it was possible to measure the im- 
mediate effect of the model. An analysis of vari- 
ance was done on proportion (converted to arc 
sines) of associations followed by “Good” for 
the four blocks of 10 trials during which the tape 
recording of the model was being heard (Table 1). 
A ¢ test adjusted for multiple comparisons (using 
the .05 level of significance) on the three overall 
group means (high sr model, low sr model, and 
no model) indicated that the control group hear- 
ing no tape gave significantly higher rate of sr 
than the low- and high-model groups which did 
not differ significantly from each other (M for 
no-model group = 5.5/10, low =3.2/10, high = 
3.5/10). The significant Groups X Blocks effect 
reflects the differential effect of the two taped 
models. (See Table 2 for block means for all pe- 
riods of the procedure.) At Block 1 the relation- 
ship among the three groups was the same as 
that reflected in the above comparisons of over- 
all means, while by Block 4 the low-model group 
had decreased sufficiently to fall significantly be- 


s TABLE 1 

EN ¢ MEAN FREQUENCY OF REINFORCEMENTS PER Brock or 10 TRIALS 

t = -i 

Blocks of trials 
L Grou Eich Observation Postobservation SEEN 
ation 
3 1 1 pi 3 4 1 2 a 3 4 1 2 

pate ; E 4a E S E EEA 
04 33 re i 27,| 30) 38 35 25 26 s1 | 38 
Ea 29 58 | 58 | 50 | 53 | T | ag | 48 | 44 | 61 | 54 
L-1 44 28 2.9 2.6 3.0 5.8 4.2 

EE rh À 53 | 46 54 | 44 | 56 5.2 
N-h ee 3.8 3.7 3.7 3.4 6.0 4.3 
Ni Šo E 6 0] 3S2 AT, 

KE Na 3 SRL SB 59 | SAM 7.01058 


to ni? the abbreviations for the groups, O refers to groups which gave won 


associations and overt sr during observation, L refers 
a gave neither word association nor sr, h and | refer 


+ A w 
groups which gave word associations but no overt sr, N refers to groups wi ijd not observe a tape, 


to high 


igh and low rate of sr on the observed tape, and n refers to groups wh 
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TABLE 2 


ANALYSIS OF VARIANCE OF ARC SINES OF PROPORTION 
POSITIVE st DURING 40 TRIALS OF OBSERVATION 
OF THE MODEL 


Source df MS F 

Groups* 2 3.79 10.24** 
Error between groups 42 0.37 
Blocks ‘ 3 0.22 2.00 
Between groups trends 

Linear 0.38 3.46* 

Quadratic 2 0.07 <1.00 

Cubic 2 0.10 <1.00 
Error within groups 126 0.11 


* Groups are; high, low and no model, with subjects’ giving 
word eons and overt sr (V = 15 pergroup). 
p<.05. 


wep <.01. 


low the high-model group (adjusted ¢ test with 
critical difference at the .05 level). The preob- 
servation rate of sr was approximately 5/10; the 
rate recorded on the low-model tape was a con- 
sistent 2/10; and the high-model tape increased 
from 2/10 to 8/10. The immediate effect of the 
model was a sharp decrease in sr rate in the ob- 
servers to a level of approximately 3.9/10. The 
low group continued to decrease to a level slightly 
above its model (3.0/10), while the high group 
increased slightly but did not approach the Block 
4 rate of its model (4.2/10 versus 8/10) and did 
not return to its own preobservation rate of 
5.1/10. 

All nine experimental groups were compared 
by an analysis of variance on the rate of sr 
(converted to arc sines) during the four blocks 
of postobservation trials (Table 3). There was a 

` significant effect for rate of sr of the model, 
F=22.79, df=2/126, p<.01. An adjusted £ 
test (at the .01 level) showed that the three 
groups all differed significantly from one another 


TABLE 3 


ANALYSIS OF VARIANCE OF Arc SINES OF PROPORTION 
Sr DURING 40 POSTMODEL TRIALS 


Source df MS F 

Model’s sr® (A) 2 10.71 2.793 
Subject’s participation? (B) 2 0.06 | <1.00 
AXB 4 0.57 1.21 
Error between groups 126 0.47 

Blocks (C) 3 0.17 2.13 
AXC 6 0.12 1.50 
BXC 6 0.07 | <1.00 
AXBXC 12 0.11 1.38 
Error within groups 378 0.08 
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(overall M rate sr by the no-model group= 
5.2/10, high tape group = 4.2/10, and low group 
= 2.9/10). The effectiveness of the model was 
not related to the degree of participation by the 
subject while observing the model (F < 1.00), 
During the postobservation trials, the groups did 
not differ in trends over blocks and all groups 
combined showed no significant changes over 
blocks, indicating a continuing effect of the model < 
over the 40 trials. 

The effect of the model on the subject’s rein- * 
forcement of another person performing the same 
task for two blocks of trials is summarized in ` 
the analysis of variance on proportion of tria's “ 
reinforcement was given (converted to arc sines) 
in Table 4. As in the analysis of postobservation 
sr, there was a significant effect of rate of model’s 
sr, F = 8.27, df = 2/126, p < .01, but not of de- 


TABLE 4 


ANALYSIS OF VARIANCE OF ARC-SINE PROPORTION RE- — 
INFORCEMENT OF TAPE-RECORDED PERSON 
Grvinc WORD ASSOCIATIONS 


Source df MS K 


Model’s sr (A) 2 1.82 8.274" 
Subject’s participation (B) 2 0.01 <1.00 
AXB 4 0.32 “1.46 
Error between groups 126 0.22 

Block (C) 1 3.90 55.711% 
AXC 2 | 0.12 Teal 
BXC . 2 | 0.09 1.29 
AXBXC 4 0.04 <1.00 . 
Error within groups 126 <1.00 


** p <.01, 


gree of participation of the observer (F < 1.00). . 
The adjusted ¢ test (at the .05 level) showed that 
the three groups were all significantly different 
from one another (M for the no-model. control 
group = 6.0/10, high-model group = 5.3/10, and 
low model = 4.6/10). There was a significant 
overall decrease in rate of reinforcement of the 
other person from Block 1 to 2, F = 55.71, df= 
1/126,  <.01, which was not related to any of 
the treatment effects. 

The analysis of variance on the Goodness of 
Work scores (a high score indicating a high 
evaluation of other person’s work performances) 
indicated a significant effect of model's sr rate, 
F=4.00, df= 2/126, p < .05, but not of degree — 
of participation of the observer (F < 1,00). The 
adjusted ¢ test (critical difference at p<05= © 
2.14) showed that the no-model control group 


^ Three levels are: high sr on tape, low, and no tape control. 
b Three levels are: word association plus overt sr by the 
subject, word association with no overt sr, and no re- 
sponse by the subject. 
HD <0. 


(M = 62.24) was significantly higher than , 
both the high-model group (M = 59.40) and 
the low group (M=60.04) which did not 
differ significantly from one another. 


ye 


, 


` 
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Discussion 


The results of the present study support 
Bandura and Kupers’ (1964) finding of a marked 
‘effect of a model on the sr behavior of the ob- 


> server. In addition to demonstrating the imita- 


. tion phenomenon in sr by college students ad- 
ministering vocal reinforcers, the results show 
‘that model’s rate of sr is differentially effective 
where apparent competence of model is not 
varied. Further, the effect of observation of a 
model’s sr rate appears unrelated to the observ- 
“ers overt behavior during the observation, with 
silent observation equally effective as overt word 


` ‘association and sr in alternation with the model. 


The transfer of the model’s effect to the ob- 
server’s reinforcement of another person on the 
same task supports the long-standing clinical ob- 
servation concerning the relationship of self- 
evaluation and evaluation of others. It also pro- 
vides an experimental analog and confirmation of 
Fey’s (1955) observation of a positive correla- 
tion between self-acceptance scores and accep- 


“tance of others. The implications of these 


results for the understanding of neurotic devalu- 
ation of others and self seem important, par- 
ticularly if the converse of the phenomenon can 
be demonstrated (i.e. differential change in 
a of others transferring to self-evalua- 
tion). 

The finding that rate of reinforcement of the 
other person decreased over blocks may have 


` been due either to a general phenomenon of de- 


creasing evaluation of others over time or to 
slight differences in the actual quality of the 
taped voice’s associations, An additional group of 
14 subjects was run under the no-model, control- 
no observer participation condition, with a re- 
versal in the taped voice’s word associations in 
the two blocks of trials. This reversal yielded a 


+ reversal in the rates of reinforcement given by 


the subject to the other person (F for Groups X 
Blocks = 20.67, p < .01), despite the absence of 
replication effects in the st rates on either the 
pre- or postobservation trials (Fs < 1.00). It ap- 
pears that a slight variation in the quality of the 


, taped voice’s word association was picked up by 


the subjects during the evaluation procedure. 
The rate of reinforcing the taped voice was 


: higher than the rate of postobservation sr. This 


too, could reflect either a general tendency to be 
More generous in evaluation of others than of 
self, or it could reflect the fact that the taped 
Voice did, in fact, give rapid, high-frequency as- 


p: Sociations. That is, in this study, the taped voice 


may have had a higher level of apparent com- 
petence than the subjects doing the evaluation. 
hile it was not possible to check out these al- 
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ternate hypotheses in the framework of the pres- 
ent design, the effect of competence of the other 
person being evaluated was examined in Experi- 
ment IT, 


EXPERIMENT IT 


In discussing the findings on sr and evaluation 
of others (ro) in Experiment I, the author raised 
the question concerning the relative effect of the 
apparent competence of the person being rein- 
forced and the current level of sr of the rein- 
forcer. Using the same procedure of having the 
subjects reinforce a tape-recorded voice for the 
adequacy of giving “the best possible response” 
in a word-association task, Experiment II varied 
the apparent competence of the voice being rein- 
forced by manipulating the “popularity” of as- 
sociations. In addition, an attempt was made to 
replicate and expand upon the finding of the 
effect of a model on rate of sr. The earlier study 
demonstrated a decrease in sr after observation 
of a tape-recorded voice giving a low rate of sr, 
but no increase above premodel sr rate after ob- 
servation of a high rate of sr on the tape. The 
present study modified the tape to have the 
model begin at a higher rate of sr than in Ex- 
periment I and maintain an overall rate greater 
than that of the average premodel observer. 


Method 


Subjects. The subjects were 90 male undergradu- 
ates who volunteered to serve as part of their intro- 
ductory psychology course at the University of Wis- 
consin. 

Apparatus. The subject and the experimenter were 
seated in separate cubicles, All instructions and stimu- 
lus words for the word-association task were tape- 
recorded and transmitted to the subject via ear- 
phones. 

Procedure. The procedure was essentially the same 
as that used in Experiment I for subjects who ob- 
served the model silently without concurrent word 
association or sr. The subject was given 10 trials in 
which he was instructed to give a popular word as- 
sociation to each stimulus word and to say “Good” 
if he judged his response to be “the best popular.” 
He then silently observed 40 trials of a tape-recorded 
voice performing the same task, giving popular word 
associations (first or second in frequency on Bous- 
field et al., 1961, lists) and saying “Good” following 
a predetermined percentage of the trials. One-third 
of the subjects (high-model group) heard the model 
administer sr on 5 of the first 10 trials; then 6/10, 
7/10, and 8/10. One-third (low-model group) heard 
sr at the rate of 5/10, 4/10, 3/10, and 2/10. The re- 
maining subjects (no-model group) served as a con- 
trol for the observation procedure and read a maga- 
zine for 5 minutes, a period of time comparable to 
that taken by the 40 observation trials. 

After the observation (or rest) period all subjects 
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TABLE 5 
MEAN FREQUENCY OF REINFORCEMENTS PER Brocks oF 10 TRIALS 
Preobser- Postobservation sr Reinforcement of other 
vation sr 
Group 
1 1 2 3 4 1 2 
-ha 6.0 5.1 5.9 6.0 5.6 6.9 5.9 3 
E 5.3 3.7 4.0 3.5 3.9 5.6 4.9 
H-n 6.3 5.3 6.0 6.1 5.9 7.0 6.0 
L-h 5.9 5.7 LYA 6.7 5.8 3.8 3.7 
Ll 5.5 3.7 3.7 3.2 35: 3.1 2.9 
Ln 5.5 5.6 5.3 5.4 5.0 4.2 4.1 


e abbreviations for the groups, h and I refer to high and low rate of sr of the observed taped model, and n refers to groups? 
which aig aks observe a tape. Uppercase letters H and L refer to high and low competence of the taped voice being reinforced 


in the ro trials, 


received 40 trials of word association and sr as in 
preobservation trials. They were all then told that 
they would be hearing a tape recording of another 
Person giving associations and that their task was to 
evaluate these responses, saying “Good” whenever 
one met their own criteria for the best popular re- 
sponse. 

Half of the subjects heard the same tape recording 
used in the earlier study, with the voice always giv- 
ing high-frequency associations from the Bousfield 
et al. norms (high competent other group). The re- 
maining subjects heard the voice give relatively low- 
frequency associations (1/150 on the norms), The 
latter were designated the low competent other group. 

Rate of sr by model and competence of other in 
the ro trials constituted a 3 X 2 factorial design, with 
15 subjects in each of the six groups. 

All subjects at the end of the ro trials took Cat- 
tell’s (1955) Goodness of Work Test, 


Results and Discussion 


Table 5 shows the mean frequency of sr for 
the 10 preobservation trials, mean sr for the four 
blocks of 10 trials each in the postobservation 
period, and mean frequency of ro for the two 
blocks of trials in which the subject evaluated 
the tape-recorded voice. 

The mean frequency of sr during preobserva- 
tion trials was 5.9. An analysis of variance on the 
frequencies (converted to arc sines) yielded no 
significant group differences (F < 1.00). 

An analysis of variance on the arc-sine propor- 
tion sr during the postobservation trials showed 
a significant effect of model’s sr frequency, F = 
14.09, df = 2/84, p < .01. Duncan range tests (p 
< .01) yielded significant differences between the 
low- and high-model groups (mean sr frequencies 
per block were 3.6 and 5.9, respectively) and 
between the no-model group (M sr = 5.7) and 
low-model group. The high-model and no-model 
groups did not differ significantly at the .01 or 
.05 level. 

These results replicate the earlier finding of 
the relationship of model’s sr rate and observer’s 


rate, for the low-model group, The mean sr rates 
of 3.2/10 and 3.6/10 for the first and second 
studies, respectively, were comparable. However, 


despite raising the sr rate of the high model in ` 


the second study, the subjects observing him did 
not administer sr at a rate higher than the no- 
model group. The increase in the model’s rate 
did appear to raise the mean observer's rate in 
the high group of the present study, relative to 
the rate in the earlier experiment (5.9 versus 
3.5). These results. leave open the question of 
whether preobservation level sets a ceiling on at- 
tempts to raise sr via a model or whether a 
longer observation period with a high rate model 
would succeed in removing the apparent ceiling. 


As in Experiment I, the groups did not change . 


in sr rate over trials and did not differ in trial 
trends. 

The present study also successfully replicated 
the sr-ro relationship. An analysis of variance on 
the arc-sine proportion ro yielded a significant 
effect of sr model; F = 6.00, df = 2/84, p < .01; 
with the same pattern of group differences as that 
found in the postobservation sr trials. The Dun- 
can range tests showed significant differences be- 
tween low-model and both high- and no-model 
groups, the latter two not differing significantly 
(M low=4.1, high = 5.1, no-model = 5.3). 

Competence of the person being evaluated had 
a significant effect on rate of reward given him; 
F=67.00, df=1/84, p<.01 (M ro fre- 
quency per block of 10 trials for high-compe- 
tence group = 6.1, low-competence = 3.6). The 
interaction of the two variables was not signifi- 
cant (F < 1.00). All groups decreased in ro from 
the first to second block of trials: F = 11.60, df 
= 1/84, p < 01; but the groups did not differ 
significantly in block trends. 7 

Although the pattern of group means on the 
Goodness of Work Test was similar to those for 
the high-, low-, and no-model groups on fre- 
quency of ro, the Cattell test did not significantly 


ee 


. differentiate the groups as it had in Experiment I. 

” It appears that the decision to deliver a reward 
to another person is a function of both prior rate 
of sr and relatively objective characteristics of 
» the receiver’s evaluated responses, and that these 
.two variables do not interact. The sustained ef- 
fect of sr on ro despite a wide range of discrep- 
ancy in quality of responses being reinforced, 
seems to point to the presence of general evalua- 
tion criteria that apply across both reinforcing 
modalities. In the paradigm used in these studies, 
“type of reinforcement (sr and ro) would involve 
response generalization while investigation ‘of 

` Stimulus generalization would involve variations 
in the responses being evaluated. To maintain a 
stimulus-response framework would necessitate 
hypothesizing response-produced stimuli that 
elicit both the sr and ro behavior. 
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- . Gardner and Schoen (1962) recently published 
~ a paper in which they set out 


to explore relations between a dimensional principle 
Of cognitive control called conceptual differentiation 
and a number of other aspects of concept formation 
Ip fet individuals are known to differ widely 
wœ Tn the course of this undertaking they analyze 
» Some previous work concerned with individual 
differences in category width in order to relate 

Ìt to their studies of differentiation. We felt it 

+ decessary to write this note because Gardner and 
Schoen’s manner of reporting previous results 

in the field of individual categorizing consisten- 
(es ces leads to misunderstandings, presents a 
Considerable oversimplification of the findings to 
Which they refer, and altogether does not live up 
to standards of reporting of other people’s work 
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WIDTH OF CATEGORY AND CONCEPT DIFFERENTIATION: 
A NOTE ON SOME COMMENTS BY GARDNER AND SCHOEN 


HENRI TAJFEL 
University of Oxjord 


that one has come to accept as usual in our 
journals. 

Two studies are involved: one by Bruner and 
Tajfel (1961), and another by Wallach and 
Caron (1959). Most of the criticisms raised by 
Gardner and Schoen are addressed to the former. 
We should like to discuss their various points 
in the order in which they are raised. 

1. The first point made by Gardner and Schoen 
is a theoretical one. They report the attempts 
made in the two papers to account for individual 
differences in category width through the use 
of assumptions about cognitive risk and sensi- 
tivity to change. They do not 


believe that these arguments capture the essence 
of category width [p. 4]. 


We cannot, of course, present a cogent argument 
for or against a belief as to what is the “essence” 
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of a phenomenon. There are, however, some 
less metaphysical issues which need to be 
disentangled. 

The basis of Gardner and Schoen’s argument 

against including tests, such as those used by 
Bruner and Tajfel and Wallach and Caron, as 
relevant to their notion of differentiation is the 
fact that these tests are unidimensional. They 
write: 
In contrast to category-width tests, in which each 
item assesses the limits of one conceptual realm, free 
sorting tests require the spontaneous differentiation 
of heterogeneous items into a complex of more or 
less related groups [p. 3]. 


We fully agree with this view. In a recent 
discussion, Wallach (1962) complained of the 
tendency in this type of work to jump to con- 
clusions about sweeping individual consistencies 
in cognitive functioning without a previous clear 
specification of the variables involved in the 
nature of the task. This argument was closely 
followed by a report (Tajfel, Richardson, & 
Everstine, 1964) in which the need to specify 
the nature of cognitive tasks employed was 
stressed again. This study confirmed to some 
extent for adults the consistencies found by 
Bruner and Tajfel for children in two types of 
judgment tasks: length and numerosity. Tajfel 
et al. write: 


In such tasks categorizing decisions are made on 
the basis of unambiguous criteria about the nature 
of the inclusion class—though its subjective limits 
may vary from individual to individual. Would the 
consistency hold in a situation in which the S 
would have to decide not only about the limits of 
the inclusion class but also about its defining 
criteria. 


The simple answer is that in such a task the | 


consistency does not hold. Despite Gardner and 
Schoen’s statement about the essential differences 
between unidimensional and multidimensional 
tests, Gardner, Holzman, Klein, Linton, and 
Spence (1959) included quite recently some tests 
which were undoubtedly unidimensional in their 
demonstrations of consistency of various “cogni- 
tive controls”: for example, the Schematizing 
Test, in which the subjects were judging the 
areas of 


a series of 150 squares which gradually increase in 
size [p. 23], 


“a kinesthetic time error test” in which judg- 
ments of weight were involved, a size constancy 
test, and two size estimation tests. These tests 
were not, it is true, conducted for the purpose 
of demonstrating “differentiation”; they were 
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1 
relevant to the “cognitive control principles” of 
“leveling-sharpening” and of “focusing.” One i 
still entitled to ask for a clear empirical specifica- 
tion of the distinction between “differentiatio 
and those other control principles. Without suc 
a specification it is difficult to understand why, 
Gardner and Schoen should feel that unidimen= 
sional tests would be valid indices of some aspects 
of cognitive consistencies and not of others. y 

There is, however, more confusion. If ont 
accepts Gardner’s contention that unidimensional 
tests, despite his recent use of such tests for 
closely related purposes, are not valid indices 
of conceptual differentiation, then it is difficul 
to see how this present discussion is at all rele: 
vant to the suggestion made by Wallach andi 
Caron and Bruner and Tajfel N 
that category width, equivalence range, etc, ar 
basically matters of attitudes towards risk taking, 
to which Bruner and Tajfel add “sensitivity t 
change” [Gardner & Schoen, 1962, p. 4]. 


It is quite possible that conceptual differentia: 
tion has nothing to do with such attitudes; bul 
Gardner and Schoen argue that conceptual dif- 
ferentiation also has nothing to do with unidi- 
mensional tests. Why then should the validity of 
these assumptions, specifically directed at clearly 
defined dimensional judgment tasks, be at all 


with a phenomenon that they believe to be of a 
completely different nature? This is as if an 
argument were made that Piaget’s theories about 
the development of logical structures were not” 
valid because they did not account for the role 
of cognitive dissonance in attitude change. i 

2. Bruner and Tajfel found evidence that 
“narrow categorizers” are more sensitive to 
changes in the structure of the situation than 
“broad categorizers.” This evidence consisted, i 
the first experiment of their study, of the tenden: 
cies shown by narrow categorizers to increase” 
their proportion of “yes” responses when the 
range of stimuli was extended, and then to 
decrease it again as function of more practice 
with the new extended range; the broad cate- 
gorizers’ performance remained more or less 
stable. Gardner and Schoen write: 


a number of tests of sensitivity to change reported 
by Bruner and Tajfel are based on relationshi 
between scores that bear artifactual relations to 
other, eg, a rho of —.433 is reported betwe 
“breadth of category in Phase I and the shift fro) 
Phase I to Phase II in the percent of calls of 
[Italics and exclamation from Gardner & Schoel 
1962, p. 41! Pae 


Of the “number of tests” to which Gardni 
and Schoen refer, this is the only one they rep 


significant decrease in the proportion of “yes 
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The result of it, they rightly suspect, may well 
be due to regression. They do not mention the 

» 
responses by narrow categorizers from Phase 
II to Phase III. More important, they do not 


< mention the report in the same study of an 


experiment conducted for the explicit purpose of 
checking on the very artifact which is the cause 
of their concern. Bruner and Tajfel write that 
before conclusions are drawn from their first 


experiment 


some very obvious artifactual possibilities must’ be 


‘dealt with [p. 236]. 


In order to do this, in a second experiment the 
range of the stimuli was contracted in Phase II 
instead of being extended. As a result, and as 
predicted by the authors, narrow categorizers 


* decreased their proportion of “yes” responses 


which, as Bruner and Tajfel write, 


goes counter to what would be expected by virtue 


+. of “ceiling” and “floor” effects, or, indeed, according 


to the effect of regression [p. 237]. 


As to broad categorizers, again “their response 
to change is slow and insignificant.” 

A. further check on regression effects was ex- 
plicitly built into the design of this experiment 
and clearly reported in the paper (p. 238). In 
Experiment II, Phase I consisted of more pres- 


_entations of the series of stimuli than in the 


Corresponding phase in Experiment I. Therefore, 
in Experiment II a check could be made of 
the extent, if any, of the regression effect at the 
precise point at which transition from Phase I 
to Phase II occurred in Experiment I. 


There was no evidence of any trend towards increase 
for the narrow categorizers, or towards decrease for 


_ the broad ones [p. 238]. 


As none of this is mentioned by Gardner and 


Schoen, it seems reasonable to conclude that 
their report of previous work is inadequate and 
incomplete, and that the points they omit are 
Precisely those which are the most relevant to 
their criticism. 

3. Gardner and Schoen write that Bruner and 


_ Tajfel’s procedures, in which the subjects were 
“Tequested to respond “yes” or “no” to the pres- 


entation of each stimulus, did not control 


the well-known tendency of response sets toward 
pruement to contaminate (if not dominate) scores 
Or such tests [p. 4]. 


This may well be true. The point is, however, 
that this is by no means clearly relevant to the 
Tesponse of broad and narrow categorizers to 
changes in the structure of the stimulation—and 
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the investigation of such changes, as the title 
of the paper indicates, was one of the primary 
aims of the study by Bruner and Tajfel. The 
pattern of these changes, as summarized above, 
was consistent with their predictions; and it 
would require considerable mental gymnastics to 
attribute its progress from phase to phase in the 
judgments to suddenly changing response sets. 
At least, an alternative hypothesis as to how 
response sets would determine such changes ` 
should be offered. This is not done by Gardner 
and Schoen. 

4. Bruner and Tajfel found, according to 
Gardner and Schoen, 


a highly significant relationship between initial 
breadth of category and IQ. The correlations they 
found may actually, however, indicate a negative 
relationship between response set for agreement and 
IQ. Their attempts to partial out IQ involve com- 
bination of the artifacts referred to above [p. 4]. 


The “artifacts” were already discussed. The 
rest of the statement can only be described as a 
misrepresentation. Bruner and Tajfel’s attempt to 
“partial out IQ” had nothing to do with initial 
category breadth as related to intelligence. Again, 
they were concerned with the relation between 
initial category breadth and change. The relevant 
part of the report reads as follows: 


a rank correlation between the amount of shift and 
IQ was obtained, and found to be .27, reflecting the 
fact that narrow subjects shifted more. However, a 
partial correlation between IQ and shift, holding 
breadth of category constant, reduced this figure 
to .05. A complementary finding is that the partial 
correlation between breadth of category and amount 
of shift, this time holding IQ constant, remained 
as high as —.35. Thus, it would seem that, though 
breadth of category is consistently related to intel- 
ligence, the amount of judgmental shift that occurs 
when the environment changes is related not to 
intelligence but to breadth of category [Bruner & 
Tajfel, 1961, p. 234]. 


5. With regard to the study by Wallach and 

Caron, Gardner and Schoen refer to one point 
only: the difference in category breadth between 
boys and girls, girls tending to respond in terms 
of narrower categories than boys. This finding is 
dismissed by Gardner and Schoen in the follow- 
ing terms: 
Girls may only appear to have narrower (i.e, more 
“conservative”) category widths than boys. In actu- 
ality, they may simply be showing their well-known, 
apparently socially conditioned, inhibitions with 
respect to quantitative thinking [p. 5]. 


Two points only need to be made: (a) several 
other workers, including Pettigrew (1958), found 


264 


a similar difference. Pettigrew’s paper is referred 
to at least eight times by Gardner and Schoen; 
no mention is made by them of the fact that his 
results support this particular result of Wallach 
and Caron, (b) No attempt is made by Gardner 
and Schoen to show how the girls’ “inhibitions” 
with regard to “quantitative thinking” should 
lead to an articulate prediction that they would 
display a tendency towards one or another mode 
“of categorizing, particularly in a situation such 
as the one used by Wallach and Caron in which 
no manipulation of quantitative terms by the 
subjects was involved. 

The points raised in this note may be of some 
importance in relation to the empirical issue of 
individual differences in categorization. Another 
point seems, however, more important: with the 
rapidly increasing research output in psychology, 
it becomes more and more difficult to check 
directly on brief accounts of past work that each 
new study contains. It is therefore more essential 
than ever to be able to have complete faith in 
the accuracy of such accounts and in a direct 
and demonstrable relation between comments and 
the facts on which these comments are presumed 
to be based. 
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REPLY TO THE NOTE BY BRUNER AND TAJFEL 


RILEY W. GARDNER anp ROBERT A, SCHOEN 


Menninger Foundation, Topeka, Kansas 


Our critique of the Bruner-Tajfel (1961) pa- 
per, “Cognitive Risk and Environmental Change,” 
was directed primarily toward methodological 
flaws in experimental design, score computation, 
and the use and interpretation of statistical tests. 
In our opinion, the uncontrolled variables we 
pointed to raise serious questions concerning their 
interpretation of the results of the series of ex- 
periments they report. We regret that Bruner 
and Tajfel (1965) feel that we have dealt unfairly 
with their paper. Our necessarily brief critique 
appeared in a survey of literature relevant to a 
study of individual consistencies in conceptual 
differentiation (Gardner & Schoen, 1962). In it, 
we drew from a more extensive critique two 
major points concerning uncontrolled variables 
operative in experiments Bruner and Tajfel in- 
terpret as demonstrating a relationship between 
category width and sensitivity to change. Having 
done this, we felt our critique to be adequate, if 
unexplicated. Since Bruner and Tajfel do not 


agree, and since the methodological points we 
made may be of value to others, we shall present 
a somewhat more detailed commentary oh their 
experiments. 

Our first concern is the measure of category 
width Bruner and Tajfel employed. Our objection 
is that agreement response set (yeasaying-nay- 
saying) is uncontrolled. It even seems possible to 
us that their results may indicate a relationship 
between agreement response set and sensitivity to 
change, rather than between category width and 
sensitivity to change. The degree to which either ' 
of these hypotheses can be evaluated is seriously 
restricted, however, by their use of an inappro- 
priate experimental design, their use of unac- 
ceptable methods of scoring responses, and their 
use of incorrect statistical tests. 

In their first experiment, which served as a 
basis for their subsequent experimental opera- 
tions, Bruner and Tajfel measured category width 
by presenting subjects with patterns of dots 
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ranging in number from 20 to 26 and requiring 
them to say “yes” if they thought a pattern con- 
tained 20 dots and “no” if they thought the pat- 


«tern did not contain 20 dots. To measure cate- 


gory width in this way is to introduce the possi- 
bility that individual-difference variance attribu- 


` table to agreement response set is involved in 


their category-width score. Agreement response 
set is not a minor variable that can be disposed 
of without experimental control, especially when 
it is so clearly allowed to operate as it is in this 


experimental design, As Couch and Keniston 


(1960) have demonstrated, this response set is a 


edimension of individual consistency with notable 


generality in its own right. The only statement 
by Bruner and Tajfel that touches upon this 
uncontrolled variable is the following remark 
about narrow categorizers: 


* Perhaps they have a tendency to minimize risk of 


error by the naysaying route, preferring the conse- 
quences of error that come from avoiding contact 


with threatening objects [p. 231]. 


D ag 


This elision of the concepts of category width 
and yeasaying-naysaying may be the key to this 
portion of their methodological difficulties. If 
their statement about narrow category width and 
naysaying was a hypothesis, it might have been 
tested. If it was not a hypothesis, agreement re- 


» sponse set should have been controlled in the 


measurement of category width. Bruner and 


* Tajfel did neither. 


An investigation of relations between a criterion 
Score and a change score must also meet generally 
accepted criteria for the measurement of change 
and for the independence of criterion and change 
Scores, The change scores employed by Bruner 
and Tajfel are inadequate. It is well known (see, 
eg, Lacey, 1956; and, for more recent discus- 


» sions of the relevant issues, Bereiter, 1963; Lord, 


1963) that use of the difference between final 
level and initial level as a change score is inap- 
Propriate. When the criterion score is also the 
initial-level score, it is not possible to ascertain 
the relation of the criterion score to the correct 


, change score, since the correlation between them 


is automatically zero. The correlation between 


, initial level and the difference score is determined 
-by the variances of initial and final level and the 


Correlation between them. Use of such difference 
Scores as change scores can even make it appear 
that one subgroup of subjects shows more change 

an another when the opposite is true, by virtue 
of the fact that difference scores give the same 
Weight to equal numerical changes at the top and 
bottom of the initial-level scale. As Lacey so 
effectively pointed out, only a residual score 
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(which controls the effect of initial level) pro- 
vides an adequate measure of change in such 
experimental designs. Other experimental designs 
might have been used to eliminate this uncon- 
trolled variable from the Bruner-Tajfel experi- 
ments, but we shall not deal with them here. 

Having allowed two sets of extraneous variables 
to remain uncontrolled, Bruner and Tajfel com- 
plicated matters still further by performing in- 
correct statistical tests that involve interactions 
among these artifacts. Since they complain of 
inadequate quotation, let us be explicit in the lim- 
ited space available here. We shall do so by 
presenting relevant sequences of passages from 
their discussion of relations between IQ, breadth 
of category, and change in category width, with 
our commentary on each passage following it in 
brackets. The first sentences quoted below follow 
Bruner and Tajfel’s discussion of Hamilton’s 
(1957, p. 203) point that 


there are few forms of behavior, especially in the 
cognitive sphere, which cannot be shown to vary 
with intelligence [p. 234]. 


Bruner and Tajfel then state: 


In pursuit of this point, a rank correlation between 
IQ and breadth of category in Phase I was obtained. 
It was of the order of — .53. [As pointed out in our 
original critique, this correlation involves an inade- 
quate measure of category width.] The initial breadth 
of category appears then to be determined, in part 
at least by the subjects’ intelligence—more intelli- 
gent subjects tended to use narrower categories, 
{Since response set for agreement has not been con- 
trolled in obtaining the measure of category width, 
this conclusion is questionable.] 


In their next paragraph (also p. 234), Bruner and 
Tajfel state: 3 


The testing of predictions outlined above depended, 
however, on the relationships found between the 
breadth of category shown by the subjects in Phase 
I and the amount of shift from Phase I to Phase 
II. A rank correlation between the amount of shift 
and IQ was obtained, and found to be .27, reflect- 
ing the fact that narrow subjects shifted more. [In 
addition to the inadequacy of the measure of cate- 
gory width, the measure of change does not meet 
acceptable rules for such scores in that initial level 
is not controlled. The relationship to IQ is therefore 
distorted by the relationship between IQ and initial 
level.] However, a partial correlation between IQ 
and shift, holding breadth of category constant, 
reduced this figure to .05. [The measure of category 
width is questionable. In addition, the appropriate 
test is a partial correlation between IQ and final 
level, with initial level held constant. The chain of 
reasoning into which this result is incorporated is 
also questionable for the reasons spelled out above 
and below.] A complementary finding is that the 
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partial correlation between breadth of category and 
amount of shift, this time holding IQ constant, re- 
mained as high as — .35. [The correlation between 
category width and amount of shift is determined 
by the correlation between initial level and final 
level and their variances. This correlation is not a 
result. To do a partial correlation involving this cor- 
relation is statistically meaningless.] Thus, it would 
seem that, though breadth of category is consistently 
related to intelligence, the amount of judgmental 
shift that occurs when the environment changes is 
related not to intelligence but to breadth of cate- 
gory, [This conclusion is questionable because of the 
confounding of multiple artifacts.] 


To continue this discussion of artifacts in the 
basic measure, scoring, and statistical handling 
of scores would be tedious. Commonly accepted 
criteria for the control of extraneous variables 
are repeatedly violated. A complete methodo- 
logical critique of this paper would continue, 
passage by passage, with points similar to those 
above. 

Bruner and Tajfel’s key contention, in defense 
of these results, is that when they reverse the 
situation in a second experiment (by presenting 
larger numbers of dots first, then smaller num- 
bers), the findings support their hypothesis. Their 
subsequent remarks apparently indicate that the 
results of Experiment II justify their interpreta- 
tion of Experiment I and their repeating the 
findings of Experiment I in their Conclusions 
section (pp. 240-241). We think it not justifiable 
to confirm one experiment by means of another 
when both contain the same artifacts. But what 
about Experiment II itself? We have questioned 
the measure of category width on the basis of 
failure to control for agreement response set. This 
is, of course, a hypothesis. Results of agreement 
response set studies by others are impressive 
enough, however, to make Bruner and Tajfel’s 
Experiment II results questionable on this basis 
alone, to say nothing of the incorrect change 
scores they use. 

It is doubtful that Bruner and Tajfel could 
have eliminated these scoring and statistical arti- 
facts even if they had employed the residual- 
score method suggested by Lacey. The problem 
lies in their experimental design: when one uses 
a criterion score also as the score for initial 
level in the measurement of change, he will find 
it impossible to perform adequate tests of hy- 
potheses concerning relations between the cri- 
terion and change scores. Some of the problems 
we point to here could have been solved by the 
use of an independent, and adequate, measure of 
category width. To have included such a measure 
would not, however, have relieved Bruner and 
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Tajfel of the requirement to employ appropriate 
change scores, 

In their reply to our brief critique, Bruner and 
Tajfel have not limited themselves to the facts 
of their experiments and of our original critique. | 
We shall deal first with their comments concern- 
ing relations between conceptual differentiation 
(the subject of our study) and leveling-sharpen-' 
ing. Contrary to what they indicate, two published 
reports contain clear evidence that the criterion 
measure of conceptual differentiation employed in ‘ 
our study is not related to sensitivity to change ~ 
in *the form represented by leveling-sharpening 
(Gardner, Holzman, Klein, Linton, & Spence; 
1959; Gardner, Jackson, & Messick, 1960). In 
view of the methodological inadequacies of the 
Bruner-Tajfel experiments, in which several im- 
portant sources of error were uncontrolled, the 
Schematizing Test may provide a better measure 
of sensitivity to change than their experiments. 
We do not suggest, however, as Bruner and 
Tajfel seem to imply, that sensitivity to change 
is a single dimension. 

Bruner and Tajfel also raise a question con- 
cerning multidimensional versus unidimensional 
categorizing tests. This issue is a great deal more 
complicated than their remarks imply. Our feel- 
ing that tests of conceptual differentiation “have 
more kinship to everyday concept formation than 
adequate category-width tests is, of course, a 
matter of opinion. This opinion does not rest, 
however, on the presumption that multidimen- 
sional scores represent conceptual differentiation. 
One must distinguish sharply between the com- 
plexity of a test and the dimensionality of scores 
derived from it. 

The question of sex-linked differences in cate- 
gory width is, we feel, not as clearly answered as 
Bruner and Tajfel indicate. Perhaps some of the 
difficulty here lies in our use of the word, “quan- 
titative,” which may have been ambiguous. What 
we meant to convey was that requiring a subject 
to categorize in terms of a single dimension 
ordinarily forces him to respond in terms of 
quantity (e.g., how large, how fast, how pointed, 
etc.). Unidimensional tests of category width 
have, we believe, been quantitative in this sense 
and may have introduced attitudes toward quan- 
titative problems to a greater degree than multi- 
dimensional tests, Whether the “sex-linked con- 
servatism” (Wallach & Caron, 1959) observed in 
such tests is attributable to categorizing disposi- 
tions or to attitudes toward quantitative prob- 
lems is, we feel, open to question. Our concert 
here is the finality with which this group of ex- 
perimental results has been interpreted, when 
other interpretations may be tenable. i 


We agree with Bruner and Tajfel’s point that 
those who refer to the work of others should be 
inclusive enough to be accurate. We believed that 
we were, but perhaps the somewhat more ex- 

«tended critique of their paper presented here 
will be more helpful. 
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The reply by Gardner and Schoen (1965) has 
. Not only done nothing to justify their original 
inadequate reporting of previous experiments 
(this cannot be altered by delayed afterthoughts) ; 
„~ it has also managed to create even more con- 
' fusion. Some of the omissions pointed to in our 
Previous note (e.g., control of regression, lack of 
teference to Pettigrew’s results) are again gin- 
gerly bypassed; new references are quoted with 
great gusto, whether relevant or not; empirical 
points are made which have nothing to do with 
. Our previous comments about the limping logic 
(not the empirical validity) of their original 
. argument concerning uni- and multidimensional 
» tests, All this cannot be due entirely to lack of 
. ‘Space: the point about the “artifactual” agreeing 
Tesponse set is made nine times in their short 
Teply, 
The methodological points made by Gardner 
* and Schoen could be of “value to others” if only 
they did not consist of such busy hiding of clearly 
Visible needles in huge haystacks. Consider, for 
example, their endless reiteration of the point 
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HENRI TAJFEL 
University of Oxford 


about the agreeing response sets. In the case of 
category-width tests, agreeing to what? There is 
a good deal of evidence concerning such sets in 
the careful study by Couch and Keniston (1960); 
not a scrap of it is, however, directly relevant to 
a judgment task in which the subjects are only 
requested to decide whether a simple physical 
stimulus is or is not of a certain stated value. 
Couch and Keniston define the agreeing response 


set as 


a general “tendency to agree or disagree with ques- 
tionnaire items, regardless of their content” [p. 152]. 


The stimuli they used were items from such 
scales as the Couch and Bales Value Profile, the 
F Scale, the Dogmatism Scale, personality inven- 
tories, etc. In all these the subject is requested 
either to state his agreement,or disagreement with 
a statement, or to decide whether a statement 
js true or false. Gardner and Schoen do not even 
condescend to mention that all the evidence for 
the agreeing response sets comes from situations 
which are related to decisions about the value of 
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4 
a physical stimulus present at the time of judg- 
ment just about as closely as chalk is to cheese. 
Another example of adequate reporting of other 
people’s work? The next logical step would be 
presumably to extend the “contamination” of 
agreeing responses to situations where a subject 
has to press one button or another to indicate 
whether he does or does not perceive a stimulus 
to be red. As we indicated in our previous note, 
one might just accept the possibility that there 
may be some slight degree of such contamination 
even in tests of this type. But this farfetched 
possibility hardly entitles anyone to ninefold hue 
and cry about artifacts. 

Once Gardner and Schoen have thus in their 
own peculiar manner disposed of our measure of 
breadth of category, they proceed to discussion 
of our change scores. This discussion is primarily 
statistical, and its path is considerably smoothed 
by an implicit and sudden Promotion of the 
presumed empirical artifact of agreeing response 
to the status of a statistical artifact (see the 
series of quotations with their Hegelian antithe- 
ses, and discussion of our Experiment II). They 
cannot possibly have it both ways: statistical 
Operations performed on a set of scores do not 
become statistically invalid because of a possible 
disagreement about the empirical interpretation 
of the scores. 

With regard to the meaning of change scores 
Lacey’s (1956) paper is quoted. Lacey writes: 


A basic physiological error is involved in the com- 
mon practice of quantifying the reaction of autono- 
mically innervated structures by computing algebraic 
or percentage changes, and .. . additional error is 
committed by seeking arbitrary mathematical trans- 
formations that irrationally (by which I mean em- 
pirically, and without regard to physiological theory) 
remove the dependence of such computed changes 
upon prestimulus levels of physiological functioning 
in a specific experimental situation. I Propose that 
such procedures in effect ignore the basic fact that 
one of the main functions of the autonomic nervous 
system and associated endocrine glands is to main- 
tain a homeostatic norm [p. 127]. 


And further: 


A huge number of experimental observations sug- 
gest that any induced excitation or inhibition of an 
autonomically innervated structure instantly initiates 
a series of changes that serve to nullify the dis- 
turbance [p. 127]. 


In his conclusions, Lacey writes that 
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the magnitude of stimulus-produced activation of 
the autonomic nervous system, in general, is related 
negatively to the prestimulus level of physiological 
function. We have argued that this is not simply an 
arithmetical artifact of there being but a limited 


range of response left available to an organism’ 


already functioning at high levels of autonomic exci- 
tation, but that it is a physiological consequence of, 
and deducible from, the facts of homeostatic re- 
straint of.response [p. 155]. 


Perhaps Gardner and Schoen can see clearly how 
“homeostatic restraint of response” relates to our 
experimental results which can, once again, be 
briefly summarized as follows: (a) as function of 
repeated presentation of certain stimulus pat- 
terns, one group of subjects did not change their 
frequency of including responses, the other did. 
The difference between these groups was highly 
significant as shown by a nonparametric test. 
(b) Two Separate experimental controls were 
introduced to check on possible regression effects. 

We have no doubt that other statistical meas- 
ures could have been used, some of them more 
sensitive; but what is meant by Gardner and 
Schoen when they refer to Lacey’s considerations 
about “adequate measures of change in such 
experimental designs”? What experimental de- 
signs? The last paragraph of our previous, note 
applies here, though it is a little difficult to resist 
the temptation to use stronger terms. 

Gardner and Schoen’s discussion of our theo- 
retical point took us by surprise. We discussed 
the logic of their argument, not the possible em- 
pirical relations between cognitive control prin- 
ciples, breadth of category, conceptual differenti- 
ation, etc. Obviously, we were not understood. 
Perhaps the kindest would be to ask the reader 
to go back to our Point a, especially its last 
paragraph and then to reread the relevant part of 
Gardner and Schoen’s note. The only alternative 
would be to spell out the argument again. This, 
we feel, is not really necessary, 
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BIRTH ORDER, RECRUITMENT CONDITIONS, AND 
VOLUNTEERING PREFERENCE * 


; ABRAHAM WOLF 
Temple University Medical Center 
F anp JONATHAN H. WEISS ° 
f Children’s Asthma Research Institute and Hospital, Denver, Colorado 


$ The response of members of different birth ranks to recruitment of volunteers 
S for a group study was measured in 2 experiments, In Exp. I, 561 students who 
were not required to participate, responded yes-no to the recruitment request, In 

Exp. II, 288 Ss who had experimental participation as a course requirement 

ranked their preferences for a group, individual, and isolation study. In Exp. 

II, 1st-born males preferred the group study significantly more than did 

members of other birth ranks. This was not true in Exp. I, and was not true 

for females in either case. The hypothesis is that 1st-born males submit to 

5 recruitment pressure with anxiety and therefore, affiliate as a means of reducing 
: anxiety. The complexity of the results casts serious doubt on the generalizabil- 
ity of the Capra and Dittes finding that Ist-born males volunteer for a group 
experiment more frequently than do later borns and on the appropriateness of 


a simple “need for affiliation” hypothesis to explain that finding. 


Capra and Dittes (1962), reported that first- 
born children volunteer for a group experiment 
more frequently than do later-born children. They 
interpreted this as evidence of greater affiliative 
need in first borns and suggested that volunteer- 
ing as a means of subject recruitment for group 
studies would attract a population whose results 
would be extremely limited in generalizability, 


` especially when variables related to birth order, 


for example, dependency, were being studied. 
Examination of the Capra and Dittes experiment 
reveals, however, that there are several limita- 
tions to the interpretation and suggestion offered 
by these authors. First, subjects were given the 
Opportunity to volunteer for participation in a 
group study only, so that no adequate control 
for general volunteering behavior had been em- 
ployed, and it could not be determined whether 
the results indeed reflected greater need for 
affiliation rather than a tendency of first borns 
to volunteer more readily for any type of study. 
Second, solicitation of volunteers was done in a 
one-to-one, face-to-face situation so that the 
findings could have reflected a tendency in first 
orns to acquiesce to conformity pressure 


“Becker & Carroll, 1962; Sampson, 1962). If this 


*A portion of this paper was read at the 1963 
erican Psychological Association meetings, Phila- 
delphia, Pennsylvania. 
R The authors wish to express their thanks to 
obert G. Wiltsey for his cooperation in the part 


of this study conducted at the University of 
Rochester 


were true, recruitment under less pressuring cir- 
cumstances, for example, in a large class, might 
eliminate birth-order-related differences in volun- 
teering. Finally, Capra and Dittes restricted their 
experiment to male subjects, and their results 
could not, therefore, be generalized to first-born 
females. Certainly, many small group studies 
which employ volunteers are run with females 
or at least mixed samples. 

The experiments reported here were, therefore, 
undertaken to investigate the relationship be- 
tween birth order, recruitment conditions, and 
volunteering preferences of males and females for 
studies varying in affiliation potential. 


EXPERIMENT I 


This experiment attempted to discover three 
things: first, whether members of different birth 
ranks show consistently different patterns of 
volunteering, regardless of the type of study for 
which volunteers are recruited; second, whether 
the Capra and Dittes findings could be replicated 
with a group-recruitment procedure, which pre- 
sumably would be less pressuring than the one- 
to-one recruitment situation; and third, whether 
sex differences in volunteering patterns could be 


identified. 


Method 


Three groups (classes) of male and female college 
students at the University of Rochester, with a total 
N of 561, were employed. One class, selected at 
random, was asked to volunteer for a study involv- 
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TABLE 1 


[UMBER OF MALE SUBJECTS OF EAcH Birth RANK 
VOLUNTEERING FOR GROUP, INDIVIDUAL, AND 
ISOLATION EXPERIMENTS 


Experiment 
Birth order — oe 
Group | Individual | Isolation 
‘irst and only born | 
Volunteering 29 34 | 42 
Not volunteering 34 23) ayes: 
Jther born | T 
Volunteering 27 36 35 
Not volunteering 22 19 | 29 


D x = 7.25, df = 6, p<.30 


1i 


ng a “cooperative group venture” * lasting about an 
hour; a second class, again chosen at random, vol- 
anteered for a study in which subjects would inter- 
uct for about an hour with only an experimenter; 
and the third class volunteered for an isolation study 
involving approximately 1 hour to be spent alone. 
To reduce external pressure to volunteer, recruit- 
ment was by a graduate student, and in classes for 
which experimental participation was not a course 
requirement. In addition, instructors were absent. 
Following recruitment, birth-order data were ob- 
tained by means of a “Family Size Survey” ques- 
tionnaire, 


Results 


A chi-square analysis (see Table 1) of the data 
shows, first of all, no significant differences be- 
tween the frequencies with which members of 
different birth ranks volunteered, regardless of 
the study for which volunteers were being re- 
cruited. This was true for males and for females 
(see Table 2). Only one trend, opposite in direc- 
tion from that predicted by Capra and Dittes, 
emerged, namely, first- and only-born males 
treated together appeared, if anything, to volun- 
teer Jess for the group study than did members 
of other birth ranks. However, this tendency was 
considerably diminished when only-born children 
were excluded from the first-born category and 
considered separately. 


Discussion 


On the basis of these findings, two things 
seemed likely, First, any differences in rate of 
volunteering for group studies between first borns 
and other borns which might be found under such 
conditions as those of the Capra and Dittes ex- 
periment, could probably not be attributed to a 


3 Capra and Dittes had recruited for a study in- 


volving, “a group performing a common task coop- 
ratively,” 


general difference in volunteering behavior. Sec- 
ond, if birth order could in fact be shown to be 
related to volunteering, its effect would be pro- 
scribed by the recruitment conditions, Thus, in’ 
the present experiment, a group-recruitment pro- . 
cedure seemed to suppress the effect, at least 
when subjects were required to either “volunteer” 
or “not volunteer” for a group study. The ques- 
tion arises, however, whether this would also be 
the case if subjects were given the opportunity to 
express preferences among studies varying in 
affiliation potential. It seemed possible that under 
conditions of decreased pressure, the effect might 
be diminished to the point that a “yes-no” re- 
sponse was not sufficiently subtle to detect it. 
Experiment II was undertaken to answer this 
question. 


EXPERIMENT II 
Method 


Male and female undergraduates numbering 288 in 
an introductory psychology lecture section at the 
University of Colorado ranked their preferences for 
participating in the three studies described above. 
Subjects were told that they could volunteer for 
one or more studies or for none at all, but were 
asked to rank order their preference for participating 
in all three studies in any case. This could be ,done 
by using “not volunteer” as a category. For ex- 
ample, if the subject chose to volunteer for the 
group study only he would rank “group” as his 
first choice, and rank “not volunteer” as his second 
choice. However, he would then go on to rank the 
remaining two studies in his order of preference as 
if he were in fact going to participate in them. It is 
important to note that in the class from which 
recruits were solicited, students were required to 
participate in a study, but not necessarily one of 
ours, to fulfill course requirements. Therefore, they 
were told that although participation in studies 
would be credited to them, they could choose not 
to participate anyway and fulfill their requirements 


TABLE 2 


NUMBER or FEMALE Supjects or Eacu BIRTH 
ORDER VOLUNTEERING FOR GROUP, INDIVIDUAL, 
AND ISOLATION EXPERIMENTS 


Experiment 
Birth order 
Group Individual | Isolation 

First and only born 

Volunteering 20 26 13 

Not volunteering 34 40 28 
Other born 

Volunteering 27 16 17 

Not volunteering *) 42 31 28 

ETIA CE AL DNE Sa A 
2 = 6.19, df = 6, p<-50 


®. 
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TABLE 3 
MEAN PREFERENCE RANK ASSIGNED To Each STUDY BY ALL BIRTH ORDERS 


Birth order 
Study Males Females 
First Only Middle | Ye t Fi 
w=60) | w= | W=32) | wes) | a eeo arate) | eso. Ores 
Group 2.4 3.0 2.8 2.9 2.5 2.7 2.5 2.5 
Individual 2.0 1.9 1.9 1.9 1.7 13 1.9 1.4 
Isolation 1.9 1.4 17 1.6 2.0 2.3 1.9 2.3 
Not volunteer 3.7. 3.7 K AN iad 3.7 3.7 3.8 3.8 3.8 
Note.—1 = first choice, 2 = second choice, 3 = third choice, 4 = not volunteer. 


elsewhere. After the rankings, birth-order data were 
obtained in the same manner as in Experiment I. 


t+ Results 


In all following analyses, first borns and only 
borns were treated separately. All p values re- 


: ported are for two-tailed tests. 


_ Considering males first, it was found that all 
birth orders, treated as groups, preferred the iso- 
lation study first, the individual study second, and 
the group study last (see Table 3). However, first 
borns tend to choose the group study as their 
first option more frequently than did middle 


. borns (p <.10) or only borns (p<.10) (see 


Table 4), In addition, first borns tended to elect 


«not to volunteer for the group study less fre- 


quently than did youngest borns (p<.10) or 
only borns (p < .10) (see Table 5). The statis- 
tical test in each was of the difference between 
uncorrelated proportions (Guilford, 1956, pp. 
190-192). 

In addition ¢ tests of the difference between 
mean ranks assigned by birth orders to each of 
the studies were calculated, These showed that 
first borns ranked the group study significantly 
higher than did all other birth orders (p < .05). 

For the individual and isolation studies, no 

ferences at all between birth orders were noted 
on any of the measures. 

Results of females were quite different. First, 


«the overall pattern of preference, for all but 
Middle borns, was for the individual study first, 


the isolation study second, and the group study. 
last (see Table 3). Second, no differences be- 
tween proportions of different birth orders that 
chose the group study as their first option (see 
Table 4) or elected not to volunteer for it (see 
Table 5), were noted. For the individual study, 
youngest borns chose it as their first option sig- 
nificantly more frequently than did first borns 
(p <.05), and only borns chose it as their first 
option significantly more frequently than did 
first borns or middle borns (p < .05 and .025, 
respectively). No differences between birth orders 
on any measures were noted for the isolation 
study. 


Discussion 


It is clear from the results of Experiment II 
that under certain conditions the relationship 
described by Capra and Dittes, between. birth 
order and volunteering behavior in first-born 
males, does emerge. More problematic is the 
nature of those conditions. An analysis of the 
operational and inferred differences between Ex- 
periments I and II suggests two possibilities. 
First, that the ranking technique used in Experi- 
ment II merely revealed what was present but 
undetected in Experiment I because of the 
subtlety of the effect under group-recruitment 
conditions. Second, that the pressure to volunteer, 
brought to bear in Experiment II by the course 
requirement to participate, is the basic operative 


TABLE 5 


x TABLE 4 PERCENTAGE ELECTING Not TO VOLUNTEER 
` Percentace CHoosmo Group Struny as FIRST OPTION For GROUP STUDY 
Birth order Males Females Birth order Males Females 
First 20.2 First 8.3 14.4 
Middle ee 22:2 Middle 15.6 11.1 
Youngest 111 12.5 Youngest 22.2 125 
Only 0.0 8.3 Only 27.2 8.3 
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factor. That such pressure existed is implied by 
the fact that only 1 of the 288 subjects chose 
not to volunteer for any studies. Although the 
first possibility cannot be ruled out in the absence 
of data on the responses of group-recruited sub- 
jects under pressure and required to give a yes- 
no response, the fact that Capra and Dittes’ 
subjects, who gave a yes-no response under pres- 
sure, albeit in a one-to-one recruitment environ- 
ment, showed a response pattern very similar to 
that shown by subjects in Experiment II, sug- 
gests that the second possibility is the more 
likely. This suggestion is strengthened by the 
observation that the rank order of the three 
studies (Capra and Dittes, Experiment I, and 
Experiment II) by the degree to which the effect 
is shown is precisely the same as that which 
would describe the position of the studies on a 
gradient of “pressure to participate.” Further- 
more, it accords well with the findings of Becker 
and Carroll (1962) and Sampson (1962) that 
first-born males tend to be more sensitive to 
conformity pressure than are members of other 
birth ranks, And finally, since Schachter (1959) 
found greater affiliative tendencies in first-born 
females, the fact that preference for group par- 
ticipation is lacking in our sample of first-born 
females suggests that something more than simple 
need for affiliation is operating. That is, if need 
for affiliation is stronger in first-born females, 
and if recruitment for volunteers appealed di- 
rectly to this need as Capra and Dittes believe, 
then the Schachter result should have been repli- 
cated. Our failure to do so, then, implies another 
operating factor, apparently acquiescence to pres- 
sure, However, an explanation of the effect in 
first-born males solely in terms of their suscepti- 
bility to pressure would be inadequate since it 
would not account for their selective preference 
of the group study under pressure. In other words, 
if first borns were merely submitting to volun- 
teering pressure, they should have volunteered 


as frequently for the individual and isolation. 


studies in Experiment II as for the group study. 
What is implied, therefore, is some mediating 
mechanism, in first-born males, between sub- 
mitting to pressure and the affiliative behavior 
reflected in preference for group participation. 
The most likely mechanism is anxiety which, as 
Schachter (1959) and Wrightsman (1960) have 
shown, is associated with affiliative behavior in 
first borns. Relating this to the results of Experi- 
ments I and II it would appear that when there 
is little pressure to participate, that is, when 
Subjects are given a free choice in a group-re- 
cruitment paradigm, first-born males are not dif- 
ferent from other borns in volunteering for a 
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group study. However, when pressure is increased, 
in this case by requiring participation, first borns 
react to submitting to such pressure with anxiety. 
The choice of group participation at such a time’ 
serves, then, not simply as a means of satisfying + 
a need to affiliate, but rather as a means of re- 
ducing anxiety. 

What antecedents to the first-born male’s 
anxiety in reaction to conformity pressure may 
be, cannot be delineated at this time. However, 
it seems likely that they will be related to defin- 
able, antecedent socialization patterns. In this ' 
connection, the finding of Becker and Carroll | 
(1962) that first-born male Puerto Ricans are 
more resistant to conformity pressures than are 
their American counterparts is a promising lead. 
Further cross-cultural studies which would relate 
differences in response to pressure to differences 
in socialization practices could illuminate this . 
matter. 

As for the male-female differences noted in the 
present investigation, Sampson’s (1962) finding 
that first-born males are more acquiescent than 
later borns but that the reverse is true for 
females, may be of some relevance. Thus our 
findings may reflect underlying sex-related dif- 
ferences in degree of resistance to conformity 
pressure. On the other hand, the possibility must 
also be considered that under our conditions no 
differences existed in degree of acquiescence, but 
that differences did exist in the degree of anxiety A 
provoked by conformity. It may be that the 
culturally defined passive role adopted by women 
makes acquiescence less anxiety provoking for 
them. 

As for Capra and Dittes’ caution against using 
volunteering as a means of recruitment in small 
group research, the results reported here indi- 
cate that this method has not been ruled out in 
all cases. However, the prospective investigator « 
should keep in mind that the response to such 
recruitment may be influenced by the degree of 
recruitment pressure, the type of response re- 
quested, the situation in which volunteering is 
requested, the sex of the subject, the cultural 
population from which the samples are drawn 
(Glass, Horowitz, Firestone, & Grinker, 1963), 
and possibly by the guarantee of a cooperative 
group venture. 
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EGO-CONTROL PATTERNS, AND ATTRIBUTION OF HOSTILITY 
TO SELF AND OTHERS: 


ROBERT L. McDONALD 2 
Medical College of South Carolina 


This study was designed to determine if Ss who use different types of ego 
controls in response to threatening situations would differ in their attribution 
of hostility to themselves or others. 4 ego control patterns—expressors, repres- 
sors, sensitizers, and expressor-sensitizers—were derived by combinations of 
scores on the MMPI-based repressor-sensitizer and expressor scales. The Inter- 
personal Check List was administered to 177 white single pregnant females 
at the 7th mo, of pregnancy to obtain ratings of self and parents. Reliable 
differences in the predicted direction were found between the 4 subgroups with 
regard to attribution of hostility to self and parents. Attribution of hostility to 
others (parents) was also found to be related to the S’s parental identification. 


Efforts to devise reliable measures of ego 
control with the MMPI have ranged from 
Meehl’s (1945) N scale and other single dimen- 
sional approaches (Block, 1953; Cuadra, 1953; 
Pearson, 1954; Sullivan, 1954 unpublished *) to 
More complex configurational approaches involv- 
ing a combination of scales (Eriksen & Davids, 
1955; Sanford, Webster, & Freedman, 1957). 
One of the more recent developments in relating 
individual behavioral differences to relatively 
Stable patterns of personality functioning, by 
Means of a configurational approach, has been 
the delineation of a Repression-Sensitization 
Scale (Altrocchi, Parsons, & Dickoff, 1960; 
Byrne, 1961; Byrne, Barry, & Nelson, 1963). 

_ The Repression-Sensitization (R-S) scale, given 
Impetus by earlier work on perceptual defense 
(Lazarus, 1954), classifies people on a continuum 
with respect to their primary mode of adaptation 


_ to threatening stimuli. Repressors are defined as 


' 1 This study was supported by a grant from the 
United Health and Medical Research Foundation of 
South Carolina. The author also expresses grateful 
appreciation to J. Altrocchi for suggestions and 
- D. Gynther for assistance in manuscript prepa- 
ration, 
? Now at the Department of Psychiatry, Emory 
niversity Medical School, Atlanta, Georgia. 
Study entitled “A Scale for Evaluating the Need 


for Psychological Treatment,” 


those persons who respond to threatening situa- 
tions predominantly with use of repression, 
denial, and avoidance, while sensitizers are de- 
fined as those responding to such situations with 
ruminative obsession, manifest anxiety, and 
intellectualization. 

Previous research has shown the use of these 
defenses to be consistent on various tasks 
(Lazarus, Eriksen, & Fonda, 1951). Furthermore, 
in comparison to sensitizers, repressors have 
been shown to avoid the expression of unac- 
ceptable needs on projective tests (Eriksen, 
1951), recall fewer disturbing words (Carlson, 
1954), manifest smaller self-ideal-discrepancies 


. (Altrocchi et al., 1960), have more positive self- 


concepts (Altrocchi, 1961), obtain higher authori- 
tarian scores (Adorno, Frenkel-Brunswik, Levin- 
son, & Sanford, 1950), select fewer deviant 
questionnaire responses (Byrne, 1961), and are 
less likely to encounter obstetric complications 
during pregnancy (McDonald, Gynther, & 
Christakos, 1963). 

The formulation of two additional patterns of 
ego controls—expressors and expressor-sensitiz- 
ers—has been derived from recent studies dealing 
with the relationship of hostility to ego controls. 
Although not as well experimentally validated 
as the foregoing categories, expressors are de- 
fined as people who respond directly to threaten- 
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ing situations, are uninhibited, and manifest a 
tendency to express impulses relatively openly 
and with little anxiety or guilt (Altrocchi & 
Perlitsh, 1963). Conceptually, they are similar to 
Leary’s (1957) aggressive-sadistic personalities 
and Michael’s (1959) impulsive characters. 
Expressor-sensitizers are persons whose MMPI 
protocols reflect personality characteristics com- 
mon to both expressors and sensitizers: impulsiv- 
ity and lack of inhibitions like typical expressors, 
as well as guilt, anxiety, and obsessive concerns. 
The purpose of this paper will be to assess 
the relationships between these various patterns 
of ego control and the attribution of hostility 
both to self and others. The results of earlier 
studies (Altrocchi & Perlitsh, 1963; Altrocchi, 
Shraugher, & McLeod, 1964) involving student 
nurses as well as both male and female under- 
graduate psychology students, while not con- 
clusive, tended to support the hypotheses that: 
sensitizers attribute more hostility to themselves 
than expressors or repressors, and, expressors at- 
tribute more hostility to others than repressors 
or sensitizers, The failure to fully substantiate 
these hypotheses was attributed by the authors 
to the artificiality of the experimental situation, 
the lack of a real-life threatening situation, 
and the relatively short time the raters had 
known each other. The present study attempted 
to surmount these obstacles by obtaining ratings 
of self and parents from unmarried female 
subjects in a more clinically oriented situation, 
pregnancy, which has shown to be anxiety 
provoking (McDonald & Parham, 1964). 


METHOD 
Subjects 


Subjects for this study consisted of 177 single 
white pregnant females. These patients were residents 
at a Florence Crittenton Home (FCH) from ap- 
proximately the fifth month of pregnancy until 
after delivery and were attended prenatally by the 
Obstetrics Department at the Medical College of 
South Carolina, Their ages ranged from 14 to 39 
with a mean age of 19.1 years, and educational levels 
ranged from 8 years to BA degree with a mean of 
11.0 years. Parents of these patients were derived 
primarily from the lower middle socioeconomic class, 


Instruments 


The MMPIs which had been collected in con- 
nection with another study (McDonald & Parham, 
1964) were scored for the R-S scale (Byrne, 
1961). This scale consists of the items comprising 
the D, Hy, Pt, L, and K scales plus Welsh’s Anxiety 
scale items (Welsh, 1952). Inconsistently scored items 
(eg, items contained in two or more of the 
six scales and are scored “false” in one and 
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“true” in another) are excluded from the analysis, 
The omission of such items results in a 156-item 
scale with high scores indicative of sensitization and 
low scores of repression. The R-S scores for the 
present group ranged from 29 to 116 with a mean 
of 65.15 and a standard deviation of 17.47. These 
figures are comparable to those obtained from 
Byrne’s normative group of female introductory 
psychology students as their R-S scores ranged from 
20 to 119 with a mean of 61.8 and a standard 
deviation of 16.2, 

Derivation of the Expressor (Exp) scale scores 
involves handling Pd, Denial of Hysteria (HyD) 
(Littlé & Fisher, 1958), and Repression (R; Welsh, 
1952) scores in the following manner: 2 Pd —(R 
+HyD), with both Pd and R raw scores being 
converted to T scores, the former without K allow- 
ances, and R by means of Welsh’s conversion table. 
The T scores for the HyD scale were calculated from 
the entire distribution of raw scores obtained from 
the present sample. Scores on the Exp scale ranged 
from —22 to 109 with a mean of 36.48 and a stand- 
ard deviation of 26.19. In comparison, Exp scale 
scores obtained from Altrocchi et al. (1964) student 


nurses ranged from —40 to 103 with a mean of- 


5.8 and standard deviation of 25.1. 

The Interpersonal Check List (ICL; LaForge & 
Suczek, 1955) is an instrument containing 128 adjec- 
tives or phrases (e.g., dictatorial, shy, considerate) 
with 16 items for each of eight interpersonal 
categories (ie. octants). The octants are: mana- 
gerial-autocratic (1); competitive-narcissistic (2); 
blunt-aggressive (3), skeptical-distrustful (4), 
self-effacing-masochistic (5), docile-dependent (6), 
cooperative-overconventional (7), and responsible- 
overgenerous (8). Dominance (Dom) and Love 
(Lov) T scores are derived by means of form- 
ulae given in the ICL manual (Leary, 1957). 
Octant ratings for self, mother, and father were 
obtained by locating the intersect of the Dom and 
Lov T scores on a two-dimensional grid. The dis- 
crepancies between parental and self-descriptions 
were obtained by determining the absolute distances 
beween the summary points of the parental and 
self- (ie, self versus mother, self versus father) 
ratings plotted on the diagnostic grid. 

Group comparisons may be made on the basis of 
various combinations of octant scores. For example, 
one standard cut compares the idealized “virtues” 
of strength, responsibility, and conventionality 
versus the more “neurotic” factors of dependence, 
submission, distrust, and anger (i.e, Octants 7812 
versus 3456). As the present study focused spe- 
cifically on attribution of hostility, results for those 
octants (3 and 4) containing the heaviest loading 
for this factor were used for analysis. 


Procedure 


The MMPIs and the ICLs were administered at 
the beginning of the third trimester (seventh month 
of gestation) to all patients who delivered at the 
FCH over a 2-year period. On the ICL, the subjects 
were instructed to rate self, mother, and father 
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TABLE 1 


MEAN ITEM ENDORSEMENT (By OCTANTS) FOR RATINGS or SELF, MOTHER, AND FATHER BY REPRESSORS (R) 
Expressors (E), SENSITIZERS (S), AND EXPRESSOR-SENSITIZERS (ES) 


Self Mother Father 
Octant 
R E S ES R E s ES R E S ES 
1 5.85 5.57 | 543 | 4.91 7.34 | 7.64 | 7.54 | 813 | 650] 835] 8.08 | 8.90 
2 6.03 7.43 | 5.79 | 5.47 | 5.94 | 6.68 | 5.92 6.19 | 6.46 | 7.65 | 7.42 7.10 
3 4.74 6.57 | 5.86 | 6.41 5.53 | 7.23 | 6.54 | 6.34 | 5.25] 6.94 | 7.42 TAT 
4 3.41 5.57 | 6.07 7.71 2.82 | 5.64 | 408 | 4.69 | 2.92] 435 | 4.67 | 4.93 
5 4.21 6.04 | 6.71 7.50 | 3.88 | 486] 4.23 | 447 | 3.39] 3.65 | 3.08 | 2.57 
6 7.21 7.74 | 8.21 9.32 | 647 | 6.50] 7.08 | 691 5.21.) 5.06 | 5.83 | 4.93 
7 7.53 7.43 | 7.79 | 8.97 | 844] 864) 9.31 8.84 | 7.61 | 7.53 7.58 | 7.77 
8 6.79 6.91 5.57 7.41 8.78 | 8.50 | 10.46 | 9.03 7.18 | 6.24 | 6.50 | 7.00 
Total | 45.76 | 53.25 | 51.42 | 57.70 | 49.21 | 55.68 | 55.14 | 54.59 | 44.53 | 49.76 | 50.58 | 50.36 
N 34 23 14 34 32 22 13 32 28 17 12 30 


although divorce or earlier death of a parent pre- 
vented obtaining complete sets of parental ratings 
in all cases. 

Combinations of the R-S and Exp scale scores 
were used to select subjects comprising extreme 
groups on the four patterns of ego control. Cutoff 
points on both the R-S and Exp scales for the 
various groups were chosen so as to approximate 
one half a standard deviation above or below the 
respective means. Repressors were defined as those 
subjects obtaining scores of 54 and below on the 
R-S scale, 23 and below on the Exp scale; sensitizers 
included those subjects with scores of 74 and above 
on the R-S scale, 23 and below on the Exp scale; 
expressors were those subjects with scores of 73 
and below on the R-S scale, 49 and above on the 
Exp scale; expressor-sensitizers included those sub- 
Jects with scores of 74 and above on the R-S scale, 
49 and above on the Exp scale. This selection re- 
sulted in a total sample of 105 distributed in the 
following manner: 34 repressors (R), 23 expressors 
ee 14 sensitizers (S), and 34 expressor-sensitizers 


RESULTS AND DISCUSSION 


Table 1 summarizes the means of total items 
endorsed (possible range per octant: 0-16) by 
the four subgroups for the self, mother, and 
father ratings, Inspection of this table indicates 
that all four groups marked more items for self 
in those octants representative of dependent and 

_ COoperative behavior (Octants 6 and 7). All 
. 8toups attributed predominantly cooperative and 
generous behavior to their mothers (Octants 7 
and 8), this emphasis being most evident with 
the sensitizer group. The father ratings, in addi- 
tion, stressed cooperative qualities; generally, 

Owever, greater importance was placed on 
descriptions of managerial behavior (Octant 1). 

Analysis of these data indicates that with the 
€Xception of the expressor-sensitizers’ endorse- 
Ment of a greater number of items in the 


“neurotic” octants (3456) (x? = 11,06, p < .001) 
for the self-rating, there was a tendency for all 
subgroups to endorse more items in the “healthy” 
octants for the three sets of ratings. Further 
analysis reveals that the four groups differ sig- 
nicantly in their endorsement of total items on 
the various ratings. The repressors endorse sig- 
nificantly fewer items for all sets of ratings than 
any other subgroup. While it was hypothesized 
that repressors would attribute less hostility to 
others as well as to themselves, the fewer total 
items endorsed by the present group implies 
that repressors, in addition, tend to deny more 
positive behavioral qualities, such as friendliness 
This finding may reflect a type of response set, 
in this case, a tendency to disagree with (not 
endorse) any item, regardless of content, Re- 
sponding in this fashion would be consistent with 
Couch and Keniston’s (1960) “naysaying” group 
which was characterized by resistance to change, 
slowness to respond, and marked utilization of 
rationalization and denial. 

The expressor-sensitizers endorse a significantly 
greater number of items for the self-ratings than 
the repressors, sensitizers (y? = 46.86 and 12.26, 
respectively;  <.001 both cases), and expres- 
sors (x? = 6.18, p < .05). This subgroup, which 
is at the positive extreme of the item endorse- 
ment continuum, might be depicted as “yea- 
sayers.” Couch and Keniston (1960) showed that 
yeasayers, as compared to naysayers, were more 
extraverted, impulsive, and possessed an emo- 
tionally oriented approach to problems, attributes 
which largely conform to the expressor-sensi- 
tizers’ description. Expressors and sensitizers en- 
dorse more items than the repressors for the 
mother ratings (x? = 12.64, p < .001) and the 
father ratings (x? = 11.56, p< .001), respec- 
tively. The remainder of the statistical compari- 
sons controlled for these different response totals 
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of the various groups as well as differences in 
group sizes. 

Turning now to the major question posed by 
this study, Table 2 presents the results of chi- 
square analyses involving totals for items meas- 
uring hostility (Octants 3 and 4) endorsed by 
the four subgroups for parental and self-ratings. 
Inspection of this table reveals that with regard 
to the self-ratings, expressors, sensitizers, and 
expressor-sensitizers endorse a relatively greater 
number of items in Octants 3 and 4, after making 
adjustments for total item differences, than do 
the repressors. These findings support the hy- 
pothesis that sensitizers, when faced with a 
threatening situation, would attribute more hos- 
tility to themselves than would repressors. 
However, the prediction that sensitizers, under 
similar circumstances, would attribute more 
hostility to themselves than would expressors 
was not confirmed. Although unpredicted, both 
the expressors and expressor-sensitizers also 
attributed more hostility to themselves than the 
repressors. Collectively, these findings are con- 
sistent with Altrocchi and Perlitsh’s (1963) and 
Altrocchi et al. (1964) earlier results, but the 
present data support the predictions in a much 
more convincing fashion. 

Further inspection of Table 2 indicates that 
our data, after adjustments for total response 
differences, also support the hypothesis that 
expressors will attribute more hostility to others 
than repressors, In addition, it can be seen that 
expressors attribute greater hostility to mothers 
than sensitizers. The only other consistent finding 
in regard to the parental ratings is the attribu- 
tion by the expressor-sensitizers of greater hos- 
tility to both parents than repressors, While 
Altrocchi previously demonstrated a significant 
difference on this variable between his student 
nurse expressor and repressor subgroups (E 
>R), he found no differences between the 


TABLE 2 


CHI-SQUARE VALUES FOR Comparisons or OCTANTS 
3 AND 4 ITEMS ENDORSED BY REPRESSORS (R), 
Expressors (E), SENSITIZERS ( S), AND 
EXPRESSOR-SENSITIZERS (ES) 

FOR PARENTAL AND 
SELF-RATINGS 


Self Mother Father 
RE | 10.10, E>R | 24.90, E>R 6.39, E>R 
RS _ | 11.54, S>R ns 10.15, SSR 
R-ES | 17.83, ES>R | 4.50, ES>R 10.61, ES>R 
E-S ns 13.20, E>S ns 
E-ES ns 8.89, E>ES ns 
S-ES ns ns ns 


Note.—p <,001 =10.00, p <.01 =6.63, p <.05 =3.84. 
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TABLE 3 


MEANS OF CORRESPONDENCE BETWEEN SELF-MOTHER 
(S-M) AND SELF-FATHER (S-F) RATINGS BY 
THE REPRESSORS (R), Expressors (E), 
SENSITIZERS (S), AND EXPRESSOR- 
SensrTIzERS (ES) 


ES 


18.10 


21.33 


expressor and sensitizer subgroups for any of 
his three samples. 

The findings described above suggest marked 
differences in personality characteristics between 
the repressors and the other subgroups. In an 
attempt to determine other possible subgroup 
differences, Table 3 presents the means for the 
four subgroups’ discrepancies between parental 
and self-ratings. Inspection of these data shows 
that of the four subgroups, repressors describe 
themselves most like they describe their parents, 
while expressor-sensitizers’ self-ratings are most 
dissimilar to those of both parents, These find- 
ings further support our interpretation of these 
two subgroups’ differential endorsement of items 
as a response set, since according to Couch and 
Keniston (1960), yeasayers (i.e. our sensi- 
tizers) do not internalize parental edicts; the 
naysayers (i.e, our repressors), on the other 
hand, “completely internalize parental control 
functions, perhaps related to an unusually strong 
identification with the parents.” 

It is also evident that the subjects in the 
repressor and expressor-sensitizer subgroups are 
“closer” to their mothers (i.e. less self- 
mother-discrepancy) than fathers, while there is 
a slight reversal in the expressor and sensitizer 
subgroups. These findings lead to the speculation 
that like-sex identification would foster develop- 
ment of different patterns of ego control than 
opposite-sex identification. 

Analysis of these data reveals significant mean 
differences between the four subgroups: repres- 
sors differ less than the expressors, sensitizers, 
and expressor-sensitizers on the self-mother- 
discrepancies (t=3.73, 4.66, and 4.67, respec- 
tively; p<.01 in all cases), and repressors 
differ less than expressor-sensitizers on the self- 
father-discrepancies (¢ = 4.12, p<.01). 

These results, along with those of Table 2, 
Suggest that the attribution of hostility to par- 
ents by the various subgroups is related to the 
discrepancy between the self- and respective 
Parental ratings. On the father ratings, for ex- 
ample, the significant differences between the 
repressors and other three subgroups range in 
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increasing magnitude from expressors to expres- 
sor-sensitizers (cf. Table 2). A similar pattern 
is evident on the self-father-discrepancy means 
presented in Table 3; together, these findings 
demonstrate that the more disidentified all sub- 
jects are with fathers, the more hostility is 
attributed to the fathers. Similar relationships 
hold between the chi-square results of mother 
ratings and self-mother-discrepancy means, par- 
ticularly in regard to the expressor-sensitizer 
groups. 

Our data raise interesting considerations for 
future research. It is not clear whether the re- 
sults found for attribution of hostility to others 
are related uniquely to parents or would also 
be found using other authority figures. Also, 
presumably the degree of acquaintance of subject 
to ratee (as Altrocchi implied) as well as the 
nature of the relationship between subject and 
ratee (e.g., both members of some peer group) 
would affect attribution of hostility to others. A 
future study involving ratings of self and peers 
obtained from unmarried pregnant girls at 
monthly intervals during the last 3 months of 
pregnancy will investigate various aspects of 
this problem. 
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DIRECT AND VICARIOUS EFFECTS OF A SHIFT IN 
MAGNITUDE OF REWARD ON PERFORMANCE + 


JAMES L. BRUNING? 
Ohio University 


The design of the experiment permitted examination of the effects on a lever- 
movement response of: (a) differential magnitude of reward during acquisi- 
tion, (b) a subsequent shift in reward magnitude, (c) observation of a model 
S's performance on the later performance of the observer, and, (d) the presence 
of an observer on the performance of the model. During the acquisition phase 
of the experiment, Ss who received small reward (1 M & M per trial) performed 
significantly faster than Ss who received a larger amount (5 M & Ms per trial). 
The presence of an observer was found to have no effect on the response 
level of the model.. Following the shift in reward magnitude, the mean speeds 
of Ss changed from large to small reward increased while the response level of 
Ss changed from small to large reward decreased. No difference was noted 
between the mean speeds of Ss who actually performed the acquisition trials 


and those who merely observed the performance of a model, 


In the present investigation several variables 
are considered as they affect the lever-movement 
speeds of kindergarten children. The basic de- 
sign of the experiment involves, first, an examina- 
tion of performance during acquisition as a func- 
tion of different magnitudes of reward. Following 
this series of acquisition trials, a shift in magni- 
tude is initiated whereby some of the children 
receive an increased quantity per trial, while 
others receive a decreased amount. 

In a variation of this design, a model subject 
responds during acquisition while an observer 
subject merely watches the model’s performance 
and receipt of reward. After this period of ob- 
servation, the observer subject then performs, 
but a shift in magnitude is initiated so that an 
increase or decrease, relative to the amount dis- 
pensed to the model, is experienced. 

Four general areas of research serve as the 
background for this investigation. On the basis 
of these previous studies, several hypotheses rele- 
vant to the variables under consideration in the 
present study, are suggested. 


1 This article is based on a dissertation submitted 
to the Graduate College of the State University of 
Iowa in partial fulfillment of the degree of Doctor 
of Philosophy. This research was conducted while 
the author was a predoctoral fellow under the Na- 
tional Defense Education Act. It was supported in 
part by National Science Foundation Grant G-16011 
under the direction of Milton E. Rosenbaum to 
whom appreciation is expressed for his advice and 
assistance throughout the course of the investigation. 

2 Thanks are due the Iowa City Board of Educa- 
tion and Superintendent of Schools Buford Garner 
for permission to obtain subjects from the public 
school system, 
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1. Differential magnitude of reward during ac- 
quisition. Relatively few experiments with human 
subjects have been reported in which magnitude 
of reward was varied and speed was used as a 
performance measure. In those available (Blank, q 
1958; Bruning, 1964; Weinstock, 1962), varied 
quantity of reward was found to have no dif- 
ferential effect on response speed. Thus, đe- 
spite numerous findings for subhuman organisms 
(Pubols, 1960), no difference between the termi- 
nal acquisition levels of any of the experimental 
groups is predicted to result as a function of 
varied magnitude of reward. 

2. Performance in the presence of a passive 
audience. In a summary of the relevant litera- 4 
ture, Kelley and Thibaut (1954) indicate that ` 
quantity of work initially increases when a pas- 
sive audience is present, but that the effects of 
the audience wear off as the individual adapts to 
the social situation. On the basis of these’ find- 
ings, it is predicted that in the conditions which 
include a model and observer, the effect of the 
Passive observer will be to initially facilitate the 
response speeds of the model. However, this su- 
periority is expected to gradually disappear as the 
number of trials increases and the model subject 
becomes accustomed to the presence of the ob- 
server. 

3. Shift in reward magnitude. While no data 
are available in which reward quantity was varied 
following acquisition training, several studies 
have been reported which employed a shift in 
reward quality (Heber, 1959; O’Connor & Cla- 
ridge, 1958; Stevenson & Snyder, 1960). Consist- 
ent with the findings of these investigations and 
the several animal studies which have employed 
a magnitude shift (Pubols, 1960), it is predicted 
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that an increase in amount received per trial will 
effect an increment in mean speeds while a de- 
crease in magnitude will result in a speed decre- 
ment. 

4. Performance following observation. Two 
studies have been reported in which subjects per- 
formed after watching the performance of a 
model who experienced differential reinforcement 
(Lewis & Duncan, 1958; Rosenbaum & Bruning, 
1963 *), Lewis and Duncan found the resistance 
to extinction shown by the observer approxi- 
mated the usual partial reinforcement effect, but 
this effect occurred only after the observér had 
received reward each time the model was re- 
warded. In contrast, Rosenbaum and Bruning 
found that mere observation of the model sub- 
ject’s responses and reinforcement pattern was 
sufficient to induce behavioral changes in the ob- 
server similar to those induced in the model. 
Since the apparatus and subjects employed in the 
present investigation are more similar to those 
of the Rosenbaum and Bruning study, it is hy- 
pothesized that observation will induce behav- 
ioral changes in the observer which are the same 
as if he had performed the initial trials himself. 
Consequently, the shift in magnitude is predicted 
to have the same effect on subjects in both the 
“Alone” and “model observer” conditions. 


METHOD AND PROCEDURE 
Subjects 


A total of 144 subjects were drawn from kinder- 
garten classes of the Iowa City Public School Sys- 
tem. Twelve subjects were randomly assigned to each 
of the four conditions where the same subject per- 
formed both phases of the experiment and 24 sub- 
jects were assigned to each of the four conditions 
where an observer performed after watching a model 
respond during the initial trials, Each condition con- 
tained an equal number of males and females and 
the model and observer were always of the same sex. 


Abparatus 


The apparatus has been described in some detail 
elsewhere (Bruning, 1964; Rosenbaum & Bruning, 
1963 8). Briefly, however, it consisted of the follow- 
Ing parts: a single lever which was moved horizon- 
tally from right to left; a retractable aluminum bar 
which served as the start signal; a faceted, red light 
and buzzer to orient the subject toward the lever; 
and a goal box into which the reinforcement was 


“automatically dropped. The entire face of the ap- 


Paratus was painted a flat grey. 
Procedure 


The subjects were brought from their kindergarten 
Class into the experimental room and the apparatus 
——————— 


° Unpublished manuscript entitled “Direct and 
Marious Experience of Variations in Percentage of 
einforcement.” 


was introduced as a candy machine. In the four con- 
ditions where a single subject performed both the 
acquisition and postshift phases, the same basic pro- 
cedure was employed as has been described previ- 
ously (Bruning, 1964). Each of the 48 subjects in 
these conditions was given 30 acquisition trials dur- 
ing which he received either one or five M & Ms 
per trial. Following these first 30 trials, all 48 sub- 
jects received an additional 30 trials, In the two ex- 
perimental conditions, the 12 subjects who received 
a single candy per trial during acquisition were 
shifted to five and, similarly, the 12 subjects who 
received five per trial initially were shifted to a 
single piece. In the two control conditions, 12 sub- 
jects were given five candies per trial throughout 
the experiment and 12 subjects received only a single 
piece. The subjects in all conditions were instructed 
not to eat any of the candies during the experi- 
mental session, but to place them in a container be- 
side the goal box, All subjects were told they could 
keep the candy earned during the session. 

In the four conditions which employed a model 
and observer, the model was told to stand in front 
of the apparatus and inside of a circle marked on 
the floor. The observer was instructed to stand 
quietly to the left of the model and watch closely 
to learn how to operate the machine, The model sub- 
ject was given a single practice trial after which the 
experimenter went behind a shield, out of sight of 
both subjects, and the experimental session began. 

Either one or five M & Ms was given to the 48 
model subjects for each of the 30 trials which they 
performed. After these trials, each model was given 
the candy which he had earned and was sent back 
to the classroom. The observer was then given his 
opportunity to perform for 30 trials. 

The 24 observer subjects in the experimental 
groups received an amount of reward different from 
that received by the model. If the model received 
five pieces per trial, the 12 observers then received 
only one piece, and similarly, if the model received 
a single piece, the 12 observers then received five 
when they performed. 

The amount of reward received by the 24 subjects 
in the “observer-control” conditions was the same 
as that received by the model. Hence, if the model 
received five pieces, the 12 observers also received 
five and, similarly, if the model received a single 
piece, the 12 observers then received a single piece. 


REsuLTS 


The two measures of response latency* for 
each trial were transformed into speeds of 1/time 


4Measures for the first 6 inches (starting speeds) 
and the next 9 inches (movement speed) of the range 
of movement were recorded. Only the movement 
speeds will be presented since many subjects failed 
to orient to the manipulanda at the onset “ready” 
signals and hence, did not respond immediately to 
the retraction of the restraining bar which signaled 
the start of a trial. This resulted in the inclusion of 
excessive latencies in the starting measures which 
probably masked any effects resulting from ma- 
nipulation of the variables. 
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X 100, These were then combined for analysis 
and graphing purposes into blocks of six trials 
each, Figure 1 represents the mean movement 
speeds of the subjects during the acquisition 
phase of the experiment. Figure 2 presents the 
mean speeds following the initiation of the mag- 
nitude shift, In this figure, Point 5 refers to the 
block of trials just prior to the magnitude shift. 


Acquisition Phase 


From Figure 1 it can be seen that throughout 
the acquisition phase of the experiment, the mean 
speeds of the subjects who received a smaller re- 
ward were faster than the speeds of subjects who 
received a larger amount. This difference in over- 
all mean speed was sufficient to attain significance 
beyond the .05 level (F = 4.34, df = 1/88). In 
addition to the general performance superiority 
of the subjects who received a smaller reward, 
a gradual divergence of the performance curves 
of the two reward groups was also evidenced. 
This was reflected in a significant Trials X Mag- 
ane interaction (p < .01, F=3.55, df=4/ 
352). 

The presence of an observer in the model-ob- 
server conditions was not found to have any fa- 
cilitative effect on performance during the initial 
trials of the experiment. The failure of this vari- 
able to affect performance was reflected by the 
nonsignificance of the Trials X Social Condition 
interaction (F <1, df = 4/352). However, while 
no systematic variation over trials was noted, the 
presence of an observer was found to facilitate 
the overall mean response speeds of the female 
subjects, but to inhibit the overall performance 
of the male subjects. This differential effect re- 

sulted in a significant Sex X Social Condition in- 
teraction (p <.025, F=6.23, df=1/88). All 
other statistical tests were found to be nonsig- 
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nificant with the exception of a simple trials ef- 
fect (p <.001, F=7.69, df = 4/352). 


Postshift Phase 


A shift in reward magnitude was found to have 
essentially the same effect on subjects who per- 
formed both phases of the experiment and those’ 
who merely observed a model during acquisition. 
This similarity in postshift response speeds was 
reflected by the nonsignificance of the Trials X 
Shift x Social Condition interaction (F <1, df 
= 5/390). 

As’ can be seen from the performance curves 
presented in Figure 2, considerable systematic 
variation did occur as a function of the shift in 
reward magnitude. This interactive effect was 
substantiated by the statistical analysis which 
indicated the Trials X Magnitude x Shift inter- 
action to be significant beyond the .025 level 
(F = 3.60, df = 5/390). A further breakdown of 
this interaction indicated that it was due to an 
increase of the mean speeds of the subjects 
shifted from large to small reward (p <.001, 
F = 38.07, df = 5/390), coupled with a decline 
in mean speed for the subjects shifted from 
small to large reward (p < .001, F = 18.25, df = 
5/390). The curves for the subjects in the two 
control conditions showed no systematic varia- 
tion over trials (F<1, df = 5/390; F=2.20, 
df = 5/390). 

In addition to the above effects, one other in- 
teraction was found to be marginally significant, 
that of Trials x Sex x Shift X Social Conditions 
(F = 2.50, df = 5/390). Further examination of 
this interaction revealed that it was due to a 
significant Trials X Shift X Sex interaction in 
the observer condition only (p < .005, F = 3.77, 
df = 5/390). 
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Discussion 


It is apparent that Hypothesis 1 must be re- 
jected since the mean speeds of the subjects who 


* received a small reward were significantly faster 


than the speeds of the subjects given a larger 
amount, The speed increase exhibited by the 
small-reward subjects appears to be largely due 
to a learning effect which occurred as a func- 
tion of trials. However, no readily apparent ex- 
planation can be offered for the lack of improve- 
ment and generally inferior performance of the 
large-reward subjects. A possible interpretation 
is in terms of satiation effects which resulted 
from the rather sizable amount of candy re- 
ceived on each trial. While no candy was actually 
eaten during the experiment, the incentive value 
may have declined as the amount obtained in- 
creased and thus canceled any learning effects. 

The results of the postshift phase of the experi- 
ment indicate that the effects on performance of 
a magnitude reversal were essentially opposite 
those predicted. Hence, while performance tends 
to be positively related to quality of reward 
(Heber, 1959; O'Connor & Claridge, 1958; 
Stevenson & Snyder, 1960), an inverse relation 
appears to be true when quantity is varied. 

A reasonable explanation for the speed increases 
noted for the subjects shifted from large to small 
reward is in terms of a frustration effect. If the 
subjects formed an expectancy of a large amount 
of reward during the acquisition phase (either by 
observing or performing), then any lesser amount 
would have resulted in frustration which added 
to their general motivational level (Brown & 
Farber, 1951). Hence, the responses elicited 
when frustration was aroused would be expected 
to be more intense than those elicited when 
frustration was absent. 

A combination of two factors appears to have 
affected the general decline in performance of the 
subjects shifted from small to large reward. First, 
the incentive value of the candy may have de- 
clined as a result of the same type of satiation 
effects suggested previously to account for the 
inferior performance of the large-reward subjects 
during acquisition. Second, any heightened moti- 
vation due to novelty factors would likely have 
dissipated since all subjects had either operated 
ot had observed the operation of the apparatus 
during the acquisition phase of the experiment. 
Hence, not only had the subjects in this group 
Previously received or observed the receipt of 
candy, but also, they had become accustomed to 

machine which dispensed candy whenever 
they moved the lever. 

General support was given to the final hypoth- 
esis that the same changes occur in the response 


repertory of the subject whether he actually 
performs the initial trials or merely observes the 
performance of another. In both the “alone” and 
the “model-observer” conditions, those subjects 
who experienced a magnitude shift exhibited 
changes in response speeds, while the subject who 
did not experience a shift continued to respond 
at the terminal acquisition rate. 

This finding lends support to the theoretical 
notions of deCharms and Rosenbaum (1960) 
concerning vicarious experience. According to 
these writers, the designation of vicariousness in- 
volves exposure of the observer and model to the 
same stimulus, observation of a response to the 
stimulus situation by the model, and finally, 
demonstration of the same or similar modification 
of the observer’s response repertory as occurred 
in the repertory of the model. deCharms and 
Rosenbaum also note that in the vicarious situ- 
ation, it is necessary that the observer not re- 
spond overtly to the presented stimuli, but 
rather, that he only make implicit responses to 
the stimulus situation. 

In the present investigation the use of a pas- 
sive observer during the initial trials appears to 
satisfy all of the criteria necessary for demon- 
stration that the changes which occurred in the 
response repertory of the observer during acqui- 
sition were, in fact, the same as if he had ac- 
tually performed. Thus, the results of this study 
indicate that the reward of one subject affects 
not only his expectancies and future behavior, 
but also, the expectancies and behavior patterns 
of observers who are present in the situation. 
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EFFECT OF SOCIOENVIRONMENTAL THERAPY ON 
AWARENESS OF OTHERS IN CHRONIC PSYCHOSIS * 


ROBERT S. SMITH, RICHARD SANDERS, AARON SMITH,? ano BERNARD WEINMAN 
Philadelphia State Hospital 


The effects of 3 socioenvironmental treatment programs on awareness of others 
were examined within the context of a rehabilitation program for chronic psy- 
chotic patients. The criterion of awareness was the ability to identify fellow 
patients from recent photographs. Awareness was considered an early stage 
in the process of resocialization. Best results were obtained when patients were 
required to participate in an intensified group interaction program. Changes in 
the treatment milieu which were not accompanied by direct pressure to inter- e 
act were relatively ineffectual. In the 2 treatment programs which involved 
participation in group activities, qualitative aspects of group experience seemed 
of greater importance than the number of hours spent in group activity in 


determining the results. 


The use of socioenvironmental therapy as an 
approach to the treatment of chronic psychosis 
is based on the assumption that social isolation 
is an integral part of the primary illness. More 
specifically, it is assumed that the psychotic 
patient ayoids interaction with others primarily 
because such contacts have become progressively 
unrewarding. By the time the patient reaches a 
back ward in the typical state hospital, the 
problem is not that he is unwilling to enter into 
meaningful relationships, but rather that he is 
unable to do so. In theory, if the patient can be 
helped to become more effective in his relation- 
ships with others, he might have less need to 


1 The authors wish to acknowledge the contribu- 
tion of B. J. Fitzgerald and Janice Kenny to this 
research through their participation in the develop- 
ment and implementation of the project on which 
it is based. We also acknowledge our debt to Albert 
Pepitone of the University of Pennsylvania for 
guidance on theoretical and technical problems. 

This research was supported, in part, by Project 
Grant OM 126 from the National Institute of Men- 
tal Health, United States Public Health Service. 

2Now at Haverford State Hospital, Haverford, 
Pennsylvania. 


retreat from reality through such autistic mech- 
anisms as delusions and hallucinations. Thus a 
direct attack on the social symptomatology of 
chronic mental illness should eventuate in gen- 
eralized improvement in the- patient’s overall 
psychiatric condition. 

Socialization is a process which has its incep- 
tion in the differentiation between self and other. 
Before one can engage in meaningful interaction 
with another, one must first be aware of the 
other’s identity as a person—as a human being 
distinguishable in at least some respects from all 
other human beings. Awareness, then, is a pre- 
requisite for interaction. Therefore, it seems rea- 
sonable to expect that progress in the direction 
of resocialization should be accompanied by in- 
creased awareness of others. The purpose of this 
paper is to examine the effects of three socio- 
environmental treatment programs on awareness 
of others, as measured by the patient’s ability 
to identify fellow patients from recent photo- 
graphs. 

METHOD 


The Project which forms the basis for the present 
study was initiated at the Philadelphia State Hos- 
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TABLE 1 


TREATMENT Units, NATURE OF LIVING SITUATION, AND CONTENT OF THERAPY PROGRAM ASSOCIATED 
WITH EACH TREATMENT CONDITION 


Treatment 
condition 


Living situation 


Content of therapy program 


Maximalstructure| A E 
ment 


Partial structure B F | Group living 


Minimal structure} C G | Group living 


Group living with patient govern- 


Group therapy 

Interaction program 

Individual work assignment 
Cottage meeting 

Conference with unit administrator 


Interaction program 

Individual work assignment 
Cottage meeting 

Conference with unit administrator 


Individual work assignment 
Cottage meeting 
Conference with unit administrator 


pital in the spring of 1957. A Rehabilitation Center 
was established in an area of the hospital which 
had previously been used to house attendant per- 
sonnel, Availability of these facilities made possible 
development of a living situation which differed 
radically from the environment of the average 
hospital ward. The essential features of this “thera- 
peutic community” were increased privacy, more 
freedom of movement, and greater assumption of 
responsibility on the part of the patient. Male pa- 
tients were assigned individual rooms in three small 
cottage-type buildings. There were 20 patients in 
each cottage. Female patients were housed in a 
somewhat larger building, the ground floor of which 
was devoted to office space for project personnel. The 
two upper floors afforded sleeping quarters for 36 
patients, There was no resident supervision on the 
male cottages. In the female building, an attendant 
was on duty during the day and evening shifts." 

Within the context of this altered milieu, three 
separate treatment conditions were established. 
These conditions differed primarily in the extent to 
which the social living situation and therapeutic 
program were structured to promote interaction. 
Treatment units were organized for both male and 
female patients, as shown in Table 1. 

In the minimally structured treatment condition, 
neither the living situation nor the therapeutic 
program was structured to elicit interaction. The 
treatment program consisted of an assigned job in 
the hospital industry system, a weekly cottage 
meeting held primarily for administrative purposes, 
and individual conferences with the unit adminis- 
trator (psychologist). While there was ample oppor- 
tunity for the development of interaction within 
the cottage living unit, no direct pressure was 
applied to insure that this would take place. 

In the partially structured treatment condition, 
the hospital industry assignment was reduced to 
half a day. In the time thus made available, patients 
— 


For a more complete description of this project, 
See Sanders, Weinman, Smith, Smith, Kenny, and 
Fitzgerald (1962). 


attended a number of group activities in which the 
primary requirement was for interaction among the 
participants. The content of these activities ranged 
from performance tasks such as bowling or working 
on a model train layout, to discussion groups deal- 
ing with such topics as job hunting, good nutrition, 
transportation facilities, and community resources. 
There were seven activities in all. Patients attended 
each activity once weekly, the typical session lasting 
an hour to an hour and a half. Thus each patient 
received approximately 10 hours of programed 
group activity per week. The group Jeaders were 
instructed to stimulate interaction among the mem- 
bers of the group and to selectively reinforce socially 
adaptive behavior. 

Activity groups were organized on a coeduca~ 
tional basis in a manner such that one fourth of the 
patients from Treatment Units A, B, E, and F were 
included in each activity group. Once assigned to an 
activity group schedule, a patient attended all seven 
activities in the company of the same group of 
patients. Thus each patient was systematically ex- 
posed to repeated contact with one fourth of the 
patients from three other treatment units. 

In the maximally structured treatment condition, 
both the living situation and the therapeutic program 
were structured to promote interaction. Patients 
were required to attend the program of group ac- 
tivities just described and, in addition, attended two 
group therapy sessions each week. The social living 
situation was structured to promote interaction 
through the introduction of patient government. 
Interaction was achieved by defining the responsi- 
bilities associated with each office in a manner such 
that contact with other patients was a virtual neces- 
sity, and by manipulating the rules concerning elec- 
tion to office to guarantee maximum rotation of pa- 
tients through offices. The unit administrator helped 
the patient to understand and carry out his responsi- 
bilities in an effective manner. 

Patients were selected for the project by screening 
teams composed of two psychologists from the 
project staff. These teams systematically surveyed 
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TABLE 2 


COMPARISONS BETWEEN TREATMENT GROUPS AT 
INITIAL TESTING : NAMING OF Own UNIT 


M photos 


Treat, | N | identifed | Comparisons | £ 
Maximal (A) | 24 4.33 A versus B | 2.30* 
Partial (B) 24 3.00 A versus C | 3.49** 
Minimal (C) | 19 2.42 B versus C | .94 


* p =.025, one-tailed test. 
** p =.01, one-tailed test. 


every building in the hospital. Referrals were solic- 
ited from staff at all levels, and from patients as 
well. Ultimately, however, it was necessary to rely 
on the screening teams to provide a steady flow of 
appropriate patients. Patients selected for the proj- 
ect were those who exhibited symptoms of apathy, 
social withdrawal, or excessive dependence, in con- 
junction with manifestations of a psychotic process. 

Male patients selected for the project ranged in 
age from 19 to 72 years, the average being 45 years. 
Seventy-two percent had never been married. Thirty- 
nine percent had completed high school. The median 
length of hospitalization was 8.6 years, with a 
range of 1 to 36 years. Female patients were very 
similar to male patients in all respects save two: 
marital status and length of hospitalization. Only 
35% of the females had never been married. The 
median length of hospitalization was 7 years. 

All treatment units were open-ended, with new 
patients being transferred in whenever a vacancy 
occurred, Incoming patients were assigned to one of 
the three treatment conditions by a member of the 
research staff in a manner such that all treatment 
units were continuously equated for age, diagnosis, 
and length of hospitalization. 

Approximately 2-4 weeks after his or her transfer 
to the rehabilitation project, and again immediately 
prior to leaving on completion of the treatment 
program, each patient was presented with a series of 
33 photographs of patients then in residence on the 
various treatment units and asked to identify them 
by name. The series was composed of one third of 
the total complement of each of the six treatment 
units involved, that is, seven from each male unit 
and four from each female unit. Photographs of 
patients who had left the project were removed 
from the pack, and those of new arrivals added as 


TABLE 3 


COMPARISONS BETWEEN TREATMENT Groups AT POST- 
PROGRAM TESTING : NAMING OF Own UNIT 


Treatment N M photos 


unit identified | Comparisons | 1 
Maximal (A)| 24 6.38 A versus B | 1.73* 
Partial (B) | 24 5.67 A versus C | 2.80** 
Minimal (C) | 19 5.16 B versus C | 1.17 


*p = .05, one-tailed test. 
**p = .01, one-tailed test. 
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they became available. The test sample was ran- 
domly redrawn at intervals to insure that the sample 
was both current and representative. The test was 
scored by crediting the patient with one point for 
each photograph correctly identified. Either first or 
last name was acceptable. 

Although data were obtained from both male and 
female patients, the data reported on here will be 
that for male patients only. The physical facilities on 
the female building were such that partial confound- 
ing of the treatment conditions was unavoidable.‘ 

The experimental design previously described 
allows for three sets of predictions. The first has to 
do with the patients ability to identify other pa- 
tients within his own living group. Since the treat- 
ment programs were ongoing, each new patient 
experienced the full impact of whatever degree of 
interaction was characteristic of his living group 
from the very first day of residence, As the maxi- 
mally structured treatment condition was the only 
one in which the living situation had been struc- 
tured to promote greater interaction, one would 
expect patients assigned to Group A to be more 
aware of the members of their own cottage than 
patients assigned to Groups B and C, 

Scheduling problems made it impossible to test all 
patients at exactly the same interval after transfer 
to the project. This introduced the possibility that 
differences in the number of days in residence prior 
to testing might influence the initial group means. 
In order to determine whether an analysis of co- 
variance would be required, the correlation between 
number of days in residence and number of patients 
correctly identified was computed. This correlation 
proved to be —.01 for 67 subjects, indicating that 
exposure time is not a factor in determining aware- 
ness, 


RESULTS 


Table 2 shows the mean number of photo- 
graphs of fellow cottage residents identified by 
the patients in Groups A, B, and C at the initial 
testing. The number of photographs in the test 
sample was seven. The overall F ratio for these 
data is 5.66, which, for df = 2/63, is significant 


4Since the three cottages available to male pa- 
tients were physically separate, it was possible to 
assign all patients in a given cottage to the same 
treatment condition, Thus, for male patients, one 
could manipulate both the treatment program and 
the character of the living situation independently. 
Female patients, on the other hand, shared the same 
lounge and lavatory facilities. Moreover, they fre- 
quently slept two or three to a room, whereas 
male patients had individual sleeping quarters. An 
attempt was made to establish discrete treatment 
units comparable to those in the male cottages by 
assigning each unit to a different area of the build- 
ing; however, this met with limited success. Oppor- 
tunities for spontaneous interaction, both within 
and between treatment units, were greater for 
female patients throughout the period of the exf zri- 
ment. 
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4 TABLE 4 
x COMPARISONS BETWEEN TREATMENT GROUPS AT POSTPROGRAM TESTING : NAMING OF PATIENTS IN 
4 OTHER MALE UNITS 
; Z 
+ Treatment unit | N pests sh yar sie hy Rea ea Comparisons t 

Maximal (A) 24 2.42 6.29 A versus B 30 
Partial (B) 24 2.96 6.06 (A versus C) (2.49)** 
Minimal (C) | 19 1.89 4.29 (B versus C) (2.18)* 


* p = 025, one-tailed test. 
** > = .01, one-tailed test. 


beyond the .01 level. The comparisons between 
individual treatment units show that the predic- 
tions regarding the effect of structuring the social 
living situation were confirmed. 

These results suggest that patient government 

_ was an important factor in shaping the social 
` structure of the primary living group, at least 
during the first few weeks of residence. Theo- 
retically, this influence should continue through- 
out the patient’s stay on the unit, and thus be 
* apparent in the postprogram means as well. How- 
ever, this expectation is tempered by the fact 
that there were only seven pictures in the test 
sample. Since some patients would have been 
able to identify more than seven photographs, 
giveñ the opportunity, the postprogram means 
obviously suffer to some extent from the presence 
of a “ceiling effect.” This effect is most pro- 
nounced for Group A, the group having the 
‘largest number of “perfect” scores. 

Table 3 shows the mean number of photo- 
graphs identified at the postprogram testing. The 
overall F ratio for these data is 4.00, which, for 
df= 2/63, is significant beyond the .025 level. 
Despite the presence of the “ceiling effect,” the 
Comparisons between individual pairs of treat- 
Ment units which were significant at the initial 

_ testing’ remain significant. Again, a zero-order 
Correlation was obtained between time in treat- 

_ Ment and number of photographs identified. 
__,, The remaining predictions have to do with 
| Wentification of patients in treatment units other 
_ than the patient’s own living group. From the 


description of the experimental design, it is clear 
that certain differences would be expected simply 
on the basis of unequal contact between treat- 
ment units, due to the different requirements of 
the three experimental conditions. Failure to 
obtain such differences would mean that in- 
creased contact with others is ineffectual in re- 
ducing social isolation, and would thus be quite 
damaging to the case for socioenvironmental 
therapy. Comparisons which were expected to be 
significant purely on the basis of unequal pro- 
gramed contact between treatment units are 
identified by parentheses in the tables which 
follow. In this instance, the data must be exam- 
ined not so much for the presence of significant 
differences, as for their absence. Aside from the 
factor of differential exposure, it was hypothe- 
sized that awareness of patients in other treat- 
ment units would increase in direct relationship 
to the extent to which the social living situation 
and treatment program were structured to pro- 
mote interaction. 

Since the problem of reestablishing normal 
channels of social intercourse between male and 
female patients is of particular interest in a 
state hospital setting, the data concerning gen- 
eralization of awareness were analyzed separately 
for like-sex and opposite-sex criterion groups. 
Analysis of covariance was employed to insure 
that differences observed in postprogram means 
actually arose during exposure to the treatment 
conditions. Since a genuine preprogram base-line 
measure was not available, the scores obtained 


TABLE 5 
CoMPARISONS BETWEEN TREATMENT GROUPS AT PosTPROGRAM TESTING: NAMING OF PATIENTS 


1N FEMALE UNITS 


—_ 
Treatment unit N M Lilie he ek pera M Comparisons t 
Maximal ï 3.14 A versus B 1.75* 
Partial (By 34 & 230 Gv | giene 
| Minimal (C) 19 AT 1.19 (B versus C) 2.18) 


as? 7-05, one-tailed test. 
+? = -025, one-tailed test. 
? = .01, one-tailed test, 
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TABLE 6 


COMPARISONS BETWEEN TREATMENT GROUPS AT POST- 
PROGRAM TESTING: NAMING OF PATIENTS IN 


Group G 
Number named 
Treatment Comparisons x? af 
unit 
o1 | 2-4 
Maximal (A)| 14 10 | A versus B | 3.80*| 1 
Partial (B) 21 3 | Aversus C | 3.40*| 1 
Minimal (C) | 16 3 |BversusC| .02 | 1 


* > = .05, one-tailed test. 


during the first month on the unit were used as 
the control variable. The effect of this procedure 
is to make the test a more conservative one, as 
any advantage gained during the first 2-4 weeks 
of residence is thereby discounted. 

Table 4 shows the mean number of like-sex 
photographs of patients in other treatment units 
identified at the postprogram testing. There were 
14 photographs in the test sample. The overall 
F ratio for these data is 3.60, which, for df = 
2/63, is significant beyond the .05 level. In this 
instance, the predicted difference between Groups 
A and B failed to materialize. However, the 
differences based on unequal exposure to the 
test sample were significant, as expected. Since 
Group A had no advantage over Group B at the 
initial testing, the failure to obtain a difference 
at the postprogram testing cannot be attributed 
to the conservative nature of the test. 

Table 5 shows the mean number of opposite- 
sex photographs identified at the postprogram 
testing. There were eight photographs in the test 
sample. The overall F ratio for the data in Table 
5 is 7.20, which, for df = 2/63, is significant 
beyond the .01 level. In this instance, all predic- 
tions were confirmed. 

There is only one possible comparison in which 
Groups A, B, and C can be compared simul- 
taneously on a test sample with which all three 
have had equal programed contact. The criterion 
group in this instance is Group G, the female 
minimally structured treatment condition. This 
group was represented in the test sample by only 
four photographs. Fortunately, awareness of 
members of Group G at the initial testing was 
practically nil. Only 12 of the 67 patients were 
able to identify anyone from Group G, and 10 
of these could identify only one photograph. 
These 12 patients were equally distributed among 
Groups A, B, and C, a circumstance which made 
it possible to use the postprogram scores without 
reference to an initial base-line measure. 

Accordingly, the postprogram scores on the 
Group G subsample were dichotomized into 
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scores of 0 or 1, and scores of 2 or more. This ` 


division was employed to insure that patients 
who were able to identify one photograph at the 
initial testing would not automatically be counted 
as “successes” in the postprogram scoring with- 
out having learned any additional names. The 
data were analyzed by chi-square. 

Table 6 shows the results of this analysis, The 
overall chi-square for these data is 6.57, which, 
for df=2, is significant beyond the .05 level. 


The comparisons between individual treatment ' 


units show the maximally structured treatment 
condition to be superior to both the partially 
structured and minimally structured treatment 
conditions, whereas the difference between the 
latter two treatment conditions is not significant. 


Discussion 


Comparison of Tables 2 and 3 and examina- 


tion of Tables 4 and 5 reveal that all three 
treatment conditions resulted in increased aware- 


ness of others over a period of time. However, 


it seems clear that the maximally structured 
treatment condition was more effective than 
the partially structured or minimally structured 
treatment conditions in this respect. 

With but two exceptions, all predictions con- 
cerning the effect of structure on awareness of 
others were confirmed. The superiority of Group 
A in naming of patients of the opposite sex is 
probably attributable to the effects of group. 
therapy. This additional programed experience 
with female patients in a very personal, problem- 
oriented situation, may have enabled the mem- 
bers of Group A to initiate more contacts with 
other female patients. However, neither group 
therapy nor patient government proved strong 
enough to give Group A the edge in identifying 
patients of the same sex. Thus the socializing 


effects of patient government appear to be con- ' 


fined to the patient’s own living group. 

The failure of Group B to surpass Group C 
in naming of patients of the opposite sex was 
surprising. With respect to amount of programed 
contact between the sexes, the difference be- 
tween Groups B and C was clearly greater than 
that between Groups A and B. But Group B's 
extensive experience with female patients in the 
group activity program was apparently no help at 
all in promoting generalization of awareness to 
an outside criterion group. This could mean that 
the quality of programed interaction is much 
more important than sheer quantity, insofar aS 
influencing awareness of others is concerned. In 
retrospect, the data suggest that the important 
qualitative dimension may be a person-centered, 
as opposed to a task-centered, interaction. 


nw 
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PREFERENCE FOR DISSONANT INFORMATION * 


JONATHAN L. FREEDMAN 
Stanford University 


Ss heard a tape-recorded interview which was designed to make the interviewee 


sound either very well qualified for an 
qualified. Ss rated the interviewee and w 
evaluation of him that agreed with their rating or one that disagreed 
17 chose the evaluation that disagreed with their rating. A number 


Of 18 Ss, 


international conference or very poorly 


ere then given a choice of reading an 
with it. 


of possible explanations of this result are offered, and supplementary data are 


presented to support them. 


If people are given a choice of exposing them- 
selves to consonant or dissonant information, 
which will they prefer? A number of studies have 
investigated this problem (Adams, 1961; Brod- 
beck, 1956; Mills, Aronson, & Robinson, 1959; 
Rosen, 1961; Sears & Freedman, 1963, 1965), 
and this work has been generally interpreted as 
demonstrating a preference for consonant infor- 
mation. As Steiner (1962) recently pointed out, 


„however, none of these studies has produced 


unequivocal results. The weakness and ambiguity 
in the findings suggest that subjects’ preference 
may depend to a large extent on the particular 
experimental situation employed. This depend- 
ence appears to be well illustrated in the present 
study, which was designed as one more attempt 
to demonstrate the expected preference for 
consonant information but which, as will be seen, 


` produced very strong results in the opposite 


direction, Since the explanation of this finding 
which will be offered is admittedly post hoc, it 
is more appropriate to present the experiment 
first and the explanation later rather than at- 


. tempting to suggest that the results were ex- 


pected, 


METHOD 


The general design was to have subjects make a 
decision and then give them a choice between read- 
ing a consonant or dissonant communication. The 
~ 


1This study was supported, in part, by Grant 

S-196 from the National Science Foundation. The 
author is grateful to John Steinbruner for assisting 
in the running of the experiment and to David Sears 
with whom many of the ideas and problems in this 
Paper have been discussed. 


subjects were 18 students, 9 men and 9 women, in 
an introductory education course at Stanford Uni- 
versity. They were run individually. The instructions 
informed the subject that the study was an investi- 
gation of person perception and interpersonal judg- 
ment, and the importance of these abilities was 
explained in detail. It was mentioned that the ability 
to evaluate others correctly was a critical factor in 
career success and that it correlated very highly with 
general intelligence. The subject was told that he 
would hear an interview between a student who was 
applying to participate in an overseas conference and 
the person in charge of the conference, and that it 
was the subject’s task to evaluate the student after 
hearing the interview. 

The subject heard either an interview designed to 
make the student sound very good or one designed 
to make him sound very bad. The interview was 
prerecorded on magnetic tape and lasted approxi- 
mately 6 minutes. 

‘After the interview, the experimenter told the 
subject to think carefully about what he had heard 
and then to record his judgments on an evaluation 
form, The major question on this form was: 
“Should the applicant be accepted to participate in 
the conference?” which was answered by placing a 
mark on an 8-point scale labeled “definitely no” at 
one end, and “definitely yes” at the other. The sub- 
ject then rated the applicant on nine scales similar 
to the one just described except that the end points 
were labeled with contrasting adjectives such as 
unfriendly-friendly, intelligent-stupid, etc. These 
additional rating scales were included to make it 
seem more plausible to the subject that we were 
studying interpersonal perception and that this. was 
the only interview he would have to rate. A 

When the evaluation form was completed, the 
experimenter said that we would now like the ‘sub- 

f àluations of the candidate that 


ject to read some ev: : 
had been written by others, that we had two such 
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evaluations written by people who knew him well, 
and that he could read either of these. The experi- 
menter also explained that one of these evaluations 
was quite favorable, while the other was quite un- 
favorable. The subject was then asked which he 
would like to read. His choice was recorded, and the 
experimenter questioned the subject for some time 
concerning why he had made that choice. The 
experiment was terminated, the deception explained, 
and the subject was asked not to tell others about 
the study. 


RESULTS 


The eight subjects who heard the unfavorable 
interview all rated the candidate unfavorably, 
with a mean rating of .75 on a scale ranging from 
O to 8. Nine of the 10 subjects who heard the 
favorable interview rated the candidate favor- 
ably, with a mean of 5.94. None of these 17 
gave a rating that fell within 1 point of the 
middle of the scale. That is, they were all quite 
extreme. One subject who heard the favorable 
interview gave the candidate a rating of 1.5, a 
very unfavorable rating. The results will be 
presented separately for this subject. 

The major results are the preferences shown 
by the subjects between the consonant or dis- 
sonant information. All 17 subjects chose to read 
the dissonant communication. Subjects who had 
rated the candidate favorably chose the unfavor- 
able evaluation; those who had rated him un- 
favorably, chose the favorable evaluation. Al- 
though statistics on such a unanimous choice 
seem superfluous, it is a highly significant effect 
(p < .001, from binomial distribution). 

The one subject who heard the “favorable” 
interview and rated the candidate unfavorably 
chose to read the unfavorable evaluation. This 
was a choice of information consonant with her 
decision but dissonant with the interview, If it 
is considered a choice of consonant information, 
however, it does not change the total result 
ated (p for a 17 and 1 split is still < 
.001). 


Thus, subjects in this situation strongly prefer 
nonsupportive to supportive information. This is 
not simply a lack of preference such as has often 
been reported in the past (eg., Brodbeck’s 
1956, overall finding; Feather, 1963; the nega- 
tive groups in Mills et al., 1959), but a clear 
preference for dissonant information. How may 
this result be explained? 

One explanation is simply that the dissonant 
information was seen as more useful and was, 
therefore, preferred. There is considerable evi- 
dence that the more useful information is, the 
more subjects seek it out (Canon, 1964; Freed- 
man, 1965; Mills et al., 1959). Although 
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subjects in the present study were not told to | 
expect more than one interview and, from the 
amount of time remaining, it must have been 
clear that additional interviews were unlikely, | 
it is possible that they may still have expected 
to have to evaluate other candidates. Under these | 
circumstances it is possible that the dissonant: 
evaluation (i.e the evaluation that disagreed 
with the subject) was expected to contain more | 
new information than the consonant evaluation, | 
and the former was, consequently, expected to | 
be more useful in improving performance on | 
the future decisions. During the question period 
after the choice, only four of the subjects 
choosing the dissonant communication specifically 
mentioned usefulness, utility, or a similar factor | 
in explaining their choice, but this does indicate 
that at least for some subjects information utility 
was an important consideration. E 
Another factor which may have influenced the 
results was the perceived interest of the informa 
tion. Subjects had just gotten a very one-sided” 
picture of the candidate. An evaluation which | 
disagreed with this picture would presumably | 
present a new outlook, possibly new facts, 
certainly a new opinion. Other things being equal, 


picture presented in the interview and the sub 
ject’s own opinion, If the dissonant evaluation 
were seen as potentially more interesting, sub- 
jects would probably tend to prefer it to the 
consonant evaluation. This possibility is giveni 
some support by subjects’ responses during the 
question period. Subjects often said that they 
were “curious about the other side of the 
picture” or that they “wanted to know whati 
someone else thought about this guy.” Of the 17 
subjects choosing to read the dissonant evaluari 
tion, 8 made statements similar to these. 

Although both usefulness and interest of the 
information probably contributed to the prefer- 
ence for dissonant information, the size of the 
effect (as compared with those generally reported 
in related studies) suggests that additional fati 
tors may be operating. Freedman and Seals 
(1963) have offered a rather complex analysis 
of the present problem which appears to 
consistent with the results reported here. 
analysis involves the subject’s confidence in 
position and the extent to which he perceives 
issue as controversial. In most studies on selet- 
tive exposure, the subject takes a position 0. 
an issue and is then asked to rate articles whi! 
either agree or disagree with that position, T 
means that he is presented with the titles 
articles which supposedly disagree with him. i 
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the subject did not previously know that the 
issue was controversial, this in itself should 
arouse dissonance because it essentially tells the 
. subject that a dissonant article exists. Under 
these circumstances the question is not one of 
E avoiding dissonance, but of reducing dissonance 

“that is already present. 

"At this point confidence may be a major 
determinant of the subject’s strategy. If he is 
` very confident that he is correct, it may well be 

“that the best course of action is to seek out 

the dissonant article and attempt to refute it. 

He is confident that he can accomplish this, and 

hé may prefer to try it rather than not looking 

at the article and being uncertain how strong 

the counterarguments are. If the subject has 
= little confidence, however, it should generally be 
‘Safer for him to avoid the dissonant article and 
‘attempt to reduce the dissonance by internal 
processes. Exposure to the article will probably 
increase dissonance even more since he will be 
unable to refute the arguments. Thus, with issues 
not previously known to be controversial, high- 
confident subjects should seek out dissonant 
information; whereas, low-confident subjects 
should prefer consonant information. 

If, ọn the other hand, the subject already 
knows that the issue is highly controversial, the 
knowledge that a dissonant article exists will 
arouse little or no dissonance because the sub- 
ject already knows that such articles must exist. 
| Under these circumstances, it is a question of 

avoiding dissonance and all subjects should tend 
to prefer consonant articles. 

The present situation fits rather well into the 
_. low-controversial, high-confidence cell of this 
analysis, The interviews were quite one-sided and 
Presented a candidate who was either obviously 
|. Very good or obviously very bad. There was evi- 
‘dently some room for doubt since one subject 
did disagree with the others’ judgment; but all 
other subjects gave very similar ratings to their 
respective candidates. Thus, it is reasonable to 
Suppose that subjects saw the decision as rather 
clear-cut with little room for error or disputa- 
tion. In addition, they were probably quite 
confident of their decision. When they were 
told that a dissonant evaluation existed, dis- 
sonance should have been aroused and they 
sought out the dissonant evaluation in an attempt 
to refute it. 

Over half of the subjects choosing the dis- 
sonant evaluation said, in essence, that they 
Were confident of their own decision and could 
hot believe that the other was correct. As Op- 
Posed to this, the one subject who rated the 
candidate inappropriately and who was probably 
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less confident of her decision than were the other 
subjects, chose the consonant evaluation and said 
that she wanted to see if anyone really agreed 
with her point of view. Thus, this analysis seems 
to fit the results fairly well and to be supported 
by statements made by the subjects. It is, how- 
ever, purely speculative at this stage, and an 
experiment designed to test it explicitly is needed 
before it can be adequately assessed. 

The one point which should be clear from this 
study is that preference between dissonant and 
consonant information is considerably more 
complicated than it has generally been pictured. 
Many factors probably affect it, and at least 
some of them tend to make subjects prefer 
dissonant information. The present study pre 
sents strong evidence that under some circum- 
stances subjects will choose dissonant over con- 
sonant information. They may be seeking out 
the dissonant information only because they hope 
to refute it, but the fact remains that their 
preference is for information that is clearly 
labeled as dissonant. 
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CATEGORY WIDTH AND RESPONSES TO INTERPERSONAL 
DISAGREEMENTS * 


IVAN D. STEINER anb HOMER H. JOHNSON 


University of Illinois 


In-an experimentally created conflict situ 


Pettigrew Category Width Scale tended to underrecall disagreements with 
respected associates, whereas narrow categorizers relied upon conformity as a 
means of reducing dissonance. These findings are consistent with Berkowitz’ 


suggestion that broad categorizers tend 
narrow categorizers do not. 


Research by Gardner (1953) and Pettigrew 
(1958) suggests that there are reliable individual 
differences in the widths of the cognitive cate- 
gories people employ when they judge phe- 
nomena. Some people regard rather diverse events 
as equivalent, whereas other people do not. 

The concept of category width appears to 
have considerable relevance for studies of con- 
formity behavior. People who employ wide cate- 
gories should see little difference between their 
own views and the divergent opinions of their 
respected associates. And because conformity 
tends to be positively related to the perceived 
discrepancy between one’s own views and those 
advocated by others (Harvey, 1962; Sherif & 
Hovland, 1961; Weiss, 1958), people who use 
wide categories should be less inclined to con- 
form than are people who use narrow categories. 
On the other hand, wide categorizers should be 
prone to minimize or underrecall disagreements. 
As Berkowitz (1960) has suggested, they should 
“assimilate” differences. 

The present research employs the Pettigrew 
(1958) Category Width Scale as a tool for meas- 
uring category width. Rosen (1961) found that 
male subjects who were identified by this scale 
as narrow categorizers were significantly more 
prone to employ a dissonance-reducing response 
(seek information which supported their deci- 
sions) than were wide categorizers. And in a pre- 
liminary investigation the present authors ob- 
tained a significant negative correlation between 
the Pettigrew scale and Fiedler’s ASo Scale. The 
latter findings suggests that wide categorizers are 
especially prone to minimize differences when 
making judgments about people. 


PROCEDURE 
Subjects 


The experimental subjects were 64 male under- 
graduate students enrolled in the introductory psy- 

1 This research was supported by Grant M-4460 
from the National Institutes of Health, United States 
Public Health Service. 
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ation Ss who had scored high on the 


to “assimilate” differences whereas 


chology course at the University of Illinois, Twenty- 
three additional male students from this course 
served as control subjects. All subjects received re- 
search credit for their participation. 
Preliminary Sessions f 

The experimental subjects reported to a classroom 
in groups of 10-20 students. At these sessions they 
responded to the Pettigrew scale. Only the 10 items 
which Pettigrew had found to correlate most highly 
with laboratory tests of category width were used 
in the present study. Scores based on these 10 items 
ranged from 5 to 49 with a mean of 34.1. 


Experimental Treatments 


At a designated time during the 2 weeks which 
followed the administration of the Pettigrew scale) 
each subject reported to the laboratory where he 
met two other students who were serving as accom- 
plices of the experimenter. The same two male stl- 
dents served as accomplices for all experimental ses- 
sions, Subjects were randomly designated to receive 
one of two experimental treatments, Thirty-two sub- 
jects received each treatment. 

Treatment 1. In response to oral questions by the 
experimenter one accomplice reported that he was k l 
premedical student, a junior, and had an academic 
average higher than B. The other accomplice reported 
that he was a physics major, a junior, and also had 
an average higher than B. A previous study (Steiner 
& Johnson, 1963) has indicated that subjects who 
receive this bogus information are almost invariably 
convinced that their associates are highly competent } 
and resourceful persons. i 

After the naive subject had reported his major, 
class at the University, and academic average, the 
experimenter announced that he would administer & 
test which would be answered orally. Participants 
were urged to pay attention to one another’s an- 
swers so that they might improve the “accuracy 0 
their interpersonal perceptions.” The test to whit 
they were subjected consisted of 32 multiple-choice 
questions which called for judgments about politi 
and social events, the characteristics of animals, the 
sizes and shapes of geometrical figures, and numbet 
series which had been included in IQ tests. Each 
question was printed, along with three possible an- 
Swers, on a card approximately 9 X 11 inches in siz 


p 


E. BRIEF ARTICLES 


= Bach card was presented for a period of 30 seconds 
and participants were requested to announce their 
, answers orally. The experimenter suggested that it 
was much easier for him to record the answers if 
` participants always adhered to the same order when 
announcing their replies. He then instructed the “pre- 
s medical student” to give his answers first and asked 
the “physics student” to announce his answers second. 
. ‘The accomplices had been coached to give a pre- 
determined set of answers. In Treatment 1 the two 
* accomplices gave identical answers to 28 questions, 
419 of these being the replies which had most often 
been given by 23 control-group subjects who had 
responded privately to the questions without any 
knowledge of other people’s replies. Nine of the an- 
swers given by the accomplices were those which 
had least frequently been given by members of the 
control group. To the 4 remaining questions each 
@ accomplice gave two “most popular” answers and 
two which had been second most popular among 

‘members of the control group. Consequently, the 
+ two accomplices disagreed with one another a total 
of 4 times, but each gave 21 answers which were 
Most popular, 9 which were least popular, and 2 
which were intermediate in popularity among mem- 
hers of the control group. 

After all 32 cards had been exposed the subject 
and the accomplices were asked to provide written 
estimates of the number of times they had disagreed 
with one another. 

Treabment 2. Except for the answers which the ac- 
Complices gave to the 32 questions, this treatment 
Was exactly like Treatment 1. Each accomplice again 
gave 21 popular answers, 9 least popular answers, 
and 2 answers which were intermediate in popu- 
larity, But the questions to which the “physics stu- 
dent” gave least popular answers were always dif- 
ferent from those to which he had given least popu- 
lar answers in Treatment 1, while the “premedical 
Student” gave exactly the same answers he had given 
het Treatment 1. Consequently the two accomplices 
disagreed with one another 22 times instead of 4, 

ut each gave the same number of most popular an- 
. SWers he had given in Treatment 1. 

Response measures. A conformity score was com- 
o for each subject by counting the number of 
Hy he had agreed with the two accomplices. Agree- 
Se with one accomplice were added to agreements 
A the other accomplice. For Treatment 1 the 
x ee was 45.4 and the standard deviation was 
aa or Treatment 2 the mean was 34.6 and the 

ndard deviation was 2.0. 
ated estimates of the number of disagree- 
pe pee each accomplice were subtracted from 
TO ctual number of disagreements to yield an index 

Underrecall of disagreements with the accomplices. 

3 tia score indicates the degree to which the subject 

ees his disagreements with the two accom- 
hae The means and standard deviations were 5.9 
ihe 1 for subjects who received Treatment 1, and 

© and 75 for subjects who received Treatment 2. 
Pali Didi indicating the subject’s tendency to under- 
ities agreements between his associates was ob- 

by subtracting his estimate of the number of 
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TABLE 1 


CORRELATIONS BETWEEN CATEGORY-WipTH SCORES 
AND RESPONSES IN EXPERIMENTAL SESSIONS 


RAREN TE I Tatort i 

Conformity to ac- 

complices —.78* —.02 
Underrecall of disagree- 

ments with accomplices —.10 „51+ 
Underrecall of disagree- 

ments between ac- 

complices —.10 .28 

*p<.01. 


times such disagreements had occurred from the 
actual number of such disagreements, For Treat- 
ment 1 the mean score was O and the standard 
deviation was 5.0. For Treatment 2 the comparable 
statistics were 12.0 and 5.1. 


RESULTS 


Table 1 reports relationships between category 
width and scores on conformity and underrecall. 

Subjects who received Treatment 1 were free 
to conform, and the data indicate that narrow 
categorizers did so to a significantly greater de- 
gree than did wide categorizers. In Treatment 2 
the accomplices disagreed with one another so 
many times that it was virtually impossible for a 
subject to earn a high conformity score or for 
one subject to conform much more often than 
another. (The standard deviation of the scores 
was only 2.0.) Under these very restrictive cir- 
cumstances there was no relationship between 
category width and conformity. 

Among subjects who received Treatment 2, 
underrecall of disagreements with the accom- 
plices was significantly greater for wide cate- 
gorizers than it was for narrow categorizers. How- 
ever, in the case of subjects who received Treat- 
ment 1 the anticipated positive relationship was 
not obtained. 

Among subjects who received Treatment 2, 
wide categorizers were more inclined than were 
narrow categorizers to underrecall disagreements 
between accomplices (p < .12). In Treatment 1 
the accomplices actually disagreed with one an- 
other only four times and the subject’s freedom 
to underrecall was seriously restricted. Under 
these conditions a very low negative relationship 
was obtained. - 


DISCUSSION 


The experimental conditions seriously limited 
the freedom of subjects to conform in Treat- 
ment 2 and to underrecall disagreements among 
accomplices in Treatment 1. Consequently it 
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seems reasonable to suggest that only four of the 
six correlations presented in Table 1 should be 
regarded as evidence bearing on the effects of 
category width. Three of these four correlations 
are in the anticipated direction and two of them 
are significant at the .01 level. 

The fact that wide categorizers did not show 
a special proneness to underrecall disagreements 
with the accomplices in Treatment 1 seems to 
contradict the thesis of this paper. However, it 
should be noted that in Treatment 1 neither the 
wide nor the narrow categorizers showed much 
tendency to underrecall disagreements with the 
accomplices, The mean underrecall score was only 
about half as large as it was for subjects who re- 
ceived Treatment 2. In Treatment 1 the predomi- 
nant response was conformity; in Treatment 2 it 
was underrecall. And in both treatments category 
width was related to the predominant response in 
the manner demanded by our thesis. 

Underrecall of disagreements with the accom- 
plices was significantly related to category width 
in Treatment 2, but underrecall of disagreements 
between accomplices was not (p < .12). The fail- 
ure of the latter correlation to reach an accept- 
able level of significance may indicate that dis- 
agreements between associates are less anxiety 
provoking than are disagreements between one’s 
self and one’s associates. If such is the case even 
narrow categorizers may tend to ignore or “as- 
similate” differences between associates while 
concentrating on the more important differences 
between themselves and their associates. 

However, an alternative explanation seems 
more plausible. The range and standard devia- 
tion of the scores for underrecall of disagree- 
ments with accomplices are larger than they are 
for underrecall of disagreements between accom- 
plices. Inspection of the scatterplots indicates 
that a few extreme cases contributed markedly 
to the correlation between category width and 
underrecall of disagreements with accomplices, 
whereas this was not true for the other correla- 
tion, In order to minimize the effect of extreme 
cases Kendall’s tau was employed in a reap- 
praisal of the correlations. The obtained tau 
values were .29 and .28, both significant at the 
.05 level. In view of these findings it seems prob- 
able that category width had very similar effects 
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on the two types of underrecall examined 
study, K 

The theories of Berkowitz (1960) and 
(1961) suggest that there may be condi 
der which narrow categorizers will ext 
“contrast effects.” Thus if a narrow catego 
views are frequently and emphatically 
dicted by a person of questionable com} 
he may tend to overrecall disagreements 
is no evidence in the present study that 
action occurred, but the high prestige of 
complices and the relatively mild disagre 
they created could scarcely be expected to 
courage this response. If our accomplice 
been introduced as less prestigeful person 
if they had expressed more extreme vi 
would have expected them to be rejected b 
row catgorizers. Under such conditions over 
of disagreements might have occurred. 
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AUTHORITARIANISM, DOGMATISM, AND USAGE OF 
CONCEPTUAL CATEGORIES * 


B. JACK WHITE, RICHARD D. ALTER, AnD MAX RARDIN 
University of Utah 


Ss who were high on both the Dogmatism Scale and the F Scale (high D-F Ss) 
and Ss who were low on both scales (low D-F Ss) were compared with regard 
to their usage of conceptual categories in the classification of stimulus materials 
which were either high or low in relevance to the personality syndromes— 
undesirable social acts and occupational names, respectively. According to 
expectations, high D-F Ss used fewer,and broader categories in their judgments 
of the high-relevance materials than low D-F Ss, but the groups did not differ 
in their judgments of the low-relevance materials, Thus, the ascribed tendency 
of authoritarians and dogmatics to render black-white judgments was exhib- 
ited only in the judgment of syndrome-relevant stimuli. 


© In the absence of exertion of pressures to 

“change by authority figures, subjects who are 
high on the California F Scale are more resistant 
to attitude change than their low counterparts 
(Harvey & Beverly, 1961). Differential resist- 
ance to change has also been found in studies of 
judgment of lifted weight in the presence of a 
remote anchor (White & Alter, 1965), judgment 
of distance (Harvey & Caldwell, 1959), and clas- 
sification of progressively changing hues, num- 
bers, and animal figures (Frenkel-Brunswik, 
1949), The fact that resistance to change is 
characteristic of the authoritarian, even in the 
judgment of stimuli which have no apparent mo- 
tivational relevance and no especial relationship 
to the authoritarian syndrome, indicates that this 
particular characteristic is one which has wide 
generality. 

There is also reason to expect that personality 
syndromes such as authoritarianism and dog- 
matism would produce tendencies to use fewer 

and broader categories in judgment (Frenkel- 

_ Brunswik, 1949); however, the generality of 

these tendencies is open to question. For example, 

in the study of judgment of lifted weights men- 

. tioned above, White and Alter found no differ- 
ence in coarseness-fineness of categories used or 
in absolute number of categories used by subjects 
who were high or low on both the F Scale and 
Dogmatism (D) Scale. While any of a number of 

. Procedural and other variables may account for 

e failure to find differences, one should not 
Overlook the possibility that it may be unrealistic 

. to expect authoritarianism and dogmatism to pro- 

duce characteristic modes of category usage for 

any but syndrome-relevant stimuli. 


i 1 Financial assistance for the study was provided 
S the University of Utah Faculty Research Fund in 
tant 153-510706 to the first author. 


At least one other study indicates such a lack 
of generality. Rokeach (1951) found that “nar- 
row-minded” subjects gave more concrete defini- 
tions of religious concepts and organized the con- 
cepts less comprehensively than “open-minded” 
subjects, but the same subjects did not differ in 
their conceptualization of political-economic con- 
cepts, which, according to his measures, were less 
“salient” and less “central.” If the implications 
of Rokeach’s study are correct, subjects who 
differ in authoritarianism and dogmatism should 
differ in the manner in which they classify stimuli 
which are highly relevant to the personality syn- 
dromes, but they should not differ in their classi- 
fication of stimuli which have low relevance. 

In order to assess the foregoing proposition, 
the stimulus materials for the present study were 
chosen to represent two degrees of relevance to 
authoritarianism and dogmatism. Since one of 
the characteristics of authoritarianism and dog- 
matism is intolerance of behavior which deviates 
from major social norms, statements describing 
undesirable social acts were used as stimuli 
with high syndrome relevance. Occupations which 
varied in prestige were selected as having lower 
relevance. On the basis of Rokeach’s (1951) 
study, it was anticipated that subjects who were 
high on both the D and F scales would manifest 
greater intolerance of the undesirable social acts 
by classifying a larger proportion of them in the 
most undesirable categories and by using fewer 
categories than subjects who were low on both 
scales. The two groups were not expected to 
differ in their sortings of the occupations. 


METHOD 
Subjects 


The D Scale (Rokeach, 1960) and the F Scale 
(Adorno, Frenkel-Brunswik, Levinson, & Sanford, 
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Fic. 1, Percentage of acts and occupations placed in each category by high and low D-F subjects. 


1950) were administered to 410 introductory psy- 
chology students. Of the 24 subjects selected, 12 
scored in the upper 15% of both the D and F dis- 
tributions (high D-F subjects) and 12 scored in the 
lower 15% (low D-F subjects). The high D-F sub- 
jects’ range of scores on the D Scale was 170-202 
(M = 185) and on the F Scale was 129-157 (M= 
137). Comparable ranges for the low D-F subjects 
were 86-123 (M = 105) and 48-71 (M = 60). Stu- 
dents who were asked to volunteer were informed 
that they represented a random sample from the class 
roll, and that they would be given extra grade points 
for their participation. The subjects who served in 
the present study also served in the study of judg- 
ment of lifted weight by White and Alter (1965). 


Stimuli 


The 149 occupations and 149 undesirable social 
acts were chosen from the lists of those used by Mc- 
Garvey (1943), The prestige of occupations ranged 
from “street cleaner” to “university president.” The 
undesirable acts ranged from “cheating at solitaire” 
and “fishing without a license” to such acts as “put- 
ting your deformed child in the circus” and “having 
incestuous relations with your parent.” Each act or 
occupation appeared on a separate slip of paper in 
a shuffled packet which was given to each subject 
to sort. 


Procedure 


Each subject was seated at the side of a 3 X 6 foot 
table which was not in view of other subjects. In 
sorting the social acts, they were instructed first io 
look through the packet in order to become familiar 
with the items, then to sort the items on the basis of 
undesirability, letting the pile of slips at the far left 
on the table represent the most undesirable acts. 
They were told to “use as many or as few piles as 
you think are necessary for the sorting.” Instruc- 
tions for sorting the occupations were the same eX- 
cept that they were to be sorted on the basis of 
Prestige, the most prestigeful occupations belonging 
in the pile on the subject’s far left. Between sort- 
ings, the subjects were engaged in judging lifted 
weights for a 20-minute interval. The order of sort- 
ing the acts and occupations was counterbalanced. 


RESULTS 


For the purpose of analysis of category usages 
each pile of slips was considered to represent # 
category, the pile on the subject’s far left being 
Category 1, the next pile, 2, etc. The percentage 
of items placed in each category by the tw? 
groups is portrayed in Figure 1. 


aS 
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, Comparison of Distribution of Items 


. It was expected that low D-F subjects would 
_ classify fewer of the acts in the most undesir- 
able categories than high D-F subjects, that is, 
that low D-F subjects would produce a more 
-yectilinear distribution of items across the vari- 
ous categories. As the first step toward the com- 
parison of rectilinearity of judgments in the two 
groups, each subject’s distribution of judgments 
was converted to an H value, an index which re- 
flects rectilinearity (Attneave, 1959). Compari- 
son of the H values for acts indicated that the 
low D-F subjects produced a more rectilinear 
distribution (¢= 2.42, df= 22, p<.025, one- 
tailed) than high D-F subjects. Inspection of 
| Figure 1 indicates, as expected, that high D-F 
subjects placed a greater proportion of the acts 
in the more undesirable categories. Comparison 
“of the H values for occupations indicated there 
was no significant difference between groups (t 
= 22, df = 22, p > .70). 
. In view of the results just reported, it is not 
surprising to find that high D-F subjects pro- 
duced a more rectilinear distribution in their sort- 
ing of the occupations than in their sorting of the 
acts (t= 3.58, df = 11, p < .01), that low D-F 
subjects did not differ in the rectilinearity of 
their sortings of the acts and occupations (t= 
65, df=11, p> .50), and that high D-F sub- 
jects evidenced a greater difference between H 
values for their sortings of the acts and occupa- 
tions than low D-F subjects (t= 3.58, df = 22, 
$ < 01). 


f. Comparisons of Number of Categories Used 


The low D-F subjects used more categories for 
acts than high D-F subjects (Mann-Whitney U 
| =37.5, p < 05, one-tailed) but there was no dif- 
ference in the number of categories used for oc- 
cupations (U = 61.5, p > .10). Another view of 
the results can be obtained by a look at the num- 
ber of subjects who used more categories for 
acts than occupations, the number who used 
equal numbers of categories for both sets of 
stimuli, and the number who used more cate- 
Fe for occupations than acts. The respective 
_ figures in the high D-F group were 0, 4, and 8, 

and for the low D-F group the figures were Hah 

ae 2. It is especially noteworthy that five of 
_ the low D-F subjects made finer discriminations 
m judgments of the more affective stimuli than 
in judgments of the less affective stimuli, whereas 
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none of the high D-F subjects performed in this 
manner. 


Discussion 


The results clearly indicate that high and low 
D-F subjects differed only in their classification 
of syndrome-relevant stimuli. Further, the fact 
that high D-F subjects used fewer and broader 
categories in their judgment of the stimuli with 
high syndrome relevance than in the judgment 
of stimuli with low relevance indicates that the 
tendency of authoritarianian and dogmatic indi- 
viduals to render black-white judgments does not 
pervade their usage of conceptual categories in 
all judgmental tasks. As Sherif and Hovland 
(1961) pointed out, a 


requirement for the study of systematic variations in 
judgment produced by attitude is that the stimulus 
material and procedures be effective in arousing the 
attitude under study [p. 951. 


Their statement seems equally applicable to the 
study of the effects of personality factors on 


judgment. 
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A COMPARISON OF INDIVIDUAL AND GROUP LEARNING OF 


NONSENSE SYLLABLES ! 


BERNARD L. RYACK 


United States Naval Submarine Medical Center, Groton, Connecticut 


Individuals and 2-man groups learned lists of either 6, 12, or 18 nonsense 
syllables. Data from individuals are analyzed in 3 ways: (a) average individual 
performance was compared directly to 2-man group performance; (b) per- 
formance of the best member of artificially created 2-man groups was com- 
pared to true group performance; (c) pooled learning of the 2 members of 
the artificial groups was compared to the learning of the true groups. 2-man 
group learning was significantly superior to average individual learning and to 
the learning of the best member of artificial groups, but did not differ sig- 
nificantly from the pooled learning of the members of the artificial groups. 
Regardless of the method of analysis, errors increased with increases in the 
length of the list. The results are consistent with a 


of group performance. 


Shaw (1932) compared the solutions of indi- 
viduals on eureka-type problems to the solutions 
obtained from the same individuals working as 
groups of four. Individuals were correct in 7% 
of the solutions attempted and groups in 40%. 
Marquart (1955) subsequently noted that by 
comparing one individual to a group Shaw failed 
to allow for the fact that group solution to a 
problem might have been the result of solution 
by one individual. The probability of solution by 
one of four individuals working as a group is 
greater than the probability of solution by one 
individual working alone. Taking this into con- 
sideration in analyzing her own data for four of 
Shaw’s problems and for four similar problems, 
Marquart found that groups were not superior 
to individuals. A similar point has been made by 
Taylor (1954) and by Lorge and Solomon (1955) 
whose mathematical model of group problem 
solving is based upon the assumption that group 
solution to a problem may be the result of the 
performance of any of its members rather than 
of the group as a whole, Lorge and Solomon ap- 
plied the model to Shaw’s data to test the hy- 
pothesis that the superiority of groups in Shaw’s 


1 The research was conducted while the author was 
a Postdoctoral Research Associate of the National 
Academy of Sciences-National Research Council at 
the Aerospace Medical Laboratory, Wright-Patterson 
Air Force Base, Ohio. It was sponsored by Julian O, 
Morrisette, whose helpful advice has been appreci- 
ated. Portions of this Paper were presented at the 
1959 meeting of the Ohio State Psychological Asso- 
ciation. 

The final manuscript was prepared under Bureau 
of Medicine and Surgery, Navy Department, Project 
MRO005.14-1100. George Moeller was the source of 
many valuable comments and suggestions concerning 
the manuscript. 
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pooling of abilities model 


study resulted from the pooling of abilities of 
members rather than from other types of inter- 
action, There was close correspondence between, 
the number of obtained and predicted group 
solutions. 

Perlmutter and DeMontmollin (1952) com- 
pared the performance of groups and individuals 
on equivalent lists of nonsense syllables. One list 
was learned while each person worked individu- 
ally in the presence of two others; the other list 
was learned while the three persons worked to- 
gether as a group. The average group recalled 
more words correctly than the average individual, 
and group scores tended to be better than or 
equal to the best individual score. Hoppe (1962) 
applied the pooling of abilities model proposed 
by Lorge and Solomon (1955) to both a portion 
of Perlmutter and DeMontmollin’s data and a 
replication of the latter study. The hypothesis 
tested was that groups remembered more words 
than individuals as the result of the pooling of 
the words remembered by each individual. In 
both cases, obtained scores corresponded. closely 
to group scores which were predicted from the 
Scores of individuals. 

In Hoppe’s (1962) application of the Taylor 
and Lorge-Solomon model, the proportion of the 
entire list recalled by a group is estimated from 
the proportion of the entire list recalled by indi- 
viduals. Taylor (1954) suggests an alternative 
approach to evaluation of the model. Subsequent 
to the experiment proper, data from individuals 
are combined randomly to form “nominal” (arti- 
ficial) groups of the same size as the true groups- 
The performance of the artificial groups is then 
scored as though the members “had actually 
worked together. 

Considering the above studies together, it be- 


| 
: 


BRIEF ARTICLES 


= 


comes evident that the conclusions one arrives at 
regarding the relative superiority of individuals 
and groups depends upon the assumptions made 
regarding the nature of the interaction process 
and the resulting technique of data analysis. The 
present study was designed to investigate the ef- 
‘fects of three assumptions regarding the nature 
of the interaction process. These were that group 
performance is a function of: best individual 
j performance; average individual performance; 
‘and a pooling of individual abilities. In the latter 
“case Taylors (1954) artificial group approac’ 
C was employed. v 
‘Perlmutter and DeMontmollin (1952) state a: 
one of their objectives the investigation of group 
and individual learning of nonsense syllables. 
_ Subjects were run for five trials on a list of 19 
_2-syllable nonsense words. No information is 
given regarding either the rate of presentation 
‘of the words or their characteristics. Hoppe 
(1962) controlled rate of presentation, but did 
not report the association value of his nonsense 
‘material. Eight 5-letter nonsense words, con- 
structed by combining 3-letter nonsense syllables 
from Hilgard’s (1951) lists, were presented to the 
ie for nine trials at a 5-second repetition 
rate. 

The objectives of the present study were to 
evaluate the relative performance of individuals 
and two-man groups when different assumptions 
were made regarding the group interaction proc- 
E and to assess the relative leaming of indi- 
viduals and two-man groups using several dif- 
ferent lists of classical 3-letter nonsense syllables 

of known association value. 


; METHOD 
Subjects 


The subjects were 120 male undergraduates at the 
University of Dayton. They were randomly assigned 
to group or individual conditions of learning. 


MEANS AND STANDARD DEVIATIONS FO 
UNDER EACH CONDITION OF ANALYSIS 
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Design 

Two conditions of learning of nonsense syllables, 
two-man group or individual, were paired with each 
of three lists of nonsense syllables resulting in three 
group and three individual experimental conditions. 
There were 10 replications for each two-man group 
condition and 20 for each individual condition. 


Materials and Apparatus 


Stimulus material consisted of lists of 6, 12, and 
18 nonsense syllables of 0% association value which 
were prepared from the Glaze lists (Hilgard, 1951). 
A given syllable appeared in only one list. Two 
Dunning Animatic film-strip projectors, synchronized 
by Hunter timers, were used to present the stimulus 
material on two rear-projection screens. 


Procedure 


Using the method of complete recall, individuals 
and two-man groups learned one of the lists of syl- 
lables to a criterion of two errorless trials. Syllables 
were presented in a fixed order at 2-second intervals. 
The beginning of a trial was indicated by projecting 
an “X” for 4 seconds on the screens. All subjects 
were instructed to repeat each “word” to themselves 
as it appeared and were informed that order of re- 
call was not important. At the end of each trial 
there was a 5-minute recall period during which sub- 
jects recorded all of the words that they could re- 
member. 

Subjects assigned to the individual condition 
learned the words by themselves. Members of two- 
man groups simultaneously viewed the words on 
separate screens. They were instructed to work co- 
operatively in learning the words and to prepare a 
single list of the words they recalled. Communication 
was restricted to the recall period. 

Prior to the experimental session, individuals and 
two-man groups received two practice trials with a 
nine-word list. Subjects entered the experimental ses- 
sion without any knowledge of the length of list 
they would be required to learn. 


RESULTS 


The learning of groups and individuals was 
measured by the mean number of errors to cri- 


TABLE 1 
r Lists oF 6, 12, AND 18 SYLLABLES 


6.20 


Length of list 
Condition 6 12 18 
M SD M SD M SD 

True 6.60 4.09 18.40 6.54 55.60 10.31 
Individual s 13.85 8.92 37.55 18.93 98.00 31.02 

est individual 
oe groups 7.90 3.32 26.50 12.66 69.20 17.37 

“atlical en 2.89 18.90 10.79 45.50 10.77 


artificial groups 
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terion. An error was defined as the inclusion of 
an incorrect response or the omission of a cor- 
rect response, The data were analyzed in three 
ways. The means and standard deviations for 
each analysis are summarized in Table 1. Differ- 
ences between the means for each analysis were 
tested with a 2 X 3 factorial analysis of vari- 
ance.? The two main effects were group or indi- 
vidual condition of learning and length of list of 
nonsense syllables. 

In the first analysis average individual per- 
formance (mean number of errors to criterion) 
was compared directly to average group perform- 
ance. Group performance was significantly su- 
perior to individual performance over all lengths 
of list, F = 35.23, df = 1/84, p <.001. The dif- 
ference between lengths of lists was also highly 
significant, F=129.2, df =2/84, p< .001. No 
significant interaction was obtained. 

As Marquart (1955), Taylor (1954), and 
Lorge and Solomon (1955) have pointed out, 
this type of analysis fails to take into considera- 
tion the fact that the achievement of the cri- 
terion by groups may have been the result of 
learning by only one member. Furthermore in 
tasks such as the learning of nonsense syllables, 
it is important to take into consideration per- 
formance on specific trials. 

Because of these considerations, the second 
analysis was carried out as follows. For each 
length of list, individuals were randomly as- 
signed to artificial two-man groups, Under the 
assumption that the learning of a true two-man 
group reflects only the performance of the best 
member on each trial, the measure of learning of 
an artificial two-man group was taken as the per- 
formance of the best individual on each trial, 
The learning of true two-man groups was com- 
pared with best individual performance in the 
artificial two-man groups. When these data were 
subjected to analysis of variance, performance of 
true groups was still significantly superior to that 
of the artificial groups, F = 6.63, df=1/54, p 
<.05. The difference between the means of 
the true groups and the artificial groups was, 
however, less than that obtained with the group- 
individual comparison (Table 1). Differences in 
performance with the 6-, 12-, and 18-word lists 
temained highly significant, F = 140.26, df = 2/ 
54, p<.001. The interaction of condition of 
learning and length of list remained nonsignifi- 
cant. 

The third technique of analyzing the data was 
derived by combining Taylor’s (1954) artificial 


2 The transformation (Y-2)- was applied to the 
data. The author is indebted to H, Leon Harter for 
development of this transformation. 
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group technique with Lorge and Solomon’s (1955) 
extension of the mathematical model of group 
problem solving to multistage problems. The ex- 
tension takes into consideration the fact that 
certain problems may be composed of stages and 
that the solution of such problems can be accom- 
plished by a pooling of abilities if at least one“ 
member can solve a stage. Translating these aš- 
sumptions to meet the conditions of nonsense- 
syllable learning, the learning of each word 
within a trial can be considered to be a stage in 
achieving criterion. Within each trial, therefore, 
ascore for artificial groups was obtained by pool- 
ing the responses of the two members. A corrett 
response was scored for a given syllable if it was 
recorded correctly by either member, An error 
was scored if either member recorded more words 
than were contained in the list. Since a given 
word can be correct only once, this technique of | 
analyzing the data eliminates the first trial serial 
position effects reported by Hoppe (1962). No 
significant difference was obtained between the 
performance of true two-man groups and arti- 
ficial two-man groups. The F for length of list 
was significant, F= 125.34, df=1/54, p< 
001. The interaction was not significant. Com- 
parison of first trial serial position effects for 
artificial groups and true groups revealed no sig- 
nificant differences. 


Discussion 


When the mean number of errors made by 
groups is compared directly to average indivdiual 
performance, group learning of nonsense syl- 
lables is significantly superior to learning by in- 
dividuals. If the comparison is made in terms of 
the performance of the best member of arti- 
ficially constituted groups, the performance of 
the true groups remains significantly superior, 
but the difference between the absolute values 
of the means is reduced. When, however, the as- 
sumptions of the Taylor and Lorge-Solomon 
(Lorge & Solomon, 1955; Taylor, 1954) model 
are applied to the artificial groups, no significant 
difference is obtained between the learning of the 
artificial groups and the learning of the true 
groups. 

The reduction of the magnitude of the differ- 
ence between the means of groups and of indi- 
viduals when the data from individuals are 
treated in different ways is in agreement with 
Marquart’s (1955) contention that differences 
between individual and group performance may 
be reduced or eliminated by changing the tech- 
nique of analysis of the data, and indicates the 
importance of the assumptions relating to the 
interaction process. Comparison of average indi- 


< vidual scores to group scores assumes that each 
| individual contributes to the group process in 
either a positive or negative way. The final meas- 
. uré of group performance gives equal weight to 
the performance of all members. A score for 
artificial groups based on the performance of the 
best individual assumes that the poorer of the 
| two individuals makes no contribution, either 
positive or negative, to the group product. The 
Taylor and Lorge-Solomon model is based upon 
the assumption that group performance is the 
result of a pooling of abilities by the group mem- 
bers. The derived measure assumes that while it 
is possible for one individual to be the sole con- 
tributor to group performance, the group product 
may represent the pooling (in some unspecified 
way) of the contributions of all members. 
_ In the present study the nonsignificant differ- 
pence between the errors of the true groups and 
the pooled error measure of artificial group per- 
formance, generated from the Taylor and Lorge- 

Solomon model, provides support for a pooling 
of abilities approach to group interaction. The 
random pairing of individuals to form artificial 
two-man groups represents 1 of 190 possible pair- 
ings. Although additional pairings were generated 
Which yielded similar results, it was not possible 
to test all of the combinations. Conceivably some 
of these alternative pairings might have resulted 
in a different interpretation of the data. Further- 
more, the Taylor and Lorge-Solomon model pro- 
vides only a mathematical framework for the 
Prediction of group performance from individual 
performance. It provides no information on the 
He form of the pooling process. In preparing 
i a artificial group lists, the present treatment of 
b e data assumes a perfect pooling of responses 
y individuals on each trial and perhaps under- 
estimates errors, 

Some additional support for the pooling-of- 
[ities model of group performance is available 
ane a postexperimental questionnaire adminis- 
ered to the subjects of the present study. Group 
Ania were asked to describe the techniques 

at they used to learn the words. Of the 30 true 
eee 9 reported dividing the task between 
are in some way and then pooling the results 

the major techniques employed here were to 
ae the list into a first and last half or to take 
Soe words); 15 reported that each member 
ee R learn all of the words by himself (in this 
= at e recall period was used for checking and 
bi ining lists); and 6 groups used some com- 
nation of dividing the task and working alone. 
aaa onsidering the data obtained for 3-letter non- 
pa syllables in relation to the data obtained by 
ge and DeMontmollin (1952) with 2- 
le nonsense words and by Hoppe (1962) 
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for 5-letter nonsense words, similar relationships 
are found for group and individual performance. 
In all instances, when average group perform- 
ance is compared directly to average individual 
performance, significant superiority of group per- 
formance is found. When, however, the individual 
data is treated within the framework of the Tay- 
lor and Lorge-Solomon pooling-of-abilities model 
(Hoppe, 1962), differences between group and 
individual performance disappear. The fact that 
this relationship is maintained regardless of dif- 
ferences in the learning task would appear to in- 
dicate that the effects obtained are a function of 
the nature of the group interaction rather than 
of the type of stimulus material. Further support 
for such a hypothesis is to be found in the simi- 
lar results reported by Lorge and Solomon (1955) 
in their reanalysis of Shaw’s (1932) data for 
eureka-type problems. The generality of this re- 
lationship for other types of tasks needs to be 
established. 

Differences between means for the 6-, 12-, and 
18-word lists were all significant ($ < 01) when 
the Duncan multiple range test (Duncan, 1955) 
was applied. Although the mean errors increased 
progressively with increasing length of list, the 
relative performance of groups and individuals 
remained unchanged. Within the constraints of 
the present investigation length of list of non- 
sense syllables would not appear to be an im- 
portant parameter effecting the relative efficiency 
of group and individual learning. 
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The findings of the APA’s Project on Scientific Information Exchange in Psychology call 
attention to the difficulties of the scientist in establishing and maintaining contact with current 
work relevant to his interests. Accordingly, the project is sponsoring a series of changes during 
the next 12-15 months which are designed to enhance scientific communication. These changes 
are planned as limited trials. Moreover, they are wholly supported by the Project and hence 
do not cut into our page allotment. 


d 
The Editor has agreed to make possible a test of one of the changes, that is, the titles and 
authors of accepted papers will be listed following the last articles in the journal. Such listing 
will make it possible for our readers to become aware of research many months in the advance | 
of its journal publication. At the end of the year’s trial the results of the trial will be evaluated. 
and further consideration will be given at that time as to whether the service is worth continuing. 


This listing plus those published in the preceding 1965 issues are the entire backlog of manu- 
scripts accepted by this journal through February 1965, 


1965 (February) Manuscripts Accepted for Publication in the 
Journal of Personality and Social Psychology 


Some Factors Influencing the Assimilation of Disparate Information in Impression Formation: Paul S. Rosen- 
bas and Walter H. Crockett*: Psychology Department, Clark University, Worcester, Massachusetts 

Sex and Birth-Order Differences in Conformity as a Function of Need Affiliation Arousal; William C. Carrigan and 
pme W. Julian*: Department of Psychology, State University of New York at Buffalo, Buffalo, New York 

Interaction of Anxiety and Ability in Complex Learning Situations: Martin Katahn*: Department of Psychology 
Vanderbilt University, Nashville, Tennessee 37203. 

Effect of Certainty about a Decision upon Postdecision Exposure to Consonant and Dissonant Information: 
Judson Mills*: Social Psychology Laboratory, University of Missouri, Columbia, Missouri 65202. 

The Generality of Social Schemas: Rae Carlson* and Mary Ann Price: California State College, 800 North State 
College Boulevard, Fullerton, California. 

Effects of Three Social Responses on Vascular Processes : Jack E. Hokanson* and Robert Edelman: Department of 
Psychology, Florida State University, Tallahassee, Florida. 

Differential Communication of Affect by Head and Body Cues: Paul Ekman*: Nonverbal Behavior Research 

„Project, San Francisco State College, 1405 Fourth Avenue, San Francisco, California. 

Attitude Manipulation in Restricted Environments: II. Conceptual Structure and the Internalization of Prop’ 
ganda Received as a Reward for Compliance: Peter Suedfeld* and Jack Vernon: Department of Psychology 
University of Illinois, Urbana, Illinois 61803. 

Effects of Prisoner’s Dilemma Format on Cooperative Behavior: Gary W. Evans* and Charles M. Crumbaugh! 
Parsons Research Project, Parsons State Hospital, Parsons, Kansas. 


* Asterisk indicates author for whom address is supplied. 
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< An important area of inquiry into the 
_. psychology of thinking has been much neg- 
^ lected of late, namely, the study of subjective 
logic. Research in this area some 15 or 20 
years ago reached the limited conclusion that 
-distortions in syllogistic reasoning commonly 
occur because of emotional and other factors 
(Lefford, 1946; Thistlethwaite, 1950). Not- 
withstanding the current interest in “naive 
psychology” (Heider, 1958), in “psychologic” 
(Abelson & Rosenberg, 1958; Osgood, 1960; 
Piaget, 1957, p. 25), and in personal calculi 
distinct from normative or objective logic 
(Colby & Gilbert, 1964; Shneidman & Far- 
berow, 1959), there has been a paucity of 
good experimental research aimed at the 
study of subjective logic in its own right. The 
_ few novel studies that have been done seem 

not to have aroused a great deal of interest 
(De Soto & Kuethe, 1959; De Soto, London, 
& Handel, 1965; McGuire, 1960). 

Our approach to this area flows indirectly 
from an interest in cognitive resistance to 
Persuasive communications, Modes of resist- 
ance to persuasion are referred to in the 
Psychological literature with various terms: 
‘defensive avoidance” (Hovland, Janis, & 
Kelley, 1953, pp. 88-89; Janis & Terwilliger, 
1962), “denial” (Abelson, 1959), “misunder- 
Standing” (Kendall & Wolf, 1949), “in- 

, Credulity” (Osgood & Tannenbaum, 1955), 

etc, Additional terms have been invoked to 

characterize the lack of critical resistance to 

Communications, for example, “rationaliza- 

5 *This research was supported by Grant MH-04682 

Ea the National Institute of Mental Health, 
ted States Public Health Service. 


THE SUBJECTIVE USE OF INDUCTIVE EVIDENCE: 


CHARLOTTE GILSON anp ROBERT P. ABELSON 


Yale University 


E 34 Ss were given questionnaires with 128 questions of the form, “Do subjects 
verb objects?” Information preceding each question consisted of 1 positive and 
2 negative instances. Answering “yes” to a question necessitated generalizing 
from one positive instance of the subject or of the object to the more general 
subject or object. 8 different subjects, objects, and verbs were used. Significant 
differences (p < .001) in the proportion of “yesses” were primarily a function 
of the verb and the form of generalization required. There was more generaliza- 
tion: (a) with “manifest verbs” than with “subjective verbs,” and (b) over- 
objects than over subjects. These results are preliminary steps toward the de- 
velopment of an understanding of the laws of subjective verbal logic. 


tion” and “wishful thinking” (McGuire, 
1960). These types of processes are assumed 
to be motivated by affects such as anxiety, 
or by “desires,” or by unstable tension states, 
such as “cognitive imbalance.” All of these 
dynamics have in common the supposition 
that resistance or overacceptance is triggered 
by the evaluative implications of the com- 
munication. For example, the “incredulity 
response” is postulated by Osgood and Tan- 
nenbaum (1955) to occur whenever a highly 
positively evaluated object and a highly neg- 
atively evaluated object are associated via a 
positive assertion, for example, “Queen Eliza- 
beth praises hatchet-murderers.” 

It is theoretically insufficient, however, to 
place total reliance on evaluative or motiva- 
tional processes in accounting for the ac- 
ceptance or rejection of communicated as- 
sertions. Clearly there is also a “reality com- 
ponent” involved, Unpleasant realities (e.g., 
“Civil rights progress risks social disorder.”) 
are not always rejected out of hand, nor are 
pleasing absurdities (e.g., “Sophia Loren is 
the Tennis Champion of Japan”) necessarily 
found credible. To complete the theoretical 
picture, it is necessary to interrelate motiva- 
tional and reality components (Abelson, 1963; 
Rosenberg & Abelson, 1960). Since so little is 
known about the reality component, it seems 
wise as a first step to study those mechanisms 
by which individuals evaluate the credibility 
or “subjective truth value” of assertions in 
cases in which the motivational component is 
minimal. 

The purpose of the present study was to 
begin an attack on the broad problem area 
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sketched above. It was postulated that one 
way people assess the credibility of a gen- 
eral statement is by induction from examples 
or instances of that statement. For example, 
to the assertion, “Senators take bribes,” one 
might say yes, Senator X does take bribes, 
or X-type Senators do take bribes, or one 
might say yes, Senators take excessive cam- 
paign contributions (positive instances), But 
what if one also knew that Senator Y, or Y- 
type Senators do not take bribes, or that 
Senators do not take free yachts (negative 
instances)? The process of establishing “in- 
ductive credibility” is similar to the process 
of inductive inference as referred to by 
logicians and mathematicians (Carnap, 1960; 
Watanabe, 1960), and to the processes in- 
volved in concept formation (Bruner, Good- 
now, & Austin, 1956; Hunt, 1962). However, 
we are interested in “psychologic” rather 
than logic. Moreover, the literature on con- 
cept formation turns out to be surprisingly 
irrelevant to our present purposes. Studies of 
concept learning generally involve the recruit- 
ment of a hypothesis to account for the given 
evidence, whereas we are concerned here with 
the recruitment of evidence to test a given 
statement, That is, we are concerned with 
performance rather than learning, with the 
psychology of testing broad generalities rather 
than generating precise hypotheses. Other 
differences from traditional concept-formation 
studies will become clearer below. 

A “sentence” as considered here, consists 
simply of a subject, verb, and object. A 
“positive instance of a sentence” is defined 
as a new sentence formed by leaving the verb 
unchanged and substituting instances of the 
subject and object for the original subject 
and object, respectively. Denote the instances 
of the subject, A, by Ay, Ao... Aj... and 
the instances of the object, B, by B;, Bo... 
Bx . . . . Positive instances of the statement 
“A verb B” are provided by all sentences of 
the form “A; verb By.” (One may also con- 
template sentences in which only one sub- 
stitution is made in the original sentence in- 
stead of two, e.g., “A verb B,,” or “A; verb 
B.” The above examples of instances of 
“Senators take bribes” were all of this special 
form. This form may be included in the gen- 
eral definition by the device of including A 
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itself in the set of instances of A, and B itself 
in the set of instances of B.) Negative in- 
stances of the sentence A verb B are defined 
by the form “A; do not verb B,.” 

The array of evidence provided by in- 
stances relevant to the original sentence can, 
be displayed in an evidence matrix. Reserv- 
ing Row j for sentences involving Aj, and 
Column & for sentences involving B+, one may 
enter in cell (j, &) of the matrix a “plus” to: 
denote the positive instance A; verb By, a 
“minus” to denote the negative instance Aj 
do not verb Bz, and a “zero” to denote the 
absence of either type of evidential instance. 
(Quantitative as well as qualitative variation , 
in these matrix entries is conceivable, though 
not discussed in this paper.) This matrix can’ 
in principle display any pattern of entries,’ 
and we may now define our general research 
task as the study of the inductive credibility 
of sentences as a function of the form of the 
evidence matrix and the semantic nature of 
the sentence components, d 

Certain forms of the evidence matrix will 
produce more or less obvious consequences. 
For example, if the matrix consists almost 
entirely of positive entries, with one or two 
empty cells, there will be a strong tendency 
for an individual to “fill in” these empty 
cells with plusses and to accept the uni- 
versality of the associated assertion. (There is 
good suggestive evidence for effects of this 
general type from a classic study by Wolfle, | 
1932.) However, if plusses and minuses are 
scattered throughout the matrix, it is not 
at all clear what the consequences will be. iq 

Fortunately, a tentative, general hypothesis 
about the evidence matrix was available whe? 
we entered this research program. Part of 4 
computer program by Carroll, Abelson, and 
Reinfeld (reported in Abelson & Carroll, 
1965) is a set of routines which assess the 
inductive credibility of assertions by looking 
at the “evidence matrix” of their instances: 
This computer program was written with the 
intention of simulating the real psychological: 
Processes involved, and several assumptions | 
about these processes were made on the basis 
of introspective intuition, y 

The crucial assumption on the use of in- 
stances was as follows: The general state- 
ment A verb B is considered credible if moré 


than half of the instances Aj of A are found 
to be connected with at least one instance 
B, of B in sentences of the form Aj verb Bx. 
Thus, if the program is concerned, for ex- 
ample, with determining in general whether 
“lawyers play card games, it will proceed by 
“thinking of” instances of lawyers (which 
may be such generic instances as administra- 
tive lawyers, patent lawyers, American law- 
yers, or the like, or may be the specific 
lawyers, Smith, Jones, etc.) and asking if at 
least one instance of “play card games” (e.g., 
“play bridge,” “play poker,” etc.) can be 
found appropriate to each. If more than half 
of the encountered instances of “lawyers” 


meet this test, the general assertion is con- 
», sidered acceptable, 


- In terms of the evidence matrix representa- 
tion, the necessary condition for the program 
„to judge the sentence credible is that more 
than half of the rows of the evidence matrix 
contain at least one positive entry. (If there 
is at least one, it does not matter for the 
program’s purposes how many others there 
might’ be.) Note that the condition is not 
necessarily satisfied if over half of the 
columns contain at least one positive entry, 
since these positive column entries could be 
bunched into a very small number of rows. 
This asymmetrical assumption was deliber- 
ately built into the program because it seemed 


| intuitively correct. 


In order to probe the validity of the above 
assumption, a pilot study was run with 10 


_ different evidence matrices. This study also 


began an exploration of the effects of se- 
eg variables on the credibility of asser- 
ions, 


Pitot STUDY 


Table 1 displays the 10 different evidence 
matrices used in the pilot study. They are 
grouped in the table according to the relative 
number of positive entries and according to 
a categorization of the “specificity” of the 


evidence. By an “object-specific” matrix we 


Mean one in which the positive instances are 
a specific to certain designated objects, and 

e negative instances are all specific to other 
objects, irrespective of the subject. An ex- 
ample of an object-specific evidence matrix 
(of Type a in Table 1) for the assertion 
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TABLE 1 
EVIDENCE Matrices USED IN THE Pitot STUDY 
Fraction of Evidence form 
positive 
instances 
Object specific | Subject specific Mixed 
3/9 Lpt—— | 2p ttt 3p to 
ET ETES 
apes rae Ss SSS 
1/3 4.[+--] | 5. E] none 
6/9 6. TEEI Er rtt 
[+42] [tt] [tee 
++- --- oat 
2/3 Ol tray aol 10, E none 


a See discussion in text. 


“People have gadgets” is as follows: 


Altogether there are three people—A, Brice 
Altogether there are three gadgets—X, Y, Z. 


AhasX A doesn’t have Y A doesn’t have Z. 
B has XB doesn’t have Y B doesn’t have Z. 
Chas X C doesn’t have Y C doesn’t have Z. 


In this example, the entire burden of positive 
evidence lies specifically with the object, ee 
and the entire burden of negative evidence 
with “Y” and “Z.” In order to say yes to the 
question, “Do people have gadgets?” (given 
only the above evidence), an individual must 
generalize from Gadget X to the gadgets in 
general. That is, object-specific evidence 
matrices require generalization over objects 
in order to reach a conclusion. 

Conversely, a “subject-specific” evidence 
matrix is one in which the positive instances 
and negative instances are, respectively, 
specific to different subjects. An example of a 
subject-specific matrix (of Type b in Table 
1) for the same assertion, “People have 
gadgets,” is as follows: 

AhasX Ahas ¥ Ahas Z. 


B doesn’t have X Bdoesn’thaveY B doesn’t have Z. 
C.doesn’t have X C doesn’t have Y C doesn’t have Z. 


In this case, in order to say yes to the ques- 
tion, “Do people have gadgets?” an individual 
must generalize from Person A to the people 
in general. 


In effect, the assumption made in the 


Carroll, Abelson, and Reinfeld (see Abelson 
& Carroll, 1965) computer program specifies 
that subject-specific evidence inhibits gen- 
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eralization, but that object-specific evidence 
does not. In other words, it is easier to gen- 
eralize over objects than over subjects. In the 
case of the two examples above, the first 
example should be more likely to elicit the 
answer yes than the second when the question 
“Do people have gadgets?” is posed. Tests of 
this prediction and its psychological inter- 
retation are discussed below. 

A third type of evidence matrix used in the 
pilot study is a type we have labeled “mixed,” 
in which the positive and negative instances 
are symmetrically distributed. Common sense 
would suggest that a mixed matrix would pro- 
vide the strongest evidential support with a 
fixed number of positive instances. 

In the pilot study, the preliminary in- 
vestigation of semantic variables was begun 
by using 10 different subject-object pairs 
and 2 different verbs. Following Heider’s 
(1946) important distinction between “senti- 
ment relations” and “unit relations,” the 
verbs “like” and “have” were chosen for 
study. 


Method 


Twenty-item questionnaires were constructed. 
Each form of evidence matrix in Table 1 was used 
twice in each questionnaire, once with the verb like 
and once with the verb have. The locations of the 
positive instances were not necessarily in the ordinal 
positions indicated in Table 1, however. Locations 
were randomized from item to item, while preserv- 
ing the appropriate form of the matrix, The 10 
subject-object pairs were as follows: 


Boys (Tom, John, Dick)—winter sports equip- 
ment (skis, skates, sleds) 

People (A, B, C)—gadgets (X, Y, Z) 

Stores (Pegnataro’s, Food Fair, Orange Market)— 
meats (pork, beef, chicken) 

Families (Jones, Brown, Smith)—magazines (Na- 
tional Weekly, Chomley’s, Our Day) 

Girls (Betty, Susan, Barbara)—pets (dogs, cats, 
parakeets) 

Tribes (R, S, T)—animals (deer, buffalo, moose) 

Countries (Uganda, Togoland, Upper Volta)— 
vegetables (cabbage, eggplant, beans) 

Clubs (Metropolitan, Caldwell, Suburban)—party 
games (chess, checkers, cards) 

Students (Bob, Harry, Mike)—history courses 
(ancient, medieval, modern) 

Girls (Nancy, Ginny, Lois)—colored blouses (yel- 
low, blue, pink) 


These subject-object pairs were systematically 
rotated among the evidence matrices to produce 10 
different forms of the questionnaire. On all forms, 
each item was introduced by phraseology indicating 
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that only the presented evidence was to be used in |. 
answering the yes or no question following the item, | 
The general introduction to the questionnaire y 
stressed that none of the questions admitted of 

strictly logical answers, but rather that intuitive 


impressions were to be given. 

Twenty subjects were recruited from a pool of, 
Yale students (typically sophomores) subject to a 
requirement of participation in psychological ex- 
periments, Two subjects received each of the 10 | 
counterbalanced forms. | 


There were no apparent differences among 
the 10 subject-object pairs in the tendency to 
elicit yes responses. The results are given in 
Table 2 for the 10 evidence matrices and the < 
2 verb forms, aggregated over subject-object 
pairs. Each entry is the proportion of yes 
responses, based upon 20 responses, 1 per 
subject, 

It is apparent from the table that when 
two-thirds of the evidential instances were 
positive, there was an almost universal tend- 
ency to agree with the general assertion. Con- 
fining our attention, then, to the upper por- 
tion of the table representing the cases in 
which but one-third of the evidential instances 


Results 


e 


were positive, the following suggestive tend- 
encies should be noted ?: (a) the 3/9 and 1/3 
cases produced virtually identical results; 
(b) the object-specific case produced more 
positive generalization than the subject-spe- 
cific case, but this difference was large only 
for the verb have, not for the verb like, the 
latter having produced generally low agree- | 
ment rates; (c) the mixed case produced 
generally high agreement rates. 


Discussion 


The tendency for object-specific evidence 
to be more powerful than subject-specific 
evidence supports the asymmetrical assump- 
tion of the computer program, The apparently } 
strong difference between have and like came | 
initially as a surprise. Verbs have been found » 
by others, however, to make a considerable 
difference in various ratings of sentence prop- 
erties (Gollob, 1964; Szalay, 1962) and in 


2 The pilot study was designed for a broad and 
not very sensitive look at a number of possibly 
crucial factors, Accordingly, the burden of formal 
statistical statement is deferred until the presenta- 
tion of the main study. 


TABLE 2 


TION OF SUBJECTS GIVING “YEs” RESPONSE AS 
'UNCTION OF EVIDENCE MATRIX AND VERB 


W = 20) 
Evidence form 
Object specific | Subject specific | Mixed 
35 25 70 
75 30 ‘80 
40 .20 p 
‘80 30 = 
1.00 95 1.00 
| 1.00 95 1.00 
1.00 1.00 sr 
‘95 <90 L 


ive and negative instances 
able 1. 


—For the pattern of posit 
enting each matrix form, see 


alizing from one symbolic sentence to 
other (De Soto & Kuethe, 1959). 

a The main study was designed to explore 
|, Properties further, to be more syste- 
ic in the examination of subject and ob- 
properties, and to retain control over the 
interesting variable, subject-specific ver- 
bject-specific evidence. 


Marn STUDY 


ht verb forms were chosen for use in 
main study: three representing “unit 
ons” (produce, buy, have), three repre- 
what might broadly be considered 
timent relations” (get angry with, under- 
d, like), and two with apparently ambig- 
standing on Heider’s dichotomy (steal, 
d). These latter two are ambiguous in 
“steal” may imply both positive senti- 
nt and a condition of ownership, and 
avoid? has connotations both of negative 
ntiment and negative unit relation. (The 
six verbs are not really completely 
übiguous either, and a discussion of verb 
les will prove warranted following the 
tation of results.) 
| the choice of subject-object pairs, it was 
ed to avoid a bothersome syntactic am- 
present in the pilot study. In that 
y, the instances of a plural subject (e-g., 
s”) were often singular (“Betty”). Thus, 


S. 
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evidence such as “Betty likes dogs” perhaps 
did not carry over to “Girls like dogs” for the 
artifactual reason that the “s” on Girls im- 
plied the syntactic necessity, “more than one 
girl.” Therefore, in the main study we used 
plural instances in all cases (e.g, “Fat 
women” as an instance of “Women”). These 
instances are given in Table 3. 

In Table 3 are presented the sets of sub- 
jects, verbs, and objects used in the study. 
These terms were picked in part so as to 
make potentially reasonable any possible com- 
bination of subject, verb, and object (e.g., 
“Families like foods,” “Artists buy animals,” 
etc.). For purposes of greater generality, the 
nonsense words “Gleeps” and “Fung? were 
included along with the variety of more 
commonplace terms. 

As far as the evidence forms were con- 
cerned, only 2 forms, rather than 10, were 
included, as described below. 


Method 


A 128-item questionnaire was constructed, con- 
sisting of 64 core contents presented so as to embody 
each of two forms of evidence: object specific, with 
one (randomly placed) positive instance out of three, 
and subject specific, likewise with one positive 
instance out of three. (See the row of Table 1 
labeled 1/3.) Examples of these two respective forms 


TABLE 3 


SUBJECTS, VERBS, AND OBJECTS USED 
tN MATN STUDY 


j Object: 
PETES Nethe (and iestatnces) 

Gleeps (1 legged. Produce Fung (green, yel- 
3 legged, 5 legged) low, r i 

Families (urban, Buy Magazines (sports, 
suburban, rural) news, fashion) 

Tribes (Southern, | Have Animals (big, small, 
Northern, Cen- medium-sized) 
tral) 

People (poor, rich, | Get angry Gadgets (X, Y, Z) 
middle-class) Tur A a 

Men (old, yow nderstan aughters (red- 
cidla S headed, blonde, 

brunette 

Artists (unknown, | Like Paintings (modern, 
fairly well-known, classical, impres- 
very well-known) sionistic) 

Candidates (A Steal Tides (cronauis, 
party, B party, military, social) 
C party) 

Women (fat, thin, | Avoid Foods (sweet, 
average) salty, spicy) 


EE DES i ee EE E 
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TABLE 4 
PROPORTION OF “YES” RESPONSES AS A FUNCTION OF SEMANTIC ELEMENTS AND EVIDENCE Form 
Subject sss Ose Verb | ss | os Object | ss | os 
Gleeps .62 71 Produce 12 83 Fung — 39 | .67 
Families 59 -69 Buy EK) 81 Magazines 157 | -66 
Tribes 62 71 Have 72 88 Animals 62 | 1 
Candidates 99 69 Steal 70 84 Ideas 62 | de 
People 62 69 Get angry with 62 70 Gadgets .63 | 69 
Men 65 13 Understand At 4H Daughters 61 -69 
Artists 59 71 Like 47 .63 Paintings 64 75 
Women 67 71 Avoid Sl 50 Foods 64 72 
1 


Note.—N = 34 subjects X 8 items = 272. 
^ SS = subject-specific evidence form, 
b OS = object-specific evidence form. 


of items with the content “Tribes have magazines” 
are: 


1. Altogether there are three kinds of tribes— 
Southern, Northern, Central. 
Southern tribes have sports magazines, 
Northern tribes do not have sports magazines, 
Central tribes do not have sports magazines. 
Do tribes have sports magazines? 

2, Altogether there are three kinds of magazines— 
sports, news, fashions, 
Southern tribes do not have sports magazines. 
Southern tribes do not have news magazines, 
Southern tribes have fashion magazines. 
Do Southern tribes have magazines? 


The 64 core contents were selected as a balanced 
subset of the total set of 8X 8X 8= 512 possible 
combinations of subject, verb, and object. A Latin 
Square design was used such that each of the eight 
verbs was used in combination with eight different 
subject-object pairs, balanced over subjects and 
objects. 

Twelve filler items were also included in the 
questionnaire, Since all of the main items presented 
exactly one positive instance out of three, it was 
thought possible that subjects might tend to respond 
“yes” as they began to suspect that no items would 
ever appear with a more favorable Proportion of 


TABLE 5 


ANALYsIs OF VARIANCE OF TRANSFORMED 
Item Proportions 


Source df MS F 
Verb (V) 7 | 1.5703 |58.16*** 
Subject (S) 0390 | 1.44 
Object (0) 7 | 10579 | 2.14* 
Evidence form (E) 1 | 1.4049 |52.03*** 
EXV 7 | .0951 | 3.52** 
EXS 7 | .0206 | <1 
EXO 7 | 0073} <1 
Residual 84 | .0270 — 
(Estimated base-line variance) (0184) 
* 
#5 8 o 
ek < 001, 


Positive instances. In order to identify the potential 
presence of any such context effects, 12 filler items 
(with otherwise unused core contents) containing 
two positive instances out of three were inserted at 
the beginning of half of the questionnaires, In the 
other half of the questionnaires 12 parallel filler 
items each with one positive instance out of three 
were inserted at the beginning. 

The subjects were 34 Yale undergraduates re- 
cruited from the same subject pool as was used for 
the pilot study. v 

The instructions were the same as for the pilot 
study, emphasizing that no test of formal logic was 
involved, and that each item should be answered 
only on the basis of the particular evidence pre- 
sented. 


Results 


There were no significant or even suggestive 
differences attributable to the type of filler 
items, and therefore all 34 questionnaires 
were combined into a single analysis of the 
main items. Table 4 displays the proportion 
of yes responses by subject, verb, object, and 


evidence form. (Data for each semantic ele- , 


ment were aggregated over the eight items 
embracing combinations of the other two ele- 
ments.) 

Cursory inspection of the table suggests 
that the object-specific evidence rather con- 
sistently produced more yes responses than 
subject-specific evidence; and further, that 
the verb made a considerable difference, 
whereas neither the subject nor the object 
seemed to matter. 

The design of the questionnaire permitted a 
systematic analysis of variance. The design 
may be viewed as an 8 X 8 Latin square (of 
alternatively, as a one-eighth fractional fac- 
torial of an 8 X 8 X 8), replicated over the 
two evidence forms. The heuristic assump- 


| 


x 
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tion of negligible interaction among sentence 
subjects, verbs, and objects was made. Since 
„the proportions of yes responses to the items 
can be viewed as binomial proportions (based 
on 34 responses) the arc-sine transformation 
(Winer, 1962) was used, even though those 
"proportions are not strictly independent of 
one another. 

Table 5 summarizes the analysis of vari- 
ance on the transformed proportions for the 

| 128 questionnaire items. The first seven 
„sources of variation listed in the table were 
tested for significance using the “residual” 
Mean square as the error term. This error 
‘term contains portions of all interactions 
among the verb, subject, and object factors, 
as well as higher-order interactions of evi- 
dence form with these combinations.* To the 
extent that any of these interactions were 
systematically present in the data, tests of sig- 
nificance using this error term would tend to 
be biased toward acceptance of the null hy- 
pothesis, Bearing this in mind, we may now 
turn to an examination of these several F 
tests. 

Clearly the factors of verb and evidence 
form are significant. Relative to these effects, 
all others are of much smaller magnitude. The 
interaction of verb and evidence form is sta- 
tistically significant, though small. The object 
factor is of borderline significance, and the 
subject factor seems not to be significant un- 
less one wishes to argue that the error term 
is artifactually large enough to mask its effect. 
The remaining two tests yield rather clearly 
nonsignificant outcomes. 

Turning now to the residual term itself, we 
make use of the valuable concept of the base- 
line variance for the arc-sine transformation. 
(Mosteller & Bush, 1954). This theoretical 
variance is based on the statistical properties 
of binomial proportions, and does not depend 
upon computations from observed data. For 
the particular scaling of the arc-sine values in 


8It is possible to achieve but one separation of 
factors within this residual, namely, a separation 
Of those interactions which do not involve evidence 
form from those which do. The sum of squares 
attributable to the former set of interactions was 
‘11715, and the latter, 1.0926. These two figures, each 
with df=42, were so close to each other as to 
Yecommend the convenience of pooling them into 
the single residual term which appears in Table 5. 
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Winer’s table, the value of the base-line vari- 
ance is the reciprocal of the number of cases 
on which each proportion is based, namely, 
1/34 = .0294. 

However, the base-line figure is strictly 
appropriate only for uncorrelated proportions, 
and proves too large when the proportions 
are positively correlated. The exact extent of 
this exaggeration is unknown, but via an 
approximate calculation* we estimate that 
the base-line figure should be revised down- 
ward to .0184. This value has been entered 
in Table 3 as a standard for assessing the 
residual mean square. The comparison is sug- 
gestive only, since the base-line figure is not 
exact, but it seems reasonable to conclude 
that some slight systematic interactions are 
present in the residual term, and that all sig- 
nificance tests indicated in Table 3 were some- 
what too conservative. 

Statistical subtleties aside, our analysis 
strongly establishes the verb and the evidence 
form as the predominant sources of variance 
in our data, In the discussion below, we shall 
confine ourselves to a discussion of these two 
factors. Our explanations will be admittedly 
speculative, and we prefer not to clutter the 
presentation with speculations about effects 
of small magnitude. 


Discussion 


This study confirmed, once again, as did 
yet a third study by Abelson,’ that the tend- 


4A simple mathematical analysis suggests that the 
base-line variance would be correct to a first ap- 
proximation if reduced by a factor (1—r), where r 
is the average item intercorrelation. We have not 
calculated all intercorrelations in the 128 X 128 
item matrix, but a random sample of paired items 
yielded an average phi coefficient of .373. The 
original base-line figure of .0294 was therefore 
multiplied by .628 to arrive at the final approximate 
value. 

A deeper study of the appropriate correction 
procedure would be worthwhile. Analyses of cor- 
related proportions are quite common, and the use 
of the arc-sine transformation has in our opinion 
considerable advantages over the very cumbersome 
chi-square techniques ordinarily proposed. 

5 In this small-scale study, an eight-item question- 
naire was administered orally to a colloquium 
audience of 39 persons. The verbs “have” and 
“like” were used in subject-specific and object- 
specific forms with one positive instance out of three. 
Two different sentence contents were used. There 
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ency to form inductive generalizations is an 
asymmetric function of the evidence form. 
Subjects in all three studies generalized more 
freely over objects than over subjects, In the 
pilot study, this tendency might perhaps have 
been attributable to the fact that the subject 
instances were singular, while the object in- 
stances were plural, Thus, there was perhaps 
a felt contradiction in generalizing, for ex- 
ample, from Mary (singular) to girls (plural), 
a contradiction not present in the generaliza- 
tion over objects, However, this “bug” in the 
pilot study was eliminated in the main study, 
and the asymmetry between subjects and ob- 
jects remained (though it was not quite as 
strong an effect as it had been in the pilot 
study). 

What explanation is there for the appar- 
ently stronger tendency to generalize over 
objects rather than over subjects? One line of 
explanation is syntactic: the subject of a 
sentence comes first, and is perhaps thereby 
made more salient than the object. The at- 
tachment of two negative instances out of 
three to the salient sentence subject might 
then have a more inhibiting effect on the 
generalization than would the attachment of 
two negative instances out of three to the 
less salient object. This explanation offhand 
does not seem very compelling, but at least it 
is not inconsistent with a second, more “psy- 
chological” explanation, and in any case, it 
can be tested in further studies by using re- 
versed sentences such as, “Animals are 
bought by tribes,” 

The second line of explanation proceeds 
from the assumption that differences among 
types of people are psychologically more im- 
portant than differences between types of 
objects. It has been argued that one forms an 
impression of a person as an individual 
entity, as a unit distinct from all other units 
(Asch, 1946). In our main study, the sen- 
tences involved person types (eg. “fat 
women,” “rural families,” etc.) rather than 
individual persons, but one might argue that 
person types are perceived in unitary fashion 
also, as in the process of stereotyping. There- 


were 36.5% more yes answers to generalizations 
involving haye than like, and 13.5% more yes an- 
Swers to object-specific than subject-specific forms. 
There was no interaction between these two effects. 
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fore, subjects are unwilling to generalize from 


a particular subject type to people in general, f 
That one type of person steals does not mean. 


that most people do. On the other hand, ob- 
jects may be seen as much more readily inter- 
changeable. That one type of object is stolen 
may well indicate that other types of objects 
are stolen, 


The reader may note a flirtation with tau- 


tology in the above explanation: there is less 


of a tendency to generalize over subjects than ` 


over objects because there is more of a tend- 
ency to distinguish among subjects than 
among objects. Certainly if we are to begin 
to avoid circularity of this sort, we must 
specify in structural and/or functional terms 
some cognitive differences between concepts 
representing “things which act” and those 
representing “things acted upon.” 


The computer program which served as the — 


inspiration for our series of experimental 
studies does in fact recognize differences be- 
tween these two types of concepts. They are 
stored in memory in somewhat different 
fashion, and there is a process difference as 
well. While it is not appropriate here to go 
into this process difference in detail, we may 
briefly indicate the nature of the difference 
by noting that memory search operations are 
hierarchically governed by concepts repre- 
senting “actors,” rather than those represent- 
ing “actees.” We believe that one substantial 
asset of a computer simulation approach to 
cognitive processes is that it forces the opera- 
tional specification of otherwise vague or 
tautological notions. 

Regardless of the evidential form of the 
questionnaire items, why do the verbs act 
differently? From Table 4 one sees that the 
verbs “have,” “produce,” “buy,” and “steal” 
generate more yes responses than the other 
four verbs. An intuitive explanation might 
be that one does not have to have all the 
types of X’s to “have X;” no one has all 
the X’s; thus, not to have a particular X 
does not imply a very negative relationship 
between the subject and X. Similar considera- 
tions apply to the verbs produce, buy, and 
Steal, all possessing the character of unit rela- 
tions. This explanation is reminiscent of 
Cartwright and Harary’s (1956) argument 
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that it is difficult to establish negative unit 
relationships. 
With respect to the other four verbs, “like,” 


“understand,” “avoid,” and “get angry with,” 


the situation is different. It is possible for 
someone to like all types of X, or avoid most 


' types of X. Since one may like or under- 


stand many things, an instance of not liking 
or understanding X carries greater negative 
force than not having X. Thus, generaliza- 
tions are not made so readily with verbs,such 
as like when over half the instances are nega- 
tive, 

Another possible clue to the differences be- 
tween verbs may lie in the aforementioned 
tendency to view individuals as units. Asch 
(1946) has shown that some descriptive 
words are much more important in the forma- 
tion of impressions than others. De Soto and 
Kuethe (1959) have also found that certain 
verbs heavily determine the overall impres- 
sion of the person. No doubt, the verb steal 
is such a verb. Thus, if it is attributed once 
to a person, one is left with an overall im- 
préssion of the person as a thief. In other 
words, one positive instance with a verb of 
this nature applied to a person (or type of 
Persons) may be sufficiently powerful to over- 
ride a majority of negative instances. 

To summarize our two major lines of ex- 
planation of the verb results (no doubt there 
are others), it seems that subjects will gen- 
eralize more often if either the negative form 
of the verbs does not imply a very negative 
relationship between subject and object, or if 
the verb is such as to lead to a holistic im- 
pression of the subject class. It is not clear 
which explanation should take precedence, 
and indeed, the two overlap considerably 
with respect to the present limited selection 
of verbs, One also wonders how to designate 
our two groups of four verbs. It seems awk- 
ward to follow Heider and refer to “unit 
verbs” and “sentiment verbs.” The former 
term lacks suggestiveness, and the latter does 
not seem broad enough. One possible termi- 
nology which preserves much the same mean- 
ing is to denote the verbs have, buy, produce, 
and Steal as manifest verbs, and like, under- 
stand, avoid, and get angry with as subjective 
verbs, At any rate, distinctions between verb 


classes are very important in studies of sub- 
jective logic using verbal materials, and the 
matter is much worth pursuing in further 
studies. 

From the first, it has been obvious that 
more effects the judgment of credibility of an 
assertion than the evidence matrix of its in- 
stances. Another aspect of the computer pro- 
gram (Abelson & Carroll, 1965) which as- 
sesses the credibility of an assertion consists 
in the examination of concepts of which the 
given statement is an instance, This involves 
going up the “tree” of evidence instead of 
down. It will be necessary in later studies to 
see whether verbs and evidence form play a 
part here, too. 
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SELF-ACCEPTANCE, DISCREPANCY BETWEEN PARENTS’ 
PERCEPTIONS OF THEIR CHILDREN, AND 
GOAL-SEEKING EFFECTIVENESS! 
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of this study was to d 
arents’ 


The purpose 
ceptance, differences between p: 


seeking effectiveness within an academic setting. Mi 
about 350 male and 400 female college students of their self 
perceptions and acceptance of them, and parental 


self-acceptance, their parents’ 
attitudes toward academic pursuits. 

related to academic effectiveness in m; 
between parents’ pt 
ceptance in females and in general to 


females. Males whose parents either agreed that a colle; 


marily for intellectual broadening or 


broadening were more effective than 


matter. 


Self-esteem and its correlate self-acceptance 
(Crowne, Stephens, & Kelly, 1961) have been 
related positively to intellectual achievement 
among subjects of varying ages while being 
negatively related to anxiety (Coopersmith 
1959; Mitchell, 1959) and dependency (Tud- 
denham, 1959). These findings suggest that 
self-esteem or self-acceptance indicates an 
internal stability conducive to effective pur- 
suit of chosen goals. Individuals high in self- 
acceptance may be less egocentric and have 
less need for extrinsic evaluation; if this is 
true, they may become more intrinsically 
involved in tasks and pursue goals with less 
anxiety over possible failure or negative 
evaluation by others (Harvey, Hunt, & 
Schroder, 1961). On the other hand, indi- 
viduals low in self-acceptance or self-esteem 
may have greater desire to attain goals per- 
ceived as increasing this esteem and may have 
a consequent anxiety in new situations in 
which self-worth is tested. In this regard, 
while individuals with unrealistically low 
self-esteem were found to have higher need 
for achievement than those not fitting this 


1 This study was performed under the sponsorship 
of the United States Office of Education under the 
direction of Glenn Terrell. Thanks are extended 
to Catherine Felknor for her help in organizing and 
collecting data for this study. 

E New at the University of Illinois at Chicago 

Arcie, 


elineate the interrelationship of self-ac- 
perceptions of their children, and goal- 
easures were obtained on 
-perceptions and 


Self-acceptance and parental acceptance 
ales but not in females, The discrepancy 


erceptions of their children related negatively to self-ac- 


academic effectiveness in both males and 
ge education was pri- 
agreed that it was primarily for social 
those whose parents disagreed on this 


description they did not necessarily attain 
higher intellectual achievement than those 
whose self-esteem was realistically high 
(Coopersmith, 1959). 

A person’s self-perception or self-evaluation 
may stem in part from perceptions or evalua- 
tions of him by others (Cooley, 1912; Rogers, 
1959). If this is true, a discrepancy between 
perceptions of their child by father and 
mother may create ambiguity in the child’s 
perception of himself. Specifically, discrep- 
ancy between parents’ perceptions of their 
child along a given dimension may prevent 
the child from acquiring a perception of him- 
self as either high or low along this dimen- 
sion; furthermore, any point along the di- 
mension that he may select as descriptive of 
himself is incongruent with the perception of 
him by at least one significant other. A state 
of cognitive imbalance (Heider, 1958) is 
therefore created in which the individual is 
forced to adopt attitudes that are discrepant 
from those of a positively regarded other. 
This conflict may lead to an unstable self- 
perception and to lack of acceptance of one- 
self along the dimension in question. If this 
reasoning is correct, and if it is true that 
lack of self-acceptance leads to ineffective goal 
striving as suggested above, discrepancy be- 
tween parental evaluations of their child 
should also be related to lack of effectiveness 
in goal seeking. 
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The effect of between-parent differences in 
perception and evaluation of their child may 
also be viewed from a somewhat different 
perspective. Differences between parents’ 
evaluations of their child’s behavior may pre- 
vent the child from learning behavior patterns 
that consistently lead to positive evalua- 
tion by others, and hence to positive self- 
evaluation. To this extent, concentrated effort 
directed toward attainment of specific goals 
(e.g., academic achievement) may not be 
exerted. 

Similar reasoning should apply in hypothe- 
sizing effects of differences between parents’ 
attitudes in areas of direct relevance to their 
child. To the extent that the child is faced 
with parental differences in evaluating an 
object or set of objects of importance to him, 
he also may be unable to develop consistent 
attitudes toward these objects; goal-seeking 
effectiveness in this area may therefore suffer. 
As these parental attitudes may not directly 
affect the individual’s perception of himself, 
however, a significant relationship between 
self-acceptance and parental differences in 
general attitudes may not occur. 

The suggested relationships among these 
variables may not generalize across different 
domains of behavior. Although differences in 
perceiving a child along certain dimensions 
may create ambiguity in self-perceptions and 
subsequent lack of self-acceptance along these 
dimensions, the lack of self-acceptance may 
not generalize to dimensions along which he 
is perceived consistently. Similarly, any ef- 
fects which self-acceptance along a set of 
dimensions may have upon goal-seeking ef- 
fectiveness may be restricted to the particular 
domains to which the dimensions are relevant. 

The study reported here was concerned 
with relationships among self-acceptance, 
goal-seeking effectiveness, and parental dif- 
ferences in perceiving their children and in 
other attitudes relevant to the attainment of 
academic goals. Within this domain, the fol- 
lowing hypotheses are suggested by the pre- 
ceding discussion: 


1. There will be a positive relationship be- 
tween academic effectiveness and self-accept- 
ance along relevant dimensions. 

2a. The discrepancy between parents’ 
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perceptions of their children along certain di- 
mensions will relate negatively to their chil- 
dren’s self-acceptance along these dimensions, 

2b. The discrepancy between parents’ per- 
ceptions of their children along relevant 
dimensions will relate negatively to their chil- 
dren’s academic effectiveness. 

3. The discrepancy between parental atti- 
tudes toward education will relate negatively 
to their children’s academic effectiveness, 


METHOD 
Subjects 


As part of a larger study on nonintellective factors 
relating to academic effectiveness in college, 525 male 
and 619 female university freshmen, selected at 
random, were sent questionnaires within a month 
after their entrance into college. Females in the 
sample were predominantly enrolled in the college 
of arts and sciences while males were about evenly 
distributed between the college of arts and sciences 
and the college of engineering. Questionnaires were 
also sent to parents of these students to assess their 
perception and acceptance of their children along 
selected personality dimensions and their attitudes 
toward the academic environment. Responses were 
received from 350 fathers and 383 mothers of male 
subjects and 399 fathers and 444 mothers of female 
subjects. Of the students, 393 males and 496 
females responded. Males and females were analyzed 
separately. 


Measures 


Academic effectiveness was defined in this study 
as the degree to which academic performance exceeds 
or falls short of that expected on the basis of 
aptitude alone. It was measured by computing for 
each subject the difference in standard scores between 
his performance (measured by first-term freshmen 
grade-point average) and his aptitude (measured by 
the combined Verbal and Mathematics scores on the 
Scholastic Aptitude Test of the College Entrance 
Examination Board), Male and female subjects were 
each divided into high (subjects over 1 sigma above 
the mean effectiveness score), normals (subjects 
within 1 sigma above or below the mean), and lows 
(subjects over 1 sigma below the mean effectiveness 
score). 

Self-acceptance, self-perception, and parents’ per- 
ceptions. To obtain a list of dimensions relevant te 
the academic domain, 24 adjectives were selected 
from the Adjective Check List (Gough, 1950; Sarbin 
& Rosenberg, 1955) which were found by Shaw 
(1960) to differentiate parents of high-school sub- 
jects at two levels of academic achievement, Their 
effectiveness in discriminating such subjects was the 
criterion of their relevance to the academic domain. 
The adjectives selected were the following: 


in 


ji 


——) 


5 


i 


acceptable optimistic 
alert poised 
ambitious responsible 
calm hardworking 
“competent stable 
confident studious 
helpful successful 
» logical stubborn 

*industrious worthy 
merry broadminded 
mature competitive 
nervous motivated 


- Students were instructed to estimate, on a 5-point 
scale ranging from “seldom” to “most of the time,” 
«the amount of time they felt each adjective was 
descriptive of them. For each of these ratings, they 
were then asked to assess, on a 5-point scale ranging 
from “dislike very much” to “like very much,” their 
feelings about themselves in that respect. Both 
mothers and fathers were similarly requested to rate 

. their child using the same set of adjectives and, 
for each adjective, to record their feelings about 
their child in that respect. 

For each adjective, the absolute difference between 
the rating by the mother and that by the father 
was used as an indication of parental discrepancy 
in perceiving or evaluating their child along this 
dimension, The sum of such differences provided one 
measure of the total discrepancy between parental 
Perceptions of their son or daughter. In comparing 
groups, the number of dimensions along which the 
average discrepancy between parents of subjects in 
one group exceeded the average discrepancy between 
parents of subjects in the second was also used as a 
measure of the relative discrepancy in parental per- 
ceptions, The sum of the ratings of how the subject 
felt about himself along these dimensions was used 
to indicate his self-acceptance. Parents’ acceptance of 
their sons and daughters was assessed similarly. 

Parental attitudes toward education. Parents of 
subjects were asked to respond, on a 5-point scale 
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ranging from “solely broadening oneself intellectu- 
ally” to “solely broadening oneself socially,” the 
degree to which they felt the importance of a college 
education was for intellectual or social broadening. 


RESULTS 


Both self-acceptance and parents’ accept- 
ance of their children were compared among 
males and females at three levels of aca- 
demic effectiveness, For each dependent vari- 
able, the effects of both sex and academic 
effectiveness were analyzed using a two-way 
analysis of variance. An unweighted-mean 
approximation method (Walker & Lev, 1953) 
was used in these analyses due to the unequal 
cell frequencies, Results shown in Table 1 
indicate that academic effectiveness was te- 
lated positively to self-acceptance and also 
to acceptance by both the mother and the 
father. The Sex X Effectiveness interaction 
was significant in the case of mothers’ ac- 
ceptance, Simple effects analyses show that 
in this case effectiveness was significantly re- 
lated to maternal acceptance among males 
(F= 14.59, df = 2/820, p<.01) but not 
among females (F = .58). Similar analyses 
also showed that the relationship of self- 
acceptance to effectiveness was significant 
among males (F = 5.20, df = 2/738, p < .05) 
but not among females (F = 2.78). 

Hypothesis 2a was supported only among 
females. The mean self-acceptance of female 
subjects whose parents showed low overall 
discrepancy (below the mean) in evaluating 


TABLE 1 


MEAN SELF-ACCEPTANCE AND PARENTAL ACCEPTANCE OF STUDENTS AT THREE LEVELS 
or ACADEMIC EFFECTIVENESS 


Level of academic effectiveness F 
MSe 
High Nopal Tey Sex Effectiveness | Interaction 
Self-acceptance 
Males 91.3 (63) | 90.0 (266) | 85.5 (64) 59 5.83** AT 1.779 
Females 89.5 (82) | 89.9 (329) | 87.0 (85) 
Mother’s acceptance me é 
Males 102.6 (64) | 99.6 (259) | 93.2 (60) | 7.99** | 10.32 4.85 1.580 
Females 102.4 (76) | 101.2 (295) | 100.5 (73) 
Father’s acceptance oe 
ales 102.6 (57) | 100.0 (235) | 94.2 se 3B 12.06 1.32 1.846 
Females 100.8 (65) | 101.2 (275) | 96.7 (59) 


Note, —N in each iven i 
group given in parentheses. 
we? S05, two-tailed. n p 
Ż <.01, two-tailed. 
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TABLE 2 


MEAN DISCREPANCY BETWEEN PARENTS’ 


PERCEPTIONS OF STUDENTS AT 


THREE LEVELS or ACADEMIC EFFECTIVENESS 


Level of academic effectiveness F 
MSe 
High Soral tow Sex Effectiveness | Interaction 
Males 11.7 (53) 13.6 (219) 14.2 (48) 
Females 12.0 (64) 12.5 (256) 15.1 (56) .00 6.94"" 93 568 


Note.—N in each group given in parentheses. 
** p <.01, two-tailed. 


their daughters was significantly higher than 
that of females whose parents showed rela- 
tively high discrepancy in their perceptions 
(My = 90.2, Mz = 87.7, t = 2.22, df = 316). 
A corresponding relationship was not found 
among males (M, = 89.9, Mz = 89.2, t = .48, 
df = 256). 

Mean overall discrepancies between par- 
ents’ evaluations of subjects at three levels 
of academic effectiveness are shown in Table 
2. An analysis of variance was also performed 
on these data. Academic effectiveness was 
negatively related to the discrepancy between 
parents’ evaluations as hypothesized. Further- 
more, significant differences were found in the 
amount of discrepancy in evaluations among 
parents of both males and females at dif- 
ferent levels of academic effectiveness when 
this discrepancy was assessed along each di- 
mension. For each pairing of effectiveness 
levels, Table 3 shows the number of dimen- 


TABLE 3 


Comparison or THREE LEVELS OF ACADEMIC EFFEC- 
TIVENESS WITH REGARD TO THE MAGNITUDE OF 
THE DISCREPANCY IN PARENTS’ PERCEPTIONS 


Number of 
dimensions 
Males |Females 
Parents of highs>parents of normals 3 8 
Parents of normals>parents of highs | 21 15 
z 3.66** | 1.46 
Parents of highs>parents of lows 3 5 
Parents of lows>parents of highs 20 19 
z 3.54** | 2.86** 
Parents of normals>parents of lows 7 5 
Parents of lows>parents of normals 17 18 
z 2.04* |2.72** 


ee ee |e 


*p <.05, two-tailed. 
** b <.01, two-tailed. 


sions along which the average discrepancy 
between parents’ perceptions was greater at 
one level than the other. 

Dimensions along which the average dis- 
crepancy was identical at the two levels of 
effectiveness being compared were not con- 
sidered. For five of the six comparisons, the 
proportion of dimensions along which parents 
of more highly effective subjects were more 
discrepant in their evaluations than parents 
of less effective subjects was significantly 
less than chance (.50). Hypothesis 2b is 
therefore generally supported. 

Table 4 shows mean academic effectiveness 
scores among males and females whose par- 
ents expressed various combinations of atti- 
tudes toward the value of a college education. 
Responses were divided into groups above 
and below the approximate median on the 
item assessing the degree to which college 
was felt to be for intellectual or social 
broadening. Parents were considered to be 
similar in their attitude if their responses fell 
either both above or both below the median 
response, The hypothesis that subjects whose 
parents were similar in their attitudes toward 
college would have higher academic effective- 
ness than subjects whose parents expressed 
dissimilar attitudes was supported only among 
males; the mean effectiveness of males whose 
parents were in agreement (M = .15) dif- 
fered significantly (¢= 2.22, df=348, P 
<.05) from the mean of subjects whose 
parents disagreed (M = —.15). A similar 
relationship among females was not found. 
As expected no differences were found in 
self-acceptance as a function of differences 
between parental attitudes; these results have 
not been tabulated. 
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TABLE 4 


MEAN AcADEMIc EFFECTIVENESS OF STUDENTS WHOSE PARENTS Have VARIOUS 
COMBINATIONS OF ATTITUDES TOWARD COLLEGE 


> -TE 


| Males 


Females 


Value of college to mother 


Value of college to father 


Value of college to father 


Intellectual Social Intellectual Social 
P Intellectual 15 (231) —.17 (61) .09 (207) 01 (80) 
Social —.00 (54) —.08 (55) 


—.10 (28) 


-15 (30) 


Note.—VN in each cell given in parentheses, s 


` 


Discussion 


2 Although self-acceptance and also parental 
acceptance were related to academic effective- 
De: ness as hypothesized, some possible contingen- 
Cies in these relationships may be indicated. 
That the relationship of self-acceptance to 
academic effectiveness did not reach signifi- 
cance among females suggests that additional 
factors should be taken into account when 
considering the sources of academic effective- 
ness in females. Other research by the authors 
(Wyer & Terrell, 1963) indicated that fe- 
males may be inspired to attain academic 
goals due to their perceived relevance to the 
attainment of more primary social goals. 

In this regard it may be noteworthy that 
maternal acceptance was related positively to 
academic effectiveness among males but not 
among females. Conceivably males are en- 
couraged by their mothers to manifest be- 
havior relevant to academic pursuits and are 
_ accepted contingent upon this behavior; fe- 
_» males, however, may be neither positively nor 
_ Negatively reinforced by their mothers for 
such behavior. On the other hand, results sug- 
gest that the patterns of paternal reinforce- 
Ment may be similar for members of both 
Sexes, 

The finding that similarity in parental 
attitudes toward the value of college was 
‘Telated positively to academic effectiveness 
among males but not among females may be 
interpreted similarly. If females value educa- 
tion primarily as a means to social goal attain- 
ment, they may not feel parental differences 
in emphasis upon social or intellectual goals 

college to be conflicting while males, who 
May see academic goals as independent of 


social goal attainment (Wyer & Terrell, 
1963), may feel greater conflict due to such 
parental differences, This ad hoc interpreta- 
tion should be subject to further investigation. 

The discrepancy between parents’ evalua- 
tions was related to self-acceptance among 
females but not among males. Parents may 
have a relatively greater influence on female 
children’s attitudes toward themselves than 
they do on male children’s self-attitudes. Males 
may be taught to depend upon sources outside 
the home for evaluation due to the instrumen- 
tality of extrafamilial sources in providing 
avenues for vocational goal attainment; this 
may be particularly true with regard to 
evaluation along dimensions relevant to at- 
tainment of these goals. Females, on the 
other hand, do not depend principally upon 
sources outside the home for evaluation of 
their adequacy in fulfilling requirements of 
their ultimate social role and therefore may 
be trained to place greater emphasis upon 
intrafamilial evaluation. 

Two measures of parental discrepancy in 
evaluating their children were used: a fre- 
quency count of the dimensions along which 
parental disagreement is greater in one group 
than in another and the total discrepancy 
summed across all dimensions. While both 
measures appear similar in their predictive 
utility (Tables 2 and 3), each has possible 
shortcomings. In the first measure, overlap ` 
in the meanings attached to the dimensions 
and also response set may prevent discrep- 
ancies in evaluations along various dimensions 
from being independent of one another; 
between-group differences may therefore be 
magnified. The measure of total discrepancy, 
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on the other hand, may give a nonrepresenta- 
tive picture since a total discrepancy due to 
large differences along a few dimensions may 
be equivalent to a discrepancy due to small 
differences along many dimensions. 

In conclusion, the hypothesized effects of 
self-acceptance and parental differences in 
attitudes on effectiveness in goal seeking, 
while generally supported, must be made con- 
tingent upon the relevance of these goals to 
the individual and to his ascribed social role. 
These contingencies make generalization of 
the effects of the variables used in this 
study somewhat questionable; in fact, they 
suggest that the effects of these variables 
on goal-directed activity may be manifested 
only in the specific areas in which they are 
assessed. Furthermore, these variables may 
have an effect only if the goals in question 
are felt to be important, 
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50 mothers of preschool children were administered the Parental Attitude Re- 
search Instrument (PARI) and the Maryland Parent Attitude Survey (MPAS). 
On the basis of scores on the Authoritarian, Hostility-Rejection, and Demo- 
cratic factors of the PARI, and on the Disciplinarian and Rejecting scales of 
the MPAS, Ss were dichotomized into high and low groups. The attitude groups 
were compared on scores of maternal behavior, based on observation of indi- 
vidual mother-child interaction sessions in an experimental play setting. High- 
and low-attitude groups differed significantly as hypothesized on nonattention, 
directing, restriction, forbidding, verbal interaction, interactive play, and rate 
of compliance. There were no significant differences between the groups on at- 
tentive observation, responsiveness to questions, helping, criticism, praise-ap- 
proval-affection, teaches-explains-demonstrates, questions, and structurizing, 


The influence of the mother-child relation- 
ship on the social development of the child 
„has in general been studied from two ap- 
proaches: an appraisal of parental attitudes, 
on the one hand, or a consideration of spe- 
cific parental practices and behaviors, on the 
other hand. Bell (1958) has described many 
of the existing studies as having a “reverse 
leap-frog” design in that “inferences from ob- 
tained differences in parental attitudes involve 
a backward extrapolation in time, concep- 
tually ‘leaping over’ the actual behavior of 
parents with their children.” It would seem 
desirable to examine more closely the linkage 
between attitudes and behavior, since a 


© knowledge of how maternal attitudes are 


manifested in behavior would shed more light 
_ On the primary social experience of the child 
with its mother. 

The present study is an attempt to investi- 
gate the relationship between maternal atti- 
tudes toward child rearing and family life, as 
measured by questionnaires, and the observed 
interaction of the mother with her preschool 


1 This research was carried out during the tenure 
of a Predoctoral Fellowship from the National In- 
stitute of Mental Health, United States Public Health 
Service. The data reported here are in part based on 

+a doctoral dissertation submitted to the Graduate 
School of American University. Gratitude is ex- 
Pressed to Irwin Altman, Chairman, for his advice 
and suggestions. 

Part of the data was presented in a paper given 
at the Eastern Psychological Association meetings, 


l- Philadelphia, Pennsylvania, April 16-18, 1964. 


? Now at West Virginia University. 


child in a specific play situation. The basic 
question to be answered is whether the dif- 
ferences which mothers express in attitudes 
are related to differences in maternal behav- 
ior. A secondary question is whether different 
attitude instruments differentially predict so- 
cial behavior. 

The attitude instruments used were the Pa- 
rental Attitude Research Instrument (PARI) 
developed by Schaefer and Bell (1958), and 
the Maryland Parent Attitude Scale (MPAS) 
developed by Pumroy (1960). Several factor 
analyses of the PARI (Schaefer, 1957; Wel- 
kowitz & Cohen, 1962; Zuckerman, Ribback, 
Monashkin, & Norton, 1958) have yielded 
three factors: Factor I is an authoritarian- 
controlling factor; Factor II is a hostility- 
rejection factor; Factor III is a democratic- 
equalitarian factor. The MPAS attempts to 
control for the element of “social desirability” 
by using a forced-choice format in which 90 
pairs of items, matched for social desirability, 
are presented to the subject. Four scale scores 
are obtained: Disciplinarian, Indulgent, Pro- 
tective, and Rejecting. 

From the general hypothesis that differ- 
ences in maternal attitudes toward child rear- 
ing and family life are associated with dif- 
ferences in maternal behavior, five specific 
subhypotheses were derived, expressed in 
terms of the behavioral expressions of the 
particular attitudes, as follows: 

1. Mothers who are high on the authori- 
tarian factor of the PARI will exhibit more 
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TABLE 1 
INTERCORRELATION OF ATTITUDE FACTORS AND SCALES 
PARI MPAS 
Se RORUN Democratic pi Rejecting Indulgent 
PARI 

Hostility-rejection A3** 

Democratic .02 01 
MPAS 

Disciplinarian .30* lt —.04 

Rejecting —.12 26 —.40** —.05 

Indulgent —.21 = 30" 29° e =.75** —.33* 

Protective 00 06 +13 —.31* —.62** Al 
M 173.22 55.48 49.86 23.62 19.68 20.30 
SD 28.38 7.11 4,53 7.21 6.00 6,90 

*p< 05, 
"D $ 01, 


attentive observation, directing, and restric- 
tion behavior in an interaction situation than 
mothers low on the authoritarian factor of 
the PARI. 

2. Mothers who are high on the hostility- 
rejection factor of the PARI will exhibit more 
nonattention, forbidding, and criticism and 
less verbal interaction, responsiveness to ques- 
tions, interactive play, helping, and praise- 
approval-affection than mothers low on the 
hostility-rejection factor of the PARI, 

3. Mothers who are high on the demo- 
cratic factor of the PARI will rank higher 
on teaches-explains-demonstrates, questions, 
structurizing, and rate of compliance than 
mothers low on the democratic factor of the 
PARI. 

4. Mothers who are high on the Discipli- 
narian scale of the MPAS will exhibit more 
attentive observation, directing, and restric- 
tion behavior in the mother-child interaction 
than mothers who are low on the Discipli- 
narian scale of the MPAS. 

5. Mothers who are high on the Rejecting 
scale of the MPAS will exhibit more nonat- 
tention, forbidding, and criticism and Jess 
verbal interaction, responsiveness to ques- 
tions, interactive play, helping, and praise- 
approval-affection than mothers who are low 
on the Rejecting scale of the MPAS. 


METHOD 
Subjects 


A group of 50 mothers and their preschool chil- 
dren, who attended a cooperative nursery school 
sponsored by the District of Columbia Recreation 


Department, served as subjects. The child 
boys and 28 girls—ranged in age from 3 y 
months to 5 years, 6 months, with a median 
4 years, 5 months. The group was homogt 
racial composition (Caucasian) and resided in a‘ 
sirable though nonprestige area of the city. 
dian age of the mothers was 31.8, They were 
dominantly high-school graduates with husbanı 
the most part in nonprofessional occupations. 
incomes were concentrated in the $5,000 te $733 
range, 


Procedure 


At a general meeting of the mothers’ group 
Purpose of the research was explained as invo 
a study of the play behavior of young children ¥ 
an adult such as the mother is present. The € 
group participated (with the exception of one 
who left the area and another who was ill). 

The PARI and the MPAS were administer 
the mothers during group meetings prior 
collection of observation data. The relationsh : 
tween the attitude scales and the observation - 
sions was not explained. For each subject thi 
factor scores on the PARI and four scale score 
the MPAS were obtained. Product-moment B: 
correlations of these seven scores and means â 
standard deviations are shown in Table 1. W 
the exception of the high negative corre 
between the Disciplinarian and Indulgent 
of the MPAS, which were anticipated fro 
forced-choice nature of the instrument, the al 
scales do not appear to be highly correlated | 
one another, 

Code numbers were used on the attitude $ i 
sheets, so that no knowledge of the mothers’ 4 
titude scores was available to the observer 
interaction sessions. When all observations were co 
pleted and the data were ready for analy: 
attitude scores were decoded and compared V 
the behavior scores. k 

The group leader prepared the children for 
interaction sessions by telling them that each 
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t 
would have a turn to have a play period alone with 
his mother. The children became familiar with the 
presence of the observer at the school and seemed 
f to-accept her as simply another of the adults who 
were usually about the recreation center. Since the 
children were accustomed to having their mothers 
present at the center periodically to assist the 
leader, they apparently attached no special sig- 
nificance to the mother’s presence on the day of 
the observation. 

Prior to each session the mother was again told 
‘that the purpose of the research was to study the 
play behavior of young children when the mother 
was present. The mother was told that during the 
30-minute session she was free to do as she wished, 
that she might think of the session as a half hour 
in her own home during which she was unoccupied 
by household duties and was free to be in the same 
room with her child, If a mother occasionally 
persisted in asking how she should behave, she was 
*, again told that she was free to do anything she 
liked, The experimental sessions thus appeared to 
be focused entirely on the behavior of the child, 
who seemed to show little, if any, awareness of 
his being observed. At the end of the session the 
mother was asked for data on the child’s chronolog- 
ical age and was also encouraged to talk about 
the child’s play behavior during the session and 
whether it was representative of behavior at home 
and in other situations. 


Experimental Situation 


The mother-child interaction sessions were 30 
minutes in length and were held in a large play- 
room, which was made available at the same time 
each day when the activities of the preschool group 
were conducted elsewhere. It contained children’s 
furniture on which toys and materials were set out, 
as well as a chair for the mother near a table con- 
} taining a number of current popular magazines. 
The same set of toys and materials, arranged in the 
same fashion, was used for each session, This con- 
. sisted of a set of large wooden building blocks; a 
Set of small interlocking building blocks; several 
jars of paints with mixing spoons and brushes; a 
box of crayons; scissors; drawing paper; a doll; 
a puppet; a box of brightly colored beads for 
stringing; a jigsaw-type puzzle; hammer, nails, 
and Composition board; a rubber horseshoe game; 
modeling clay; a small truck that could be pulled; 
and several illustrated story books. All of the toys 
and materials were readily accessible to the child 
„ Without help from the adult, The observer was 
Seated in an unobtrusive position at the rear of the 
Toom out of range of the play situation, but able to 
See and hear all interaction, 


Recording of Mother-Child Interaction 


i The recording method used in this study was 
a on the continuous recording method used by 
mera (1951), with the behavior categories developed 
Si Merrill (1946), Bishop (1951), Moustakas, 
igel, and Schalock (1956), and others. The criteria 
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for the selection of categories were that they lend 
themselves to clear definition, that they be de- 
scriptive rather than inferential, that they be 
easily recognized in the process of social interaction, 
and that they be sufficiently comprehensive to 
cover the behavior that would occur during the play 
situation. The behavior categories, with their de- 
scriptions, are contained in Table A3 and Table B, 

Recording was continuous, with each observed 
verbal or nonverbal behavior on the part of the 
mother or child tallied in the appropriate category. 
The verbal recording unit was approximately equal 
to a simple sentence. Although not of primary in- 
terest to the present study, the child’s behavior was 
recorded as well as the mother’s for use in future 
research. 


Observer Reliability 


To determine the reliability of observation a co- 
observer was trained in the use of the recording 
system on a separate group of mothers and children. 
The reliability of observation and recording was 
calculated by the method recommended by Bales 
(1951), in which both the number of behavioral 
acts and the categorizations of the two observers 
are compared by means of chi-square. Based on 12 
consecutive 10-minute periods immediately preceding 
the main study, » ranged from .60 to .99 and aver- 
aged .78, indicating a satisfactory level of reliability. 


Data Analysis 


On each of the three PARI factors, and on the 
Disciplinarian and Rejecting scales of the MPAS, 
the mothers were divided at the median into a 
high and a low group. In addition, since the sensi- 
tivity of the attitude instruments was not known, 
the upper and lower quartiles of the distributions 
were designated as the extreme groups. For each 
subject a set of 15 behavior scores was computed. 
The scores were based on the number of interactions 
recorded in the various categories. The behavior 
of mothers in high groups and low groups, and in 
extreme groups, was compared in accordance with 
the hypotheses, using the Mann-Whitney U test. 
The level of significance was set at the .05 level. 


RESULTS 
PARI Authoritarian Factor 


There were no significant differences þe- 
tween the high- and low-authoritarian mothers 
in attentive observation, directing, or restric- 


3A 2-page table giving the categories and descrip- 
tion of mother behavior and a 2-page table giving 
the categories and description of child behavior have 
been deposited with the American Documentation 
Institute, Order Document No. 8464 from ADI Aux- 
iliary Publications Project, Photoduplication Serv- 
ice, Library of Congress, Washington, D. C, 20540. 
Remit in advance $1.25 for microfilm or $1.25 for 
photocopies and make checks payable to: Chief, 
Photoduplication Service, Library of Congress. 
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TABLE 2 
Mean RANKS ON BEHAVIOR FOR HicH AND Low Groups or PARI AUTHORITARIAN FACTOR 


Grace F. Bropy 


Total group Extreme groups 
Behavior “2 
Low group" | High groupè s ? Low group> | High group» U 2 
Attentive observation 22.8 28.2 1.33 09 10.7 14.4 49.0 >.05 
Directing 26.1 24.9 28 39 12.1 12.9 67.0 > 05 7 
Restriction 25.6 25.4 04 48 9.6 15.4 37.5 025 
aN = 25, 
bN =12, 


tion, although the high-authoritarian mothers 
showed evidence of more attentive observation 
than the low-authoritarian mothers (p = .09). 
Comparing the extreme groups, the extremely 
high-authoritarian mothers exhibited sig- 
nificantly more restriction than the extremely 
low-authoritarian mothers. These findings are 
presented in Table 2. 


PARI Hostility-Rejection Factor 


It was hypothesized that mothers who 
were high on the PARI hostility-rejection 
factor would exhibit more nonattention, 
forbidding, and criticism, and would exhibit 
less verbal interaction, responsiveness to 
questions, interactive play, helping, and 
praise-approval-affection than mothers who 
were low on the PARI hostility-rejection 
factor. Differences between the high and low 
groups in nonattention, forbidding, and crit- 
icism were not significant, although the 
means were in the hypothesized direction. 
The differences between the high and low 


TABLE 3 
MEAN RANKS ON BEHAVIOR ror Hics AND Low Groups or PARI HOSTILITY-REJECTION FACTOR 


groups in verbal interaction, responsiveness to 
questions, interactive play, helping, an 
praise-approval-affection were also not sign 
cant, although again in the hypothes d 
direction. However, trends were apparent in | 
verbal interaction (p = .09) and in prais 
approval-affection (p = .06). The differenct 
between the extreme groups were all in th 
hypothesized direction, but were significan 
only for nonattention (p = .05), verbal in 
teraction (.01 > p> .001), and interactive 
play (.05 > p> .025). These findings alt 
presented in Table 3. 


PARI Democratic Factor 


It was hypothesized that high-democratg 
mothers would rank higher on teaches-exi 
plains-demonstrates, questions, structurizin 
and rate of compliance than low-democra 
mothers. The differences between the high 


strates, questions, and structurizing were ii 
the hypothesized direction but were 


Total group Extreme groups 
Behavior 
Low group® | High groups s ? Low group> | High groupe U t 
Nonattention 23.0 28.0 1.23 | .11 11.3 16.5 56.0 05 
Forbidding 23.4 27.6 119 | 12 11.5 16.3 59.0 | >.05 
Criticism 23.3 27.7 125 Ad 13.4 14.3 83.5 | >.05 
Verbal interaction 28.3 22.7 1.37 | .09 18.2 10.0 36.0 01> p> 00 
Responsiveness to 
questions (responds + 
— no response) 25.6 25.4 03 | 49 15.6 12.5 70.5 | >.05 2 
Interactive play 28.0 23.0 1.26 | 10 16.9 11.2 525 05> p>0 A 
Helping 27.0 24.0 -13 | .23 14.8 13.2 80.0 | >.05 
Praise-approval- 
affection ` 28.6 22.4 1.52 | .06 15.2 12.9 75.0 | >.05 
aN = 25. 
bN = 13, 


oN =14, 
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TABLE 4 
Mean RANKS ON BEHAVIOR FOR HIGH AND Low GROUPS oF PARI DEMOCRATIC FACTOR 


Total group Extreme groups 
Behavior 
S Low group* | High groupè s b Low group? | High groupå U p? 
Teaches-explains- 
demonstrates 23.7 27.0 81 | .21 12.0 12.0 66 05 
\ Questions 22.9 27.7 1.16 | .12 10.8 13.4 51 | >.05 
Structurizing 24.3 26.5 52 | .30 11.3 12.7 58 | >.05 
Rate of compliance 21.0 29.4 2.03 | .02 10.2 14.0 44 | >.05 


7 


significant. However, high-democratic mothers 
were significantly higher than low-democratic 
mothers in rate of compliance (compliance 


**divided by compliance plus noncompliance) . 


The differences between the extereme groups 
were not significant, but were in the hy- 


` pothesized direction (except for teaches- 


explains-demonstrates). These findings are 
presented in Table 4. 


MPAS Disciplinarian Scale 


As hypothesized, high-disciplinarian moth- 
ers showed more directing and restriction than 
low-disciplinarian mothers. The differences 
were not significant for attentive observation, 
although high-disciplinarian mothers showed 
more attentive observation than low-discipli- 
narian mothers (p= 10). The differences 
between the extreme groups were all in the 
hypothesized direction, but none reached sta- 
tistical significance. These findings are pre- 


» sented in Table 5. 


MPAS Rejecting Scale 


It was hypothesized that the high-rejecting 
mothers would exhibit more nonattention, 
forbidding, and criticism, and would exhibit 


less verbal interaction, responsiveness to 
questions, interactive play, helping, and 
praise-approval-affection than the low-reject- 
ing mothers. 

None of the differences between the high- 
and low-rejecting mothers were significant, 
except in the case of forbidding, where the 
differences were significant at well beyond 
the .001 level. However, all differences, €x- 
cept for praise-approval-affection, were in the 
hypothesized direction, for example, the high- 
rejecting mothers showed more nonattention 
(p = .09) and less interactive play (p = .06). 

A trend counter to the hypothesis was 
evident for praise-approval-affection (p=.08), 
with the high-rejecting mothers tending to 
exhibit more of this behavior than the low- 
rejecting mothers. All of the differences be- 
tween the extreme groups were in the hy- 
pothesized direction (including praise-ap- 
proval-affection) , although the only reliable 
difference occurred in forbidding behavior. 
These findings are presented in Table 6. 


Comparison of PARI and MPAS 


Comparing the two attitude instruments 
used, there appears to be no clear-cut differ- 


TABLE 5 
Mean Ranks ON BEHAVIOR FOR HIGH AND Low Groups or MPAS DISCIPLINARIAN SCALE 
Total group Extreme groups 
Behavior ree 
mite Low group" | High groupè z t Low group? | High group°® U 3 
Attentive observati Wife | 28.1 1.29 10 10.4 14.6 46.5 >.05 
Directing Te tact A 29.3 190 | 03 10.2 as | 450 | $05 
Restriction 21.7 | 29,0 1.89 03 10.2 14.8 45.0 | >.05 
aN = 24, 
oN = 26, 


eN = 12, 


322 


Grace F. Bropy 


TABLE 6 
Mean Ranks ON BEHAVIOR FOR HicH AND Low Groups or MPAS REJECTING SCALE 


Total group Extreme groups 
Behavior 
Low group* | High groupè s b Low groupe | High group? U p i 
i 22.7 28.1 1.32 | .0900 10.6 13.2 51.0 05 
foie 19.6 30.9 2.74 | .0008 7.6 16,0 18.0 01> p>.001 
Criticism 25.2 25.8 1S | 4300 11.4 12.6 59.0 | >.05 
Verbal interaction 27.5 23.6 .95 | .1700 13.0 11.1 55.0 | >.05 
Responsiveness to 4 
ti espone fi 
Sis sony z 27.6 23.6 .98 | .1600 13.2 10.9 53.0 | >.05 
Interactive play 28.8 22.5 1.58 | .0600 12.9 11.2 56.5 | >.05 ` 
Helping 26.4 24.7 .42 | .3300 13.7 10.4 47.0 | >.05 
Praise- l- $ 
eae 22.5 28.2 1.32 | .0800 12.2 11.8 64.0 | >.05 
aN = 24, 
oN = 26, 
oN = 11, 
aN = 12. 


ence in predictability to behavior between 
the two. Although it would seem that the 
authoritarian factor of the PARI and the 
Disciplinarian scale of the MPAS are measur- 
ing the same attitude variable, they show only 
moderate correlations with one another. In 
testing their relationship to behavior, certain 
differences between the two instruments be- 
came apparent. Out of the five statistically 
significant findings obtained with the PARI, 
four applied to the extreme groups only, 
suggesting that the PARI discriminates less 
sharply in the middle range of scores. This 
does not seem to be the case with the MPAS, 
which yielded most differences between the 
total high and low groups and did not ap- 
pear to increase in sensitivity at the extremes 
of the distribution. 

Another fact to be noted is the small 
amount of overlap with respect to relation- 
ship to behavior shown by the PARI au- 
thoritarian factor and the MPAS Discipli- 
narian scale, and by the PART hostility- 
rejection factor and the MPAS Rejecting 
scale. In the case of only one behavior cate- 
gory—restriction—did both show a signifi- 
cant relationship to behavior, the PART au- 
thoritarian factor for the extreme groups only 
and the MPAS Disciplinarian scale for the 
total high and low groups. 


DISCUSSION 


This study sought to determine whether 
several maternal attitude factors could be 


related to a mother’s behavior with her child, 
when observed in a standardized play situa-, 
tion with systematic recording procedures. 
Of the 15 behavioral indices, 7 showed 4 
significant relationship to maternal attitude. 
Although the total pattern of results was 
in the predicted direction, each specific sub- 
hypothesis was only partly confirmed. It was 
possible to predict some aspects of the 
mothers’ behavior on the basis of homogeneity 
of attitude, but the findings do not support & 
strong relationship between expressed ma- 
ternal attitudes toward child rearing and 
family life and maternal behavior as 0b- 
served in an experimental situation. 

Two characteristics of the sample prob- 
ably operated to depress the incidence 0 
attitude-behavior relationships, These were. 
the homogeneity of the sample and the fact 
that this was a “normal” group of preschool 
children and their mothers rather than 2 
clinical group. Homogeneity tends to com 
strict the range of both attitude and behavior, 
making more difficult the emergence of sig- 
nificant differences, Another factor which may 
have reduced variability of behavior in the 
present study was the limitation imposed by, 
an observation session of only 30 minutes 1” 
length. 

Intrinsic characteristics of the attitude 
instruments themselves undoubtedly affected 
the results. Among such characteristics coul 
be mentioned the restricted range and vair 
ability of the PART democratic factor (com 
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posed of socially approved statements on child 
rearing with which most mothers agree); the 
factor of response set noted by Zuckerman 
and Norton (1958) in the PARI authori- 
tarian factor and its correlation with educa- 
tion, noted by Zuckerman et al. (1958; 
Zuckerman, Barrett, & Bragiel, 1960); the 
tendency of the PARI factors in the present 
study to show relatively little discrimination 
in the middle ranges and the MPAS scales to 


‘show little discrimination at the extremes. To 


the extent that factors other than content 
influence the attitude scores, the relationship 
to behavior will be more complex and less 
predictable. 

In the present study, factors other than 


„attitude were undoubtedly operating in the 


behavior situation, Organismic differences in 
the child which cannot be accounted for by 


_ maternal influences, for example, intelligence, 


+ 


physique, temperament, were a source of 
variability. Personality differences in the 
mothers were not controlled. It is possible 
that mothers reacted with varying degrees of 
“defensiveness” to being observed. For ex- 
ample, mothers with attitudes of high au- 
thoritarianism and _hostility-rejection may 
inhibit behavioral expression of these at- 
titudes to a greater degree in an observation 
Situation than in an unobserved home situa- 
tion, whereas mothers who are low in au- 
thoritarianism and hostility-rejection may 
feel no such need to disguise their behavior. 
Schalock (1956), in his study comparing 
mother-child interaction in the laboratory and 
in the home situation, found significant differ- 
ences in the direction of greater hostility, less 
interaction, less helping behavior, and more 
restrictive and forbidding behavior in the 
home ‘situation. 

The lack of reported research on the rela- 
tionship of the PARI or the MPAS to ob- 
served behavior makes difficult the com- 


. Parison of the results of this study with the 


findings of other investigators. The overall 
Predictability in the direction of results to- 
gether with the few statistically significant 
relationships which did emerge suggests 
that attitude was one of the components 
determining maternal behavior in this situa- 
tion. The pattern of results also suggests that 
Some behaviors are more salient to the par- 
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ticular attitude and other behaviors more 
peripheral. It would seem desirable in further 
studies of maternal attitude-behavior relation- 
ships to take into account the operation of 
personality, situational, and other motiva- 
tional factors with attitude. 
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RELATIONSHIP BETWEEN MANIFEST ANXIETY 
AND TWO INDICES OF AUTONOMIC 
RESPONSE TO STRESS 


EDWARD S. KATKIN 2 
Duke University 


The effects of stress (threat of shock) on basal skin resistance and GSR non- 
specific responses were investigated for Ss who differed in anxiety levels on the 
Taylor Manifest Anxiety scale. 26 high-anxious (HA) and 26 low-anxious (LA) 
Ss were randomly assigned to stress and nonstress conditions, yielding 4 groups. 
First, all Ss rested; then both stress groups were threatened with electric shock, 
while both nonstress groups continued resting; finally, all Ss rested again. Dur- 
ing the 2nd period GSR nonspecifics increased and basal resistance decreased 
for the shock-threat groups only, as expected; however, during the final period 
(after a 15-min. interview) GSR nonspecifics decreased to the prethreat level 
for the shock-threat groups, but basal resistance continued to decrease for all 
Ss. Both GSR indices were unrelated to individual differences in anxiety level. 
It was concluded that the 2 GSR indices reflect different phenomena. 


When anxiety responses are experimentally 
induced by stress, the particular indices by 
which such responses are to be evaluated 
must be specified. GSR measures of 
autonomic nervous system (ANS) activity 
have been frequently employed for this pur- 
pose, since ANS arousal is generally con- 
sidered to be an important attribute of 
anxiety (Freud, 1936; Malmo, 1959; Martin, 
1961; Silverman, 1957; Silverman & Blitz, 
1956). Two GSR measures which have been 
frequently used to assess ANS arousal are 
the resistance of the skin to an externally 
applied current (basal resistance) and the 
amplitude of skin resistance changes to 
specific stimulation (GSR specifics) (Lacey, 
Bateman, & Van Lehn, 1953; Lazarus, Deese, 
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and assignment of subjects. 

? Now at the State University of New York at 
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& Osler, 1952; Mandler, Mandler, Kremen, 
& Sholiton, 1961). A third, recently de- 
veloped GSR index which has also been used 
as a measure of arousal is GSR nonspecific 
activity (Cohen, Silverman, & Burch, 1956; 
Greiner & Burch, 1955; Silverman, Cohen, 
& Shmavonian, 1959). This index is defined 
as the frequency of measurable fluctuations in 
skin resistance which occur in the absence 
of specific stimulation, Both basal resistance 
and GSR nonspecifics have been shown ‘to 
be monotonic functions of experimentally in- 
duced arousal, while amplitude of GSR 
specifics shows an initial increase with moder- 
ate arousal followed by a subsequent decrease 
as arousal increases to maximal levels 
(Greiner & Burch, 1955; Silverman et al, 
1959). 

It is generally accepted that most people 
react with anxiety responses to stressful 
situations, and that such responses are char- 
acterized by autonomic arousal (Martin, 
1961). It has been noted, however, that there 
are important individual differences in the 
extent to which people react with anxiety 
to stressful (noxious) situations (Spences 
1958). Of the anxiety indices which have 
been used to measure individual differences iM 
response to stress, the Taylor Manifest 
Anxiety (MA) scale (Taylor, 1953) has bee? 
most frequently employed (Child, 1954)- 
This scale has been shown to be related tO 
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clinical judgments of anxiety (Buss, 1955; 
Buss, Wiener, Durkee, & Baer, 1955; Eriksen 
& Davids, 1955; Hoyt & Magoon, 1954) as 
well as to other scalar indices of anxiety and 
general neuroticism (Bendig, 1957; Cattell & 
Scheier, 1958; Lauterbach, 1958). 

The general goals of the present study were 
to investigate the relationship between meas- 
ures of individual differences in anxiety and 
two GSR indices of ANS arousal during 


` stressful and nonstressful conditions. Under 


stressful conditions, both high-anxious (HA) 
and low-anxious (LA) subjects were expected 
to show greater autonomic arousal than com- 
parable subjects who were not stressed. It 
was also expected that the HA subjects in the 
stress condition would manifest greater ANS 


* arousal than LA subjects who were subjected 


to stress and that the extent of this arousal 
would be a positive function of the degree of 
apprehensiveness which the subject experi- 
enced in the experimental situation. On the as- 
sumption that MA scale scores reflect ‘“reac- 
tive” anxiety, that is, the tendency to react 
with anxiety to stressful situations, HA and LA 
subjects were not expected to differ in basal 
resistance and number of GSR nonspecifics 
in the nonstress condition, It has been 
pointed out, however, that MA scale scores 
may reflect “chronic” anxiety, that is, the 
tendency to react with anxiety to all situa- 
tions, whether objectively stressful or not 
(Farber & Spence, 1956; Spence, 1958). 
On the assumption of the “chronic” hy- 
pothesis, HA subjects bring a high level of 
anxiety to all situations. Hence, if the 
chronic” hypothesis holds, it would be ex- 
pected that HA and LA subjects would show 
different patterns of ANS arousal in both 
stressful and nonstressful situations. 


METHOD 
Subjects 


i A modified form of the MMPI, consisting of 398 
items, was administered to all male students enrolled 
in introductory psychology at Duke University. 
Items comprising the validity scales, L, F, and K, 
the nine original clinical scales, the MA scale, and 
Welsh’s (1956) A scale were included. The Mar- 
lowe-Crowne Social Desirability (M-C SD) scale 
(Crowne & Marlowe, 1960) was also administered 
as a check on social desirability response set. 

The HA and LA subjects were selected in the 
following manner: (a) Of the 206 students who took 
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the MMPI, those who scored in the upper 20% 
on the MA scale (raw score of 19 or higher) and 
who also scored in the upper 50% on the A scale 
(raw score of 11 or higher) were designated the 
HA group; those students who scored in the lower 
20% on the MA scale (raw score of 7 or lower) 
and who also scored in the lower 50% on the 
A scale (raw score of 9 or lower) were designated 
the LA group. (b) Subjects with raw scores of 7 
or higher on the MMPI L scale or who scored 1 
standard deviation above the mean of the M-C SD 
distribution for this sample (raw score of 20 or 
higher) were then eliminated on the assumption 
that their needs to present themselves in a socially 
desirable manner invalidated their anxiety scores. 
By this procedure, 32 HA and 30 LA subjects 
were identified. In the course of running the 
experiment, 6 HA and 4 LA subjects were eliminated. 
Of these, 8 were eliminated because of apparatus 
failure, and 2 because it was discovered that they 
had received considerable information about the 
experiment prior to participating in it. 


Apparatus 


The experimental chamber was a well lighted 
room with temperature held constant between 73 
and 76 degrees Fahrenheit. The chamber was sound 
resistant. It contained a comfortable reclining chair 
for subjects, a chair for the experimenter, a tape 
recorder (used for interviewing), and various 
equipment necessary for the attachment of the 
GSR electrodes. The apparatus for amplifying and 
recording GSR responses was located outside the 
chamber. An intercom system allowed the experi- 
menter to hear any unusual movement or activity 
which might have created artifactual skin resistance 


changes. 
Two lead GSR electrodes which conformed to 
specifications suggested by Edelberg, Lhamon, 


Burch, and Childers è were used. The active electrode 
was square in shape, and curved to fit the finger. 
It was placed on the palmar surface of the middle 
segment of the middle finger of the left hand 
over a Dr. Scholl’s corn pad, Number 327, which 
provided a constant skin surface area of .32 square 
centimeter. The electrode was held in place by 
adhesive tape wrapped around the entire prepara- 
tion. The ground electrode was rectangular in shape 
(3 X 4 inches). It was placed on the lateral surface 
of the left arm and secured by an elastic bandage. 
‘An electrode paste of .05 molar solution of NaCl in 
water, with 6 grams of corn starch per 100 cubic 
centimeters added to give the solution a gela- 
tinous consistency, was used to insure uniform 
contact between the electrode and the skin. Basal 
resistance and GSR nonspecifics were simultane- 
ously detected from the single pair of electrodes 
and amplified by a multichannel GSR amplifier, 
oe alsa es 


8 “Biophysics of the Effector in the Galvanic 
Skin Reflex,” progress report to the National In- 
stitutes of Health, United States Public Health 
Service, Research Grant M-1904, September 1, 
1957 to September 1, 1960. 
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Model 201, manufactured by Bio-physical Re- 
search Instruments Company, Incorporated, The 
amplifier generated a 20 microamperes direct cur- 
rent for exosomatic recording. Its output led into 
two channels of a four-channel Grass Instrument 
Company polygraph, Model 5D. In order to check 
on sighing or irregular breathing which might pro- 
duce artifacts in the GSR measures, subjects’ 
breathing was monitored by a respirometer and 
recorded on a third channel of the same polygraph 
which recorded the GSR measures. 


Procedure 


The HA and LA subjects were assigned to stress 
and nonstress conditions by an assistant. The experi- 
menter, who had no knowledge of a subject’s scores 
on the individual difference measures of anxiety, 
was informed only of the experimental condition to 
which each subject had been assigned. At the 
beginning of the experiment, subjects were assigned 
to stress or nonstress conditions randomly, but 
toward the end of the experiment subjects were 
assigned to these conditions so that there would be 
an equal number of HA and LA subjects in each 
condition. Both the stress and nonstress conditions 
consisted of three 10-minute time periods. The initial 
and final periods were identical for all subjects, and 
were structured as rest periods. In the middle period, 
subjects in the stress condition were threatened with 
electric shock, while subjects in the nonstress con- 
dition continued to rest. Between the middle and 
final periods, all subjects were interviewed. Each 
of the 10-minute periods was further subdivided into 
four 2.5-minute time blocks to enable the analysis of 
changes in the dependent measures within each of 
the three experimental periods. 

First experimental period, At the beginning of the 
experiment all subjects were given the following 
instructions: 


The object of this study is to examine certain 
physiological reactions while you relax. In order 
to do that I am going to ask you to just sit back 
and relax for a while, as I measure some of your 
physiological reactions, The measures I am spe- 
cifically interested in are your breathing, and 
your skin resistance. This tube measures your 
breathing [at this point the subject was shown 
the respirometer tube, and it was placed around 
his chest]. These measure the resistance of your 
skin [at this point the subject was shown the 
GSR electrodes, and they were attached to his 
finger and arm]. Do you have any questions? 
[The experimenter paused to answer any questions 
which may have arisen.] All right, then, just 
relax, [At this point, the experimenter tilted the 
subject’s chair into its reclining position.] I will 
come back in about 10 minutes to give you 
further instructions and see if you are comfortable. 


After 10 minutes had elapsed, the experimenter 
returned to the chamber. 

Second experimental period. The instructions given 
the subjects in the shock-threat group at the be- 
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ginning of this period were designed to convince 
them that they were to receive electric shocks, and 
that they would have no control over the delivery 
of the shocks, These instructions were: 


I would like to ask you some questions about your 
health, Have you ever had rheumatic heart 
disease? Buerger’s disease? Raynaud’s disease? 
[The experimenter paused to record subjects’ 
answers to these questions which for all subjects 
were negative.] Good. The following procedure 
is safe if you are in good health. I would like to 
make some comparisons of the physiological re- 
actions I have just recorded with your reactions 
to the stress of an electric shock. Do I have per- 
mission to administer—during the next 10 min- 
utes—some electric shocks to the ankle? [All sub- 
jects gave permission.] These electrodes will carry 
the shock, [At this point the subject was shown 
the “dummy” electrodes, and they were attached 
to his right ankle.] The shocks will each last 
about 1 second, and they will be distributed dur- 
ing the next 10 minutes. I will come back then to 
take off the electrodes. 


No shocks were actually delivered at any time to 
any of the subjects who participated in the experi- 
ment. 

The instructions given subjects in the nonshock- 
threat condition at the beginning of the second 
period were designed to convince them that, the 
experimenter was interested only in physiological 
responses during rest. The instructions were as fol- 
lows: 


How are you? Is everything comfortable? [The 
experimenter paused for a response.] Good. 
would like you to continue resting just as you 
were for 10 more minutes. But first let me check 
out the respirometer and electrodes to see that they 
are secure. [The experimenter then slowly exam- 
ined the GSR electrode connections and the 
respirometer tube.] I will be back in about 10 
minutes. 


The purpose of “checking” the apparatus was to 
equate the amount of time spent between periods 
and the amount of physical activity of subjects mM 
the shock-threat and nonshock-threat conditions 1” 
order to control for changes in basal resistance OY 
GSR nonspecifics resulting from physical activity. 

Interview, After conclusion of the second 10-min- 
ute period all subjects were interviewed. The ex- 
perimenter reentered the chamber, removed the 
electrodes from the subject’s ankles, adjusted the sub- 
ject’s chair to its upright position, and proceeded 
to interview the subjects. The interview focused 0” 
the subject’s reactions during the first two experi- 
mental periods. Its purpose was to evaluate the 
amount of apprehensiveness subjects experienced 
with respect to the experimental situation so that 
the extent to which the experimental situations Were 
successful in arousing feelings of unpleasurable 
quality could be evaluated. At the conclusion of 
the interview all subjects were told: 
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That’s all for now. Thank you. Now I would like 
you to rest for 10 minutes more. 


Third experimental period. At the beginning of 
“this period the subjects in the shock-threat condi- 
tion were told: 


This period will be just like the initial period. 


.The subjects in the nonshock-threat condition were 
told: 


A This period will be another 10-minute period of 
. relaxation. 


. The chair was then tilted back to its reclining posi- 
tion for all subjects. At the conclusion of this final 
«rest period the subjects were thanked for their 
cooperation and, in a conversational manner, in- 
formed that there were actually three experimental 
groups: a group which received electric shock, a 
group which was threatened but did not receive 
electric shock, and a group which was not threat- 

p. ened and did not receive shock. This information was 
given in order to minimize the possibility that 
future subjects would be informed that there was 
no danger of being shocked. Each subject was asked 
to maintain secrecy about the purpose and design of 
the experiment so that future subjects would not 
develop preconceptions about the nature of the ex- 
periment, 


RESULTS 


Effects of Anxiety and the Threat of Shock 
on Basal Resistance 


The index of basal skin resistance was the 
deviation ratio (DR) suggested by Edelberg.* 
The DR measures the proportional change in 
resistance level from one experimental condi- 
tion or time block to another, and thus re- 
flects the extent to which each subject’s re- 
sistance level is affected by the conditions of 
a experiment. The DR was computed as fol- 
lows: 

1. The mean basal resistance for the final 
2.5-minute time block of the first rest period 
(for most subjects, the point of highest basal 
resistance) was determined by averaging the 
three resistance measures taken at 50-second 
intervals during that time block; the score 
for this time block was designated Rr. 

2. Mean resistance measures were similarly 
determined for each of the three remaining 
` 2.5-minute time blocks in the first experi- 
mental period, and for the four time blocks 
in the second and third experimental periods; 
the scores for these time blocks were desig- 
nated Rj. 

tA personal communication with B. M. Shma- 
vonian, December 14, 1962. 
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TABLE 1 


Means AnD SDs or BAsAL RESISTANCE DEVIATION 
Ratios (DRs) FOR THE FOUR EXPERIMENTAL 
Groups For EacH 10-MNuUTE PERIOD 


Rest Stress Rest 

first second third 

Group period period period 
M |SD|M|SD| M SD 

HA 

Shock-threat 38 | .24 | .75 | 44 | 1.05 «17 
E Reo eae “41 | .25 | .10 | .20 | 1.05 | 1.32 
Shock-threat 34 | 34 | .77 | 58 | 1.38 1.21 
Nonshock-threat | .44 | .52 | .25 .67 | 1.06} .85 


3. The DRs for each subject were then com- 
puted for each 2,5-minute time block by the 
formula, DR = 100(Rx — Rj/RjRx). It may 
be noted that as a subject’s basal resistance 
decreases, his DR increases. Subjects’ mean 
DRs for each 10-minute period were then 
computed. For the first 10-minute period 
there were only three scorable 2.5-minute 
time blocks, since the fourth time block de- 
fined Ry; in the second and third 10-minute 
periods the four scorable time blocks were 
averaged. 

The means and SDs of the DRs for each 
experimental period for the HA shock-threat, 
LA shock-threat, HA nonshock-threat, and 
LA nonshock-threat groups are presented in 
Table 1, in which it may be noted that the 
mean DRs for the four groups during the 
initial rest period were comparable. An analy- 
sis of variance® (Type II design; Lind- 
quist, 1953) of the data for the initial rest 
period confirmed the observation that there 
were no differences among groups (F = .03, 
df = 1/48). During the second experimental 
period, as may also be noted in Table 1, the 
mean basal resistance of the HA and LA 
shock-threat groups decreased as reflected in 
higher mean DRs, while mean basal resistance 
of the HA and LA nonshock-threat groups 
increased as reflected in lower mean DRs. In 
the final rest period (after the interview) the 
mean basal resistance of all four groups 


5A Type III analysis of variance was carried out 
for each of the experimental periods (Anxiety 
Levels X Conditions X Time Blocks). The analysis 
referred to in the text is the Anxiety X Conditions 
jnteraction, taken across time blocks, and is equiva- 
lent to a simple randomized design. 
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Fic. 1. Mean basal resistance deviation ratios 
X 100 (DRs) for the combined threat and. non- 
threat groups for the three experimental periods. 


markedly decreased, as reflected in higher 
mean deviation ratios. 

An analysis of variance (Type III; Lind- 
quist, 1953) of the effects of the threat of 
shock and individual differences in anxiety 
upon DRs indicated no significant main ef- 
fect of anxiety nor any interactions of anxi- 
ety with other variables. Therefore, the data 
for the HA and LA subjects in each experi- 
mental condition were combined. For the 
combined data, the performance of subjects 
in the shock-threat and nonshock-threat con- 
ditions in each of the three experimental 
periods is depicted in Figure 1. The DRs for 
subjects in the shock-threat and nonshock- 
threat conditions were comparable during the 


Epwarp S. KATKIN 


initial rest period; during the second ex- 
perimental period, the DRs of subjects in the 
shock-threat condition increased, while DRs 
for subjects in thé nonshock-threat condition 
decreased; during the final rest period, DRs 


for subjects in both conditions increased. An — 


analysis of variance of these data (Type III; 
Lindquist, 1953) indicated that the observed 
Periods X Conditions interaction was signifi- 
cant (F = 4.14, df = 2/96, p < .03). 


Effects of Anxiety and Stress on GSR 
Nonspecifics 


A GSR nonspecific was defined as a de- 
crease in skin resistance of at least 100 ohms, 
followed by a gradual return to predeflection 
level. A continuous record of GSR nonspe- 
cifics was obtained for each subject, and sub- 
jects’ scores on this measure were determined 


by counting the number of nonspecifics in _ 


each 10-minute time period. In this manner, 
three GSR nonspecifics scores were obtained 
for each subject. 

The means and SDs of the GSR nonspe- 
cifics for the HA shock-threat, LA shock- 
threat, HA nonshock-threat, and LA non- 
shock-threat groups for each 10-minute 
period are presented in Table 2. During the 
initial rest period the LA shock-threat and 
HA nonshock-threat groups gave a larger 
number of nonspecifics than the LA nonshock- 
threat and HA shock-threat groups. An 
analysis of variance of the data for the initial 
rest period (Type III; Lindquist, 1953) indi- 
cated that this interaction was significant 


(F = 6.49, df=1/48, p< .05). Since the , 


experimenter was unaware of the subjects’ 
MA scale scores and since differentially 
threatening instructions had not been intro- 


TABLE 2 


Means AND SDs or GSR NONSPECIFICS FOR THE Four EXPERIMENTAL GROUPS, 
FOR EAcH 10-MinuTeE PERIOD 


Rest first period Stress second period Rest third period 
Group H 
M SD M SD M SD 
HA ‘ 
Shock-threat 20.23 20.47 56.54 27.74 27.38 20.85 
n Nonshock-threat 38.23 26.94 44.00 32.13 41.92 27.68 
A 
Shock-threat 37.00 24.43 74.23 31.93 41.54 24.21 
Nonshock-threat 22.69 19.76 26.92 25.93 27.69 24.79 


duced at this point in the procedure, this 
finding is difficult to interpret. Careful re- 
examination of the experimental procedures 
ailed to reveal any systematic variables 
which might have determined this interaction. 
_A possible source of the obtained Anxiety x 
nditions interaction was a systematic 
nge in the GSR apparatus over time. If, 
o for instance, the apparatus was less sensitive 
to GSR changes toward the end of the experi- 
ment, and if the LA nonshock-threat and HA 
shock-threat groups were predominantly rin 
toward the end of the experiment, then it 
ould be expected that those two groups 
would show fewer GSR nonspecifics. Each 
‘subject’s record was examined, and it was 
found that there were no systematic relation- 


riod, as may be noted in Table 2, the mean 
SR nonspecifics for subjects in the HA 
ock-threat and LA shock-threat groups in- 
‘creased markedly, while those of the subjects 
in the HA nonshock-threat and LA nonshock- 
threat groups increased only slightly. During 
the final rest period mean GSR nonspecifics 
‘of the subjects in both shock-threat groups 
decreased markedly, while those of the sub- 
jects in both nonshock-threat groups remained 
‘at essentially the same level. 
‘An analysis of variance (Type HI; Lind- 
quist, 1953) of the effects of the threat of 
‘shock and individual differences in anxiety 
upon GSR nonspecifics indicated that no sig- 
ificant main effects of anxiety were obtained.® 
L Therefore, the data for the HA and LA sub- 
jects in each experimental condition were 
combined. For the combined data, the per- 
formance of subjects in the shock-threat and 
‘onshock-threat conditions in each of the 
three experimental periods is depicted in Fig- 
te 2, The nonspecifics for subjects in the 
®Th order to rule out the possibility that anxiety 
fects might have been obscured by the unexpected 
Anxiety X Threat-Shock interaction obtained in the 
ial rest period, one further analysis was per- 
formed. For each subject, GSR nonspecific dif- 
` | ference scores were obtained by subtracting the 
g number of nonspecifics given in the initial rest 
_ Period from those given in the second and third 
Periods. An analysis of variance of these difference 
Cores indicated that no significant main effect of 
anxiety nor any interaction of anxiety with other 
Variables was obtained. f 
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shock-threat and nonshock-threat conditions 
were comparable during the initial rest pe- 
riod; during the second experimental period 
the nonspecifics of subjects in the shock- 
threat condition increased markedly, while 
nonspecifics for subjects in the nonshock- 
threat condition remained essentially the 
same; during the final rest period, GSR non- 
specifics for subjects in the shock-threat 
condition decreased to approximately the pre- 
threat level, while nonspecifics for the sub- 
jects in the nonshock-threat condition re- 
mained at essentially the same level. An 
analysis of variance of these data (Type 
III; Lindquist, 1953) indicated that the ob- 
served Periods X Conditions interaction was 
significant (F = 22.90, df = 2/96, p < .001). 


The 2.5-minute Time Blocks within Periods 


A detailed description of the changes in 
GSR nonspecifics and DRs within periods has 
been previously reported (Katkin, 1964). Of 
present interest was the finding of a signifi- 
cant Time Blocks X Conditions interaction 
(F = 35.26, df = 3/144, P< 001) for GSR 
nonspecifics during the second experimental 
period. This interaction reflects the fact that 
marked differences were obtained between the 
shock-threat and nonshock-threat groups in 
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mean GSR nonspecifics at the beginning of 
this period, and that these differences dimin- 
ished over time. The main effect of experi- 
mental conditions (F = 12.65, df= 1/48, 
p < 001), which was also obtained in the 
analysis of the second experimental period, 
resulted largely from the difference in mean 
GSR nonspecifics between the two experi- 
mental conditions during the first 2.5-minute 
time block, 


Relationship between DRs and GSR Nonspe- 
cifics 


In order to evaluate the relationship be- 
tween the two GSR indices, Pearson product- 
moment correlations were computed between 
the DRs and GSR nonspecifics for the initial 
rest period, for the final rest period, and for 
the first time block of the second experi- 
mental period, The latter correlation provided 
an indication of the relationship between the 
GSR indices immediately following the intro- 
duction of differentially threatening instruc- 
tions. None of the correlations between the 
GSR indices separately computed for the 
shock-threat and nonshock-threat groups was 
significant. For the combined groups, a sig- 
nificant correlation (r= .45, N=52, p< 
.01) was obtained between the GSR indices 
for the first time block of the second experi- 
mental period. The correlations between 
GSR indices for the combined groups for the 


initial and final rest periods were not signifi- 
cant, 


Relationships among Apprehensiveness and 
the GSR Indices as a Function of Experi- 
mental Conditions 


The interview which followed the second 
experimental period was used to evaluate the 
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experienced in the experimental situation, 
Each subject was asked to report what he 
was thinking about during the preceding 10- « 
minute periods, and was further asked to de- — 
scribe his reactions and thoughts during each | 
“quarter” of each period. Threat-shock sub- 
jects were asked to describe the shock; they - 
were not told until after the interview that 
there was actually no shock. Subjects E. 
cally reported that the shock must have been 
very mild, because they had not felt it; how- | 
ever, a few subjects reported feeling a very 
mild shock sensation. Apprehensiveness was 
defined as verbalized concern or fear about 
the experimental chamber, the procedures, or — 
any other aspect of the experimental situa- ` 
tion. i 
Tape recordings of the subjects’ verbaliza- 
tions were rated for apprehensiveness on & 
6-point scale”: “no, little, mild, some, much,’ 
extreme.” Each subject’s verbal reports were 
rated for the initial rest period, and for the 
first time block of the second experimental 
period, since it was expected (and the non- | 
specifics confirmed) that this time block | 
would show the greatest differential effect of 
threatening instructions. On the basis of the 
interview data for the initial rest period, 29 
of the 52 subjects were rated in the “no” 
apprehensiveness category, 14 subjects were 
rated “little? 2 subjects were rated “mild,” 
6 subjects were rated “some,” and 1 subject 
was rated in the “extreme” category. The 
subjects who were rated in the no category 
were designated “‘nonapprehensive”; the sub- | 


amount of apprehensiveness which subjects | 


7I am indebted to L. D. DeNike, who served 38 
an independent rater of a random sample (N = 14) 
of the interviews. Percentage of agreement om 
dichotomous apprehensive/nonapprehensive cate- 
gory assignments was 89%. 


TABLE 3 


FREQUENCY OF APPREHENSIVENESS RATINGS FOR COMBINED SUBJECTS IN THE SHOCK-THREAT AND NoNsHOCK , 
THREAT GROUPS FOR THE INITIAL REST PERIOD AND THE FIRST TIME BLOCK OF THE 
SECOND EXPERIMENTAL PERIOD 


Initial rest period 


First time block, second period 


Group 
Nonapprehensive | Apprehensive Nonapprehensive Apprehensive 
Shock-threat 13 | 13 2 24 
Nonshock-threat 16 10 17 9 
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i 
f jects who were rated as having little, mild, 
| some, and extreme apprehensiveness were 
-combined into a single group which was des- 
. ignated “apprehensive.” 
‘Analysis of the interview data indicated 
_ that the obtained individual differences in 
yerbalized apprehension were independent of 
anxiety level during both the initial rest 
period and the first time block of the second 
, period, Therefore, the data for HA and LA 
subjects were combined, and apprehensive- 
ness was evaluated as a function of the threat- 
tionthreat variable. The number of appre- 
Tensive/nonapprehensive subjects in the 
‘shock-threat and nonshock-threat groups is 
indicated in Table 3. During the initial rest 
„ Period, about half of the subjects who were 
_ "subsequently to be threatened with shock 
were rated apprehensive; slightly fewer than 
half of those subjects who were subsequently 
* to continue resting were rated apprehensive. 
The difference between the frequencies with 
which subjects in the shock-threat and non- 
shock-threat groups were rated apprehensive 
in the initial rest period was not significant 
(¢=.31, df =1). This finding indicated 
that prior to the introduction of differentially 
threatening instructions the subjects in the 
threatened and nonthreatened groups did not 
differ in apprehensiveness. During the first 
time block of the second experimental period, 
as may also be noted in Table 3, 24 of the 26 
,, Subjects who were threatened with shock were 
= tated apprehensive, and 9 of the 26 subjects 
who were not threatened were rated appre- 
hensive, 
$ In order to evaluate the relationship be- 
tween rated apprehensiveness and the two 
GSR indices, Pearson product-moment corre- 
lations were computed for the threat and non- 
threat groups separately, and for the com- 


A 
j. 


ai 


bined groups. The only significant correla- 
tions obtained were between the DRs and 
, apprehensiveness for the combined groups 
during the first time block of the second 
_ experimental period (r= .37, N = 52, P< 
» .05), and between GSR nonspecifics and 
« 4pprehensiveness for the combined groups 
during the first time block of the second ex- 
oo period (r= 42, N=52, 2 < 
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Discussion 


Effects of Experimental Conditions upon the 
GSR Indices 


Contrary to expectation, the present study 
failed to find differences between HA and LA 
subjects on either of the GSR indices em- 
ployed. This finding, however, was consistent 
with the results of Johnson (1963) and Wil- 
son and Dykman (1960). For the combined 
data of the HA and LA subjects, it was found 
that the mean DRs of the threat and non- 
threat groups were of the same order of mag- 
nitude in the initial rest period, and that DRs 
increased for the shock-threat group and de- 
creased for the nonshock-threat group during 
the second experimental period. From the 
second experimental period to the final rest 
period, DRs increased markedly for subjects 
in both the shock-threat and nonshock-threat 
conditions. For the combined data, the mean 
numbers of GSR nonspecifics for subjects in 
the shock-threat and nonshock-threat condi- 
tions were approximately the same during 
both the initial and final rest periods; dur- 
ing the second experimental period, GSR 
nonspecifics increased for the shock-threat 
group but remained at essentially the same 
level for the nonshock-threat group. Thus, 
the two GSR indices showed similar patterns 
of change from the initial rest period to the 
second experimental period, but showed mark- 
edly different patterns of change after the 
15-minute interview. 

The finding that the two GSR indices 
diverged after the interposed interview 
strongly suggests that basal resistance and 
GSR nonspecifics reflect different underlying 
phenomena. The interview was designed to 
reduce the threat of shock by removing the 
electrodes and reassuring the subjects, and to 
get the subjects to think about their experi- 
ences in the experimental situation. For the 
threat-shock group, it may be inferred from 
the increase in GSR nonspecifics in the 
threatening period followed by the decrease 
in this measure in the final rest period, that 
GSR nonspecifics reflected the emotional as- 
pects of these subjects’ responses to the ex- 
perimental conditions. The marked increase 
in DRs for both the shock-threat and non- 
shock-threat groups immediately after the 
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interview suggested that the DRs may reflect 
nonemotional (cognitive) aspects of the sub- 
jects’ responses to stimulation provided by 
the interview itself. Fox® has reported that 
in a recently completed study, subjects’ skin 
resistance levels increased in a rest period 
following a threat-of-shock stress period 
when there was no interview interposed be- 
tween the stress period and the final rest 
period. 

Although the attribution of stress-reducing, 
thought-inducing, properties to the interview 
in the present experiment is admittedly post 
hoc, the present findings were significant and 
generally consistent with recent physiological 
data reported by Edelberg et al. (see Foot- 
note 3). These investigations reported evi- 
dence that GSR responses were mediated by 
two separate effector organs: the epidermis 
and the sweat glands. Furthermore, it has 
been suggested by Edelberg (see Footnote 4) 
that the sweat glands were primarily respon- 
sible for the production of GSR nonspecifics, 
while the epidermis was primarily associated 
with skin conductance changes which are as- 
sociated with the orienting reflex, and par- 
tially mediated by the central nervous system 
(Wang, 1957, 1958). In the present study, 
changes in basal resistance may reflect epi- 
dermal activity associated with cognitive re- 
sponses to the interview situation, while GSR 
nonspecifics reflect sweat gland activity as- 
sociated with emotional responses to the 
threatening instructions. 


Relationship between Individual Differences 
in Anxiety Level, Apprehensiveness, and the 
GSR Indices 


In the present study, no differences were 
found between HA and LA subjects on either 
of the GSR indices during the three experi- 
mental periods, nor were any differences 
found between HA and LA subjects on rat- 
ings of apprehensiveness, Since there were 
no differences between HA and LA subjects 
in the initial rest period, in which there was 
no stressor, these findings may be viewed as 
consistent with the idea that MA scale scores 
reflect “reactive” anxiety, since the “chronic” 
anxiety hypothesis would predict greater ap- 


8R. Fox, personal communication, January 24, 
1964, 
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prehensiveness and more active GSR responses 
for HA subjects during the initial rest period. 
However, differential physiological responses 
in the threatening period would be expected 
for HA and LA subjects on the basis of the 
reactive hypothesis, but none were obtained; 
the LA and HA subjects responded to the 
shock threat with an equally Aigh level of 
apprehensiveness. In order to account for 
these findings, it must be assumed that MA 
scale scores reflect differential tendencies to 
give anxiety responses to moderately stress- 
ful stimuli. If this were the case, no differ- 
ences in anxiety responses would be expected 
for HA and LA subjects either in nonthreat- 
ening situations which elicit no apprehensive- 
ness or in highly threatening situations which 
induce a high degree of apprehensiveness in 
LA and HA subjects alike. 

The findings of the present study point up 
the importance of the specific stressor situa- 
tion used to elicit anxiety responses in investi- 
gations in which physiological response to €x- 
ternal stressors are evaluated. In order to 
elicit differential physiological response, from 
subjects who are presumed to differ in “re 
active anxiety” as measured by the MA scale, 
it would appear necessary to elicit differential 
levels of apprehensiveness from HA and LA 
subjects (Endler, Hunt, & Rosenstein, 1962). 
Examples of stressors which appear to have 
been used effectively for this purpose are 
“personally involving stress” (Mandler et al., 
1961) and “failure-stress” (Lazarus et al, 
1952). 


Relationship between GSR Indices 


The present study failed to demonstrate 
any significant correlations between the two 
GSR indices for the shock-threat group and 
the nonshock-threat group separately. Al- 


though this finding lends support to the at- 


gument of Lacey et al. (1953) that separate 
autonomic indices are frequently uncorre- 
lated, it is contrary to the findings of Silver- 


man et al. (1959) who report that “there 5 
a very good correlation between number 0 


non-specifics and level of basal resistance [P: 
69].” The finding that relationships betwee? 


2a 


two GSR indices of ‘arousal vary from One 


experimental condition to another underscores 
the importance of further exploration of the 
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‘characteristics of situations which elicit auto- 
nomic responses, and more specifically, the 
role of external stimulation in the elicitation 
of anxiety responses. 
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THE SIMILARITY, COMPATIBILITY, OR INCOMPATIBILITY 
OF NEEDS IN INTERPERSONAL SELECTION 


JOSEPH F. RYCHLAK 


Saint Louis University 


This study examines the influence of need similarity (e.g., need Dominance and 
need Dominance), need compatibility (e.g., need Nurturance and need Suc- 
corance), and need incompatibility (e.g., need Order and need Change), on 
interpersonal selection. After having participated in 2 small-group problems, Ss ‘ 
selected a most and least preferred co-participant on 3 role-relationship dimen- 
sions: supraordinate (boss), subordinate (employee), and peer (neighbor). The 
findings are consistent with a need-compatibility or incompatibility interpreta- 
tion. For example, highly nurturant Ss choose highly succorant individuals as 
potential neighbors. Role relationships also are of importance in interpersonal 
selection. Need similarity in interpersonal selection was not found, 


It is reasonable to believe that people 
choose their friends, marital partners, etc., on 
the basis of anticipated positive reinforce- 
ment. However, problems develop when one 
attempts to define the exact nature of this 
positive reinforcement. Winch, Ktsanes, and 
Ktsanes (1955) found what they believed to 
be evidence for a complementary needs view 
in the selection of marital partners. Newcomb 
(1956) later proposed that complementary 
need selection is actually a special type of 
common attitude selection. If an assertive 
male chooses a passive female and vice versa, 
they do so because each holds the common 
verbalized or unverbalized attitude “that one 
of them should be assertive and the other re- 
ceptive [p. 579].” Studies done since New- 
comb’s observation have tended to support 
his position (Izard, 1960; Rosenfeld & Jack- 
son, 1959), but the picture is not entirely 
clear (Reilly, Commins, & Stefic, 1960). 

The present study takes up the influence 
of needs on interpersonal selection. The atti- 
tude-need relationship is not studied, although 
there would certainly be implications for this 
problem if it could be demonstrated that 
needs do exert an influence on a chooser’s 
selection of another individual. There are 
three preliminary questions which one must 
ask, embracing points which the following 
research design attempts to meet: 1 


1The author is greatly indebted to Douglas W. 
Bray, Director of Personnel Research of the Ameri- 
can Telephone and Telegraph Company and Director 
of the Management Progress Study. Special thanks 
also are extended to Warren Bachelis, H. Weston 


1. What are the needs at play between 4) 
subject choosing and the subject chosen, and! 
are they similar, compatible, or incompatible 
in nature? By similar needs we mean that 
both subjects may be said to have a givell 
personality need; for example, they are both 
extraverted. Compatible needs would result in 
the case where the choosing subject has @ 
need for dominance and the subject selected 
has a need for deference or submission. It 
this case, the needs would complement each 
other and presumably facilitate interpersonal 
contact. Incompatible needs, on the other) 
hand, would result when the choosing subject) 
has a need for order, but the subject to be 
chosen has a need for change in routine. Here, 
should the two subjects relate socially, a col 
flict in personality needs would be expected: 

2. What is the strength of the subject’ 
need who is about to make a selection from 
among several individuals? It may be that # 
mildly dominant subject would prefer to a 
sociate with other dominant subjects, whereas) 
the highly dominant subject might seek dê 
pendent or deferent subjects as associates. 

3. What is the nature of the role relation” 
ship between the subject making the selec 
tion, and the subjects from among whom he 
must choose? Is he to select a marital path 
ner, next-door neighbor, work associate or SU” 
perior, etc.? Most of the research to date has 
dealt with friendship choice (Izard, 1960; 
Reilly et al, 1960; Rosenfeld & Jackso™ 


Clarke, Donald Dewar, William Felton, Walter Kat- | 
kovsky, and John McKinney. 
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1959). It would be misleading to generalize 
to all role relationships from this one aspect 
of social interaction. It might well be that 
need compatibility is the basis for certain se- 
lections, and need similarity or even incom- 
patiblity may serve as the basis for other se- 
lections (e.g., some men like to have their 
job peers “unlikable” so that they feel less 
guilty having to compete with them). 


METHOD 


Subjects i 

This study was done within the framework of the 
Management Progress Study being conducted by the 
Personnel Research Section of the American Tele- 
phone and Telegraph Company. Subjects were 96 
males, who had either just completed college and 
were beginning their business careers in the Bell 


‘Telephone System, or, high-school graduates who 


had worked in this large business complex for 
roughly 8-10 years and had been promoted to lower 
management positions. The mean CA of all subjects 
was approximately 26 years, with a range of 22-33 
years, About 60% of the subjects had obtained bac- 
calaureate degrees, 20% had had some education be- 
yond high school, and 20% held only a high-school 
diploma. Slightly more than two-thirds of these 
young faen were married, with families of various 
sizes. All of the data were gathered over a 2-month 
period. Data collection took place at an Assessment 
Center, with subjects undergoing a 3.5-day period 
of scheduled test taking and small-group evaluation. 


Procedure 


_ Subjects chose one another following joint par- 
ticipation in two six-man small-group problems. The 


. Same six subjects participated in both groups. The 


problems came on successive days, in the following 
order: manufacturing and discussion groups. In ad- 
dition to this more formal interaction, subjects had 
an opportunity for informal contact over the 3.5 
days of assessment when at lunch, in the evening, 
etc, Several of these young men were from out of 
town, and stayed at a hotel where they had an op- 
Portunity for informal contact. It should also be 
noted that probably 20% of the subjects knew one 
another through business or school contacts made 
before coming to the Assessment Center. The sam- 
pling is therefore not random and it was felt most 
desirable to wait until after the discussion group, 
which came on the final day of assessment, before 
asking the subjects to choose one another. The two 
group problems assured the experimenter that the 
subject's selections would at least be limited to men 
with whom the experimenter was certain that some 
Interaction had taken place. Thus, the selections 
were one of the last acts performed by subjects at 


3, the Assessment Center. $ 
e manufacturing-group problem lasted for a pe- 


An of 2 hours, The six subjects were brought to- 
gether, furnished with a large work table on which 
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were placed five tinker-toy objects (mechanical man, 
airplane, etc.) permanently assembled as models, and 
laboratory coats of different colors. After the ex- 
perimenter had requested each subject to don a coat 
and assume the identity of its color (Mr. Red, Mr. 
Blue, etc.), the men were instructed that they were 
about to constitute a business enterprise. They were 
furnished with a small sum of money as capital, and 
provisions were made whereby they could purchase 
tinker-toy parts in order to build the displayed sam- 
ple objects. Provisions were also made for the sale 
of completed tinker-toy objects, and all of the 
profits earned were to be shared by the six partici- 
pants.? 

The same six subjects, with the same color desig- 
nations, performed 1 day later in the discussion- 
group problem. In this situation, subjects were asked 
to assume the role of a middle management super- 
visor who was about to participate in a conference, 
the purpose of which was to select a subordinate for 
promotion from among a list of six likely candidates. 
Thus, each subject was given a typed page describ- 
ing the merits of a subordinate foreman employee 
who was considered excellent material for promotion 
by the “supervisor” involved. After an initial state- 
ment from each of the six participants, the confer- 
ence was thrown open for a 1-hour period of dis- 
cussion before a final voting and ranking of candi- 
dates was tabulated. In all, this problem took 
approximately 1 hour and 45 minutes for completion. 

The problems of group organization were left com- 
pletely in the subjects’ hands, and the general char- 
acter of the group structure which evolved varied 
according to the men who comprised its members. 
Leaders (based on observer ratings of leadership) in 
one situation were not always leaders in the other, 
although a significant Pearsonian correlation of ap- 
proximately 40 (p< 01) between situations was 
found (Rychlak, 1963). It was clear to the subjects 
that these problems were competitive in nature and 
that they would be expected to show up well in re- 
lation to their peers. They were being evaluated and 
rated by two experimenters throughout the perform- 
ance of both group problems. Motivation was high, 
and the task involvement of subjects was generally 
excellent. 


Interpersonal Selection Sociometric 


Before leaving the group-discussion problem, sub- 
jects were requested to complete the following “Opin- 
ion Scale”: 

Now that you have had a chance to get to know 
the other men in this group, answer the following 
questions by circling a man’s color following each 
question. Do not spend too much time deciding, 
put try as honestly as you can to circle the most 
appropriate color. Please be as sincere and straight- 
forward as you can. Remember, this a relative 
rating, and your answers do not imply that you 
actually or necessarily “dislike” any of your group 
associates. We merely want your opinion, based on 


2 The Manufacturing Problem was devised by John 
K. Hemphill of Educational Testing Service. 
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what information and experience you have avail- 
able to you at present. Circle only one color per 
question and do not vote for yourself: 


1) Which man would you most like to work for, 
to have as a supervisor or boss? 


Red Green Blue Gray White Brown 


2) Which man would you least like to work for, 
to have as a supervisor or boss? 


Blue White Red Green Brown Gray 


3) Which man would you most like to have as an 
employee, to be your subordinate? 


Green Brown Gray White Blue Red 


4) Which man would you least like to have as an 
employee, to be your subordinate? 


White Brown Gray Red Green Blue 


5) Which man would you most like to have liv- 
ing next door to you, to be an acquaintance or 
friend of yours? 


Brown Blue Green Red Gray White 


6) Which man would you least like to have living 
next door to you, to be an acquaintance or 
friend of yours? 


Gray Red White Brown Blue Green 


Selections made on the basis of this sociometric 
scale provided part of the raw data of this study. 
For each of the six role-relationship choices, it was 
possible to assign the subject a value from O (no 
votes received) to 5 (all members selected). Against 
this 6-point criterion score, scholastic aptitude and 
personality test measures could be correlated. Since 
the color codes were recorded for each subject, it 
was also possible to pair-up the members of the 
small groups in terms of those who had selected or 
rejected one another. 


Scholastic Aptitude and Personality Measure- 
ment 


Subjects were administered two tests of scholastic 
and/or intellectual aptitude: the Wonderlic (1945) 
Personnel Test and Form 1A of the Cooperative 
School and College Ability Test (SCAT; Educa- 
tional Testing Service, 1955). Five tests of person- 
ality were administered to the subjects: Bass’ (1955) 
adaptation of the California Authoritarian Scale 
(Bass F Scale), Edwards Personal Preference Sched- 
ule (EPPS; Edwards, 1954), Guilford-Martin In- 
ventory of Factors GAMIN (Guilford & Martin, 
1948), and the Rotter Incomplete Sentences Test 
(ISB; Rotter & Rafferty, 1950). Both the personality 
and scholastic aptitude tests were administered over 
a 3-day period. Subjects took all tests in small 
groups, at different times of the day, and under the 
supervision of an experimenter. Only the EPPS will 
be taken up in detail, because it was on the basis of 
this test that need similarity, compatibility, and in- 
compatibility were determined. 


- teraction, should two people high in the respec 
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The Similarity, Compatibility, and Incom: į 
patibility of Needs 


Henceforth, subjects of this study who made selec- 
tions or chose others will be referred to as selectors; 
those subjects who were chosen will be call 
selectees. Bear in mind that every subject of the 
present study was a selector, but since sociometrit 
scores ranged from 0 to 5, not all subjects were 
selectees on each of the role-relationship dimensions. 
If the subject received no “most like” selections from | 
his fellow group members this would of course sugi 
gest that he lacked popularity and was an “isolate” 
in the usual sense of that term. On the other hand, 
if the subject received no “least like” selections 
would mean that he was probably at least moder- 
ately popular, and thus an isolate in the negative) 
sense only of being “left out” of an unpopular group 
selection. 

The EPPS is designed to measure 15 manifest per} 
sonality needs, such as Dominance, Nurturance, andj 
Autonomy, and it makes use of a forced-choice tech 
nique to correct for social desirability of items. Both | 
selectors and selectees had score values on these 13) 
needs. It was therefore possible, for example, to 
break down the group of 96 subjects into selectors) 
falling above and selectors falling below the media) 
Dominance score for the experimental sample as 4 
whole. Following this dichotomization of the samp 
one could examine the Dominance scores of the se 
lectees of these two groups. If need similarity playet) 
a role in interpersonal selection as weighted bi 
strength of need, we would expect the selectors big 
in need Dominance to choose selectees high in net) 
Dominance. If need compatibility were the factor ® 
importance, then the selectors high in Dominant 
might well choose selectees high in need of Defe 
ence, etc. It was in this way that a series of EPP 
variables could be examined to assess the essen! 
influences at play in interpersonal selection. The first 
step taken in the present research, even before data 
were collected, was to combine certain EPPS need 
into the three classifications of need similarity, com 
patibility, and incompatibility, based upon the deft 
nitions of these variables proffered by the test’s alt 
thor (Edwards, 1954), All but the following naa 
EPPS needs were thus utilized in this study: Heter® 
sexuality, Endurance, and Intraception. A 
tion of the need classifications follows: 

Need similarity. This is a simple comparison ® 
the same EPPS need in the selector and selectet 
For example, the Exhibition score of the selector © 
compared with the Exhibition score of the select 
If need similarity is relevant in interpersonal sele 
tion, then we should find selectors high in a give 
need choosing selectees high in the same need. 
EPPS needs were studied in this manner: Do 
nance, Nurturance, Exhibition, Order, and A 
tonomy, 

Need compatibility. The definitions of the t 
EPPS variables in question imply a harmony in ™ 


needs relate socially. For example, Dominance 
argue for one’s point of view, to be a leader ® 
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groups to which one belongs) and Deference (To 
follow instructions and do what is expected . . . to 
accept the leadership of others) is considered a po- 
tentially compatible basis on which to formulate a 
relationship (Edwards, 1954, p. 14). It could be that 
a selector with above average Dominance needs 
would be attracted to a selectee with high Defer- 
ence needs. Three such compatible combinations 
were singled out for study: Dominance and Defer- 
ence, Nurturance and Succorance, and Exhibition 
and Affiliation. Since the reader may have some 
question regarding the compatibility of the latter 
two variables, it should be noted that their defini- 
tions are as follows: EPPS Exhibition implies. a 
need “To say witty and clever things . . . to talk 
about personal achievements ...to be the center 
of attention”; whereas EPPS Affiliation suggests a 
need “To participate in friendly groups . . . to share 
with friends, to do things with friends rather than 
alone (Edwards, 1954, p. 14].” If an exhibitionist 
needed a compatible audience, the experimenter rea- 
Soned, his best bet would be to relate to an affilia- 
tive individual. 

Need incompatibility (conflict). The definitions of 


_the two EPPS variables suggest disharmony in inter- 


action, should two people high in the respective 
heeds relate socially, For example, Order (To have 
things arranged so that they run smoothly without 
change) and Change (To do new and different things 
. . . experience novelty and change in daily routine) 
would*be considered an incompatible combination 
for any selector and selectee (Edwards, 1954, p. 14). 
Three incompatible combinations were studied: Or- 
der and Change, Dominance and Aggression, and 
Autonomy and Abasement. 


RESULTS 


The first point of interest centers on the 


. Possibility of a general trend for the entire 


sample between the role relationships of the 
Sociometric and the personality or scholastic 
aptitude variables. It should be emphasized 
that the selectors had no knowledge of the 
test data of their selectees nor did they have 
knowledge of their own scores on the tests 
administered. Pearsonian correlations were 
run between all test data and the sociometric 
Scores for each subject. There were actually 
Six sociometric scores per subject, one for 
each of the questions. Neither of the schol- 
astic aptitude tests proved to be significantly 
related, but five of the personality variables 
ee into the findings. Considering the most 
iked sociometric choices first, EPPS Afilia- 
tion was positively related to being selected 
as a potential employee (7 = .205, $ < .05)- 
EPPS Deference was negatively related to 
being selected as a most likable neighbor (r 
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= — 284, p < .01). Correlations run against 
the least liked sociometric choices gave find- 
ings on the role relationship of neighbor. As 
the following three GAMIN personality char- 
acteristics increased among selectees, their 
likelihood of being seen as least likable neigh- 
bors also increased. Ascendance (r= .228, 
p< 05), masculinity (r= .208, p< 05), 
and (lack of) inferiority feelings (r = .265, 
p<.01). 

The second and major point of interest of 
this study concerns the influence of person- 
ality needs on interpersonal selection. Do se- 
lectors base their choices of selectees on need 
similarity, compatibility, or incompatibility? 
Ts there a difference depending upon the type 
of role relationship being chosen? For exam- 
ple, selectors might seek need similarity with 
peers, but need compatibility with subordi- 
nates, etc. In order to test for this, the sam- 
ple of 96 subjects was repeatedly divided at 
the median into subgroups on each of the 
following EPPS need variables: Dominance 
(median raw score for sample = 21.75), Nur- 
turance (10.50), Exhibition (15.50), Order 
(12.50), and Autonomy (11.90). This re- 
sulted in a “high” and “low” group of ap- 
proximately 45-50 selectors on each of five 
needs; the high group presumably had greater 
need than the low group of the personality 
characteristic in question. It should therefore 
be clear that the EPPS scores of the selectors 
formed the basis for sample dichotomization 
into subgroups. The sample was reshuffled on 
this basis five times, and each of these di- 
chotomizations was independent of the others. 
As raw data for the tests of significance be- 
tween these subgroups, the EPPS scores of 
the selectees were used in three ways. 

1. The selectee of each selector was found 
and the former’s score on the EPPS variable 
reflecting need similarity, need compatibility, 
or need incompatibility with the selector vari- 
able under dichotomization was arrayed, and 
then contrasted between subgroups. For ex- 
ample, when the selectors had been divided 
into high and low groups on the basis of their 
Dominance scores, these subgroups had as 
raw scores the following EPPS test values 
taken from the protocols of their selectees: 
Dominance (for need similarity), Deference 
(for need compatibility), and Aggression (for 
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need incompatibility). The test of significance 
between subgroups was the ¢ test. 

2. Using the selectee’s EPPS scores reflect- 
ing need similarity, compatibility, or incom- 
patibility, it was also possible to make within- 
subgroup comparisons across the six questions 
of the sociometric. The test of significance 
within high- and low-selector groups was 
analysis of variance. 

3. Finally, it was possible to devise a dif- 
ference score (D score) between the most 
liked and least liked selectees on each of the 
three role-relationship dimensions (boss, em- 
ployee, and neighbor) of the sociometric. The 
effect of this was to reduce the sociometric 
scores from six to three in number. The D 
scores take into consideration the disparity 
between the selectee most preferred and the 
selectee rejected by a given selector, and it is 
possible to make both within- and between- 
subgroup comparisons (via analysis of vari- 
ance and ¢ tests). 

Statistical analyses failed to identify sig- 
nificant findings on need similarity in inter- 
personal selection, even though all three forms 
of testing for this influence were attempted. 
Moving on to need compatibility, two of the 
three compatible arrangements of EPPS vari- 
ables between selectors and selectees reflected 
a significant finding: Nurturance and Succor- 
ance, and Exhibition and Affiliation. Table 1 
gives the means and standard deviations of 
the EPPS Succorance and Affiliation raw 
scores for the selectees of the selectors with 
compatible need arrangements (Nurturance 
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and Exhibition, respectively). Note that the 
six sociometric questions are quoted in ab- 
breviated fashion, and that means and stand- — 
ard deviations of selectee EPPS scores are 
given for both the high and the low EPPS | 
selector groups. The number of selectors in | 
each of these subgroups is given in parenthe- 
sis. Only the ¢ tests across high and low sub- % 
groups proved to be significant. The analyses 
of variance within subgroups failed to reach 
the .05 level of significance; however, it | 
seemed desirable to present all data for a’ 
given sociometric series so that the reader | 
could have a better picture of how the selectee 
data were distributed. 
The findings in Table 1 suggest that selec- , 
tors high in EPPS Nurturance are more likely k 
to choose selectees with high needs for EPPS 
Succorance as neighbors (mean = 9.34) than 
are selectors low in EPPS Nurturance (their 
selectees had a mean Succorance score of 
7.28). Selectors who have a high need fot 
EPPS Exhibition are more likely to reject 
selectees as possible employees on the grounds | 
of their low need for EPPS Affiliation (12.84) ' 
than are selectors who have a low need fot 
EPPS Exhibition (their selectees had a mean 
Affiliation score of 14.43). This finding is es- 
sentially in support of the compatibility of 
needs, even though it suggests that the less 
affiliative selectees were rejected because the 
more exhibitionistic selectors sensed an M- 
compatibility in interpersonal relations. This 
merely emphasizes the fact that compatibility 
and incompatibility in need influence are 0p- 


TABLE 1 


MEANS AND STANDARD DEVIATIONS OF EPPS SUCCORANCE AND AFFILIATION RAW SCORES FOR 
SELECTEES OF COMPATIBLE SELECTORS 


Nurturance selectors (N = 96) 


Succarance of selectees Ste ee m 
$ 4 ' 
Sociometric question High gon TR ee ig aE Loy, mea 
M SD M SD M SD M SD 
Most like boss 8.72 6.36 7.73 3.87 14. 2.97 
Least like boss 77.43 4.58 7.32 300 | 1345 391 i376 | 378 
Most like employee | * 8.55 4.88 8.14 437 14.73 4.05 14.05 4.28 
Least like employee | 7.47 4.53 8.10 3.98 | 12.84" | 3.83 | 1443 | 402 
Most like neighbor 9.34* 5.16 7.28 3.61 13.66 4.01 14.76 3.89 
Least like neighbor 7.32 5.00 7.67 4.07 13.68 4.80 14.35 3.59 


* Significantly greater than comparable score for selectees of low selector group (p < +05). 
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TABLE 2 


Means, STANDARD DEVIATIONS, AND D Scores or EPPS CHANGE SCORES FOR SELECTEES 
OF INCOMPATIBLE SELECTORS 


High EPPS order selectors (N = 45) 


Low EPPS order selectors (N = 51) 


EPPS change raw 


Sociometric question ‘scores of selectees 


Role-relationship 
D scores of selectees 


Role-relationship 


EPPS change raw 
D scores of selectees 


scores of selectees 


M SD M SDp-s M SD M SDo-s 
Most like boss 16.33 | 472 17.18 | 3.81 
» Maat like boss 1820 | 2.93 —187* | 512 | 1696 | 3.82 22 | 4.83 
‘ost like employee 16. 4.7. = 16.56 4.67 
Teast like employee 17.09 | 4.05 82 | 604 | 1735 | 3.94 FAAA ANSERA 
E like neighbor | 1820. | 3.58 1698 | 417 
“Teast like neighbor | 16.66 | 3.93 153**| 5.00 | iess | 3.76 410 | 417 


Note.—In deriving a D score, the least preferred choice is subtracted from the most preferred choice. 
* Significantly different than comparable D score for low need Order selector group (p <.05). 
** Significantly different than mean D score for the boss role-relationship dimension (p < .01). 


|” posite sides of the same coin. Since the socio- 


metric had most and least liked alternatives it 


was possible to get a finding of this sort. 
Turning to the influence of need incompati- 
bility in interpersonal selection, it was found 


that one of the incompatible need arrange- 


ments reflected a significant finding: Order 
and? Change. Table 2 gives the raw score 
means, standard deviations, and D scores for 
the selectees of the selectors with high and 
low EPPS Order scores. Only the D scores of 
Table 2 reached a significant level of differ- 
ence. Recall that the D scores were obtained 
by subtracting the selector’s least liked se- 
lectee’s EPPS score from his most liked se- 
lectee’s EPPS score on a given role-relation- 
ship dimension. There is a mean and standard 
deviation of D scores cited in the column 


_ titled: Role-relationship D scores of selectees. 


The ¢ test of D scores across high and low 
groups on the boss role-relationship dimension 
was significant (p < .05). The analysis of 
variance on D scores within the high EPPS 
Order subgroup was found to be significant 
(Ż < .01), and a subsequent test differenti- 
ated the boss and neighbor role-relationship 


, _ dimensions (p < .01). 


The findings in Table 2 indicate that se- 


> lectors high in the need of EPPS Order make 


a greater distinction between most and least 
Preferred selectees on the boss role-relation- 
ship dimension than do selectors low in EPPS 
Order, This finding suggests that selectors 
who need Order would prefer having a boss 
with a low need for Change (mean = 16.33), 


if they had a choice, to a boss with greater 
needs for Change (mean = 18.20). On the 
other hand, such selectors seem prone to make 
the opposite discrimination when choosing a 
neighbor. On the latter role-relationship di- 
mension, selectors high in need for Order were 
more likely to prefer a neighbor high in EPPS 
Change (mean = 18.20) over an alternative 
selectee, low in the need for Change (mean = 
16.66). 


DISCUSSION 


It must be stressed that the pairing of 
needs as to compatibility or incompatibility 
was fully decided before data collection. Only 
the pairings as predicted were studied. The 
possibility of finding significant differences by 
chance, due to post hoc pairing of various 
EPPS variables was therefore eliminated. 
There were several statistical tests run, and 
the findings are not prolific. Yet, if one con- 
siders the nature of objective personality test- 
ing, the indirection and unreliability involved, 
plus the fact that our subjects had no idea of 
one another’s EPPS scores, even these limited 
findings are rather remarkable. 

The lack of findings on need similarity may 
be due to the limited acquaintance period sub- 
jects had before making their selections. It 
may be that people choose on the basis of 
compatibility-incompatibility after brief con- 
tact with others, only to single out for longer 
friendships those who have a need pattern 
similar to their own. Personality needs them- 
selves might also be influenced to change and 
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to conform to the needs of long-standing as- 
sociates. Having deferent characteristics seems 
to negate one’s chances of being highly 
preferred as a neighbor by young men of 
this sample’s general background. However, 
neither do these young men seek a highly 
ascendent, self-confident, masculine associate 
to call next-door neighbor. The GAMIN find- 
ings suggest that such interpersonal com- 
petence bore a positive relationship to being 
chosen as least desirable as a neighbor. Ap- 
parently, after a hard and possibly competi- 
tive day at the office, subjects would just as 
soon relax with associates who could conceiv- 
ably build up their egos rather than threaten 
their adjustment all the more. 

The findings on need compatibility were 
fairly clear-cut on the nurturant-succorant 
dimension. Highly nurturant subjects choose 
the more succorant individual as a possible 
neighbor. The results also suggest that those 
subjects low in affiliative needs are likely to 
be rejected as potential employees by subjects 
high in needs for exhibition. The work situa- 
tion is fraught with status considerations. No 
one expects his boss to laugh regularly, but to 
have a subordinate who might not laugh at 
one’s jokes must be very difficult for an exhi- 
bitionist to bear. This kind of reasoning pre- 
sumes that the role relationship within which 
people relate is an important factor deter- 
mining whether or not need compatibility will 
be sought for by the individuals in question, 
or whether incompatibility will be tolerated. 

Fortunately, the D score findings on incom- 
patibility support this reasoning, because they 
suggest that incompatibility in need struc- 
tures may be both avoided and literally sought 
for within different role relationships by the 
same subjects. Table 2 findings indicate that 
highly orderly subjects made a significant dis- 
tinction of this sort. When it came to choos- 
ing a potential boss they preferred having one 
with personal stability and little need “to ex- 
perience novelty and change in daily routine 
[Edwards, 1954, p. 14].” On the other hand, 
when it came to selecting a neighbor, their 
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preference went in the opposite direction. It f 
is as if the orderly individual is fundamentally 
attracted to the changeable individual, but 
wants to relate to him only after working | 
hours, when he has the protection of infor- 4 
mality and economic independence. Even s0, 

it is comforting to think that people may not 
always want to fit together perfectly, com- 
patibly, or similarly in need structure pre- | 
liminary to the formulation of an interper- 4 
sonal contact. At times, in the proper circum=" 
stance, they may very well enjoy relating to | 
someone who is quite at variance with their 
own personality. 
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TOLERANCE FOR UNAMBIGUOUS FEEDBACK* 


SOLOMON E. FELDMAN ann JOY K. RICE 


University of Wisconsin 


In contrast to previous work, the present study defines tolerance for ambiguity 
in terms of differential preferences and predicts that compared to Ss in a per- 
missive atmosphere, Ss exposed to a stressful atmosphere will prefer ambiguous 
evaluative feedback. A 2nd hypothesis was that these preferences should be re- 
lated to autonomous vs. dependent sets. After set and atmosphere induction, 
Ss selected the responses to either of 2 inkblot tests as a basis for a “diagnostic” 


decision. Either selection elicited the sami 


e information from E, but one resulted 


in clear-cut feedback, while the other resulted in ambiguous feedback on 50% 
of the trials. The results confirmed the major hypothesis but showed no signifi- 


cant effects due to set. 


A typical paradigm for the investigation of 
the concept of tolerance for ambiguity con- 
fronts subjects with ambiguous stimuli under 
conditions of varying stress, with the expecta- 
tion that differential responses can serve as 
indices of the discomfort the subject experi- 
ences in the face of ambiguity. Operations 
have included measurement of the speed of 
closure (Smock, 1955a), assessment of the 
suhjective probability of event occurrence and 
the certainty of judgment (Brim & Hoff, 
1957), and decision making under conditions 
of incomplete (Dittes, 1961) or incongruous 
(Smock, 1955b) information. 

These procedures, at best, can provide only 
indirect support for the hypothesis of differ- 
ential tolerance of ambiguity. Looking at one 
typical study, Smock (1955a) presented a se- 
ries of incomplete figures to subjects and in- 
structed them to attempt to identify each fig- 
ure in as few cards as possible. There were no 
penalties for incorrect responses. Before each 
subject began guessing he was presented with 
a list of hypotheses as to nature of each fig- 
ure. The subjects in the stress condition (fail- 
ure and rejection) tended (p < .10 > .05) to 
respond earlier than the subjects in the se- 
curity condition (success and acceptance), 
although by the fifth, and last trial, there was 
essentially no difference between the two 


1 This study was partially supported by the Na- 
tional Science Foundation and the Wisconsin Alumni 
Research Foundation through the Wisconsin Univer- 
sity Research Committee. It is based upon a thesis 
submitted in partial fulfillment of the requirements 
for the degree of Masters of Science by the second 
author under the supervision of the first author. 


groups. The subjects in the stress group had 
a consistently higher rate of incorrect recog- 
nitions. 

Smock hypothesized that the stress pro- 
duced anxiety which decreased tolerance for 
ambiguity as shown by the “premature clo- 
sure” of the subjects in the stress group. The 
difficulty with this line of reasoning is the 
potential confounding of the behaviors used 
to define decreased tolerance for ambiguity 
with the effects of stress per se. Increased 
pressure to conform to the nonoptimal strat- 
egy suggested by the experimenter, flattened 
generalization gradients produced by in- 
creased drive (eg., Bersh, Notterman, & 
Schoenfeld, 1956), and faster responding via 
the interaction of drive and habit strength 
(Hull, 1943) are plausible alternative ex- 
planations. In fact, one might consider the 
subjects in the security condition as intoler- 
ant of ambiguity since these subjects, despite 
the experimenter’s encouragement to respond 
quickly, did not venture a response until the 
stimulus picture was almost complete. 

In addition to the potential confounding, 
response tendencies in the face of incomplete 
information have been variously interpreted 
as a function of a need to enhance self- 
esteem (Dittes, 1961), suspiciousness (Binder, 
1958), cautiousness (Messick & Hills, 1960) 
and more generally, as a function of the utili- 
ties associated with alternative decision strat- 
egies (Swets, Tanner, & Birdsall, 1961). 

Not only have previous operations fallen 
short of clearly delineating intolerance of am- 
biguity from alternative constructs, but the 
assumed relationship between intolerance of 
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ambiguity and stress can also be questioned. 
In Millon’s (1957) study, ego-involving 
(stress-producing) instructions in the context 
of judgments of autokinetic movement did not 
produce greater intolerance of ambiguity. 
Similarly, Blood (1961) found anxiety level 
unrelated to intolerance for ambiguity as 
measured by time for first response to de- 
creasingly ambiguous stimuli, while Wolff 
(1955) concluded that there was little sup- 
port for an inverse relationship between de- 
sire for certainty and anxiety. 

At least one investigator (Hamilton, 1957) 
found that anxious subjects, in contrast to 
other types of neurotic subjects, preferred 
ambiguous commitment. In two of three tests 
of discrimination, anxious individuals had a 
higher mean of “can’t decide” scores than did 
the controls. 

While, despite the previously mentioned 
dangers, it is entirely legitimate to define 
tolerance for ambiguity in terms of some form 
of psychological or physiological disequilib- 
rium in the presence of ambiguity, the present 
study attempts to approach the problem via 
the study of preference behaviors. Its aim is 
to demonstrate that stress may, in fact, in- 
crease preference for ambiguity, a direct re- 
versal of the hypothesis that stress decreases 
tolerance for ambiguity. 

There is evidence that individuals desire 
feedback for self-evaluative processes (Fest- 
inger, 1954), particularly feedback which is 
consonant with their benefits and self-images 
(Adams, 1961; Ehrlich, Guttman, Schönbach, 
& Mills, 1957; Mills, Aronson, & Robinson, 
1959). Under conditions of stress individuals 
may distort or deny feedback inconsistent 
with their self-image (Hogan, 1952; Hunt & 
Schroder, 1958; Goldstein, 1959; Schroder & 
Hunt, 1957). It also has been demonstrated 
that knowledge of results, that is, feedback, 
can be detrimental to the performance of the 
highly anxious person (Mandler & Sarason, 
1952). It appears reasonable to hypothesize 
that under conditions of stress, potential 
negative feedback is threatening to the indi- 
vidual. Under such conditions, he will prefer 
ambiguous to clear-cut feedback, because the 
lack of structure facilitates processes that al- 
low for greater protection of the self-image. 

In contrast, Rogers (1951) suggests that 
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a permissive atmosphere encourages accurate 
self-evaluation and stimulates an examination 
of the realities of one’s life circumstances, 
It is likely that under these conditions the in- 
dividual would be less prone to attempt self- 
deception. Instead, under the impetus of the 
push toward self-actualization, we would ex- 
pect the individual to seek out clear-cut feed- 
back (positive or negative) as a guide toward 
further growth. 

In summary, the major experimental hy- 
pothesis is that relative to subjects in a stress 
atmosphere, subjects under conditions of per- 
missiveness will exhibit greater preference for 
clear-cut, in contrast to ambiguous, feeback. 

A second concern of the present study is 
the relationship of an autonomy-dependency 
continuum to the preference for ambiguous 
versus clear-cut feedback. The dependent per- 
son, as contrasted to the independent person, 
has been found to be more group-minded 
(Barron, 1953). A greater need for self- 
evaluation and approval from others is often 
associated with dependency, while, on the 
other hand, independence is sometimes Seen 
as a counterpart to creativity and a desire for 
freedom of individual choice (Barron, 1953). 
In light of the above, we would expect that 
relative to conditions inducing autonomy, sub- 
jects under conditions fostering dependency 
would have a greater need to seek out the ap- 
praisals of others and thus would exhibit a 
greater preference for clear-cut than am- 
biguous feedback. 


METHOD 
Subjects 


Sixty undergraduate women, enrolled in various 
introductory psychology sections at the University 
of Wisconsin, served as subjects. Each subject was 
run individually in sessions lasting approximately 7 
minutes, The study utilized a 2 X 3 factorial design 
with 10 subjects in each of the six cells. Cell assign- 
ment was randomized by order of appearance. 


Procedure 


One half of the subjects were exposed to a per- 
missive atmosphere while the other half of the sub- 
jects were exposed to a stressful atmosphere. Within 
each of these conditions, subjects were divided into 
three subgroups receiving either an autonomous set, 
a dependent set, or no set. 

In the stress condition, the experimenter met the 
subject with the comment that she was late. Denial 
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was ignored except when it was very persistent in 
which case the experimenter essentially repeated the 
original comment. No other conversation was made 
with a subject, and the subject was shown to her 
seat by a gesture. In general, the experimenter’s man- 
ner with a subject in this condition was serious and 
task oriented. 

In the permissive condition, the experimenter in- 
troduced herself to the subject, pleasantly asked the 
subject to sit down, and casually questioned the 
subject as to how she liked her psychology course. 
The experimenter’s manner with these subjects was 
friendly and less businesslike. Except for an occa- 
sional smile for subjects in the permissive condition 
during the reading of the instructions, the experi- 
menter’s attitude was the same for all subjects dur- 
ing the remainder of the experiment. 

All subjects were told that the purpose of the ex- 
periment was to investigate the possibility of train- 
ing the layman in the identification of maladjusted 
individuals, The details of the experimental pro- 
cedure were then explained. A subject was told that 
on each of a long series of trials she would see the 
response that a different individual made to one of 
two new inkblot tests, the “Howard” or “Thomp- 
son.” She was further informed that she was to use 
each response to estimate the individual’s level of 
adjustment. 

The subject was told that she was to begin a trial 
by requesting either the individual’s response to the 
Howard or his response to the Thompson. The ex- 
perimenter would then present a card to the subject 
on which the appropriate response was printed. The 
subject’s choice here was inconsequential, since, un- 
known to the subject, the test response shown was 
the same for either request. 

After reading the test response, the subject. was to 
decide whether the individual was adjusted or mal- 
adjusted by placing the card in the appropriate slot 
of the experimental apparatus. The subject then re- 
ceived another card from the experimenter with 
some information concerning the person about whom 
the diagnostic decision had been made. The subject 
was told that the purpose of the information card 
was to provide feedback as to how well she was 
doing in her diagnostic decisions. Unknown to the 
subject, if she chose to see inkblot responses made 
to one set of cards (Howard or Thompson, counter- 
balanced within each cell), information feedback 
about the person would always be “clear-cut,” that 
is, it was obvious as to whether the person was ad- 
justed or maladjusted. Within this feedback condi- 
tion, 50% of the statements were clearly indicative 
of adjustment, and 50% were clearly indicative of 
maladjustment. This noncontingent reinforcement 
was randomized within blocks of 12 trials. If, how- 
ever, the subject chose to see the response made to 
the other set of inkblots, 25% of the time the feed- 
back would be clearly indicative of adjustment, 25% 
of the time the feedback would be clearly indicative 
of maladjustment, and 50% of the time the feedback 
would be ambiguous, that is, not clearly indicative 
of degree of adjustment. Once again, reinforcement 
was noncontingent and randomized within 12 trials. 


343 


The sequence of reinforcement was randomly deter- 
mined across subjects within both the clear-cut and 
the ambiguous feedback contingencies. 

The subjects were informed that each trial re- 
quired a diagnostic decision to be made about a dif- 
ferent individual. They were told that they could 
switch from requesting the Howard or the Thomp- 
son as frequently as they chose and that each diag- 
nostic decision was to be made independently of all 
previous ones. 

After a subject was allowed to ask questions about 
the procedure, she was told that she was to take a 
personality test designed to measure some traits rele- 
vant to the ability to make diagnostic decisions. The 
subjects in the “autonomous” condition received a 
list of 20 true-false statements headed “Scale of 
Self-Trust” which portrayed dependency in a nega- 
tive light and autonomy in a positive light (e.g. “A 
person should never trust his own opinions without 
checking it with others,” “I usually make my own 
decisions,” “I would rather have other people’s praise 
than personal satisfaction”). The subjects in the 
“dependent” condition received a list of statements 
headed “Scale of Exaggerated Self-Importance” 
which portrayed autonomy unfavorably (eg, “I 
know more than most of my teachers,” “Asking 
other people’s advice is a sign of weakness,” “I don’t 
care what anyone thinks of me”). The statements in 
the “no-set” condition were irrelevant to the au- 
tonomy-dependent dimension (eg, “I would like 
cooking better than sewing”). 

Upon completion of the personality test the ex- 
perimental instructions were briefly summarized for 
the subject and the task was begun. During the en- 
tire experimental phase, the experimenter and the 
subject were hidden from each other’s view by the 
apparatus screen. After the first block of 25 trials, 
the subject was told that occasionally (each 25 
trials) she would be asked to indicate how well she 
thought she was doing by placing a vertical mark 
on a 7-point scale. The conclusion of the sixth self- 
evaluation ended the experimental phase of the ses- 
sion. The subject was then given a questionnaire de- 
signed to assess reactions to the experimenter and 
the experiment.? 


Materials and Apparatus 


The experimental apparatus consisted of a table 
upon which was mounted a heavy white board with 
three slots. The two labeled “adjusted” and “mal- 
adjusted” were employed by the subject when she 
made her categorization and the third was used by 
the experimenter to deliver the stimulus and infor- 
mation cards and by the subject to return the in- 
formation cards. The left-right position of the ad- 
justed and maladjusted labels was counterbalanced 
for all subjects within each experimental group. 


2Each subject also completed a 120-item person- 
ality inventory designed to measure 12 traits thought 
to be related to preference for ambiguous or clear- 
cut feedback. Since cross-validation is demanded, the 
relationship of organismic variables to feedback pref- 
erence will be dealt with in subsequent publications. 
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The stimuli used for the adjustment decision were 
150 Rorschach responses. The previous ratings of a 
group of approximately 75 undergraduates enrolled 
in a behavior pathology course at the University of 
Wisconsin indicated that the responses were “am- 
biguous” with regard to the presence or absence of 
pathology. Examples of the type of responses are 
“headless bears dancing,” “blackened trees after a 
forest fire,” and “a man admiring himself in the 
mirror.” The complete set of 150 inkblot response 
cards was duplicated on pink and green cards in 
order to make the subjects believe that they were 
choosing inkblot responses to two different tests, 
that is, Thompson and Howard. The pairing of am- 
biguous or clear-cut feedback with the color of the 
stimulus cards, as well as the Howard or Thompson 
inkblot test label, was counterbalanced for all sub- 
jects within each experimental group. 

The 225 feedback cards were selected from a larger 
pool of statements classified as either indicative of 
adjustment, maladjustment, or ambiguous with re- 
gard to adjustment on the basis of prior ratings by 
a similar group of undergraduates. Examples of am- 
biguous feedback would be “This individual has 
never been out of his home town” and “This indi- 
vidual likes to read newspaper articles on crime.” 
Clear-cut feedback would be typified by “This young 
man eventually committed suicide” and “This person 
is a successful lawyer and recently opened his own 
firm.” 

The self-evaluation device was a 7-point rating 
scale with the extreme points labeled “good” and 
“bad.” The left-right position of good versus bad on 
the scale was randomized across evaluations for each 
subject. 

The questions on the postexperimental question- 
naire were open-ended questions designed to meas- 
ure the subject’s reaction to the experimenter and 
the experiment. The comments of each subject were 
rated by the experimenter and one other judge on 
a 5-point scale (highly negative, negative, neutral, 
positive, and highly positive) without knowledge of 
the experimental group to which each subject be- 
longed. The interjudge reliabilities were uniformly 
high (lowest r = .77). 


RESULTS 


Effectiveness of Stress and Set Operations 


Compared to the subjects in the stress 
group, the subjects exposed to the permissive 
atmosphere reported that they felt more posi- 


TABLE 1 


MEAN NUMBER or CHOICES or CLEAR-CUT FEEDBACK 
4 AS A FUNCTION OF PERMISSIVENESS AND SET 


Autonomous 


Dependent No set 
Permissive 66.3 65.5 69.9 
Stress 58.2 58.9 62.6 
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TABLE 2 


ANALYSIS OF VARIANCE SUMMARY TABLE: NUMBER OF 
Cnrorces or CLEAR-CUT FE S 
A FUNCTION OF PERMISS 


AND SET 
Source df | F 
Permissiveness (A) 1 6.085*. 
Set (B) 2 ns 
AXB 2 ns 
Error (MS) 54 | (132.56) 
*p <.025, 


tive toward the experimenter (p < .001) and 
more comfortable during the experiment (b 
< .01). Those subjects reporting more post 
tive feelings toward the experimenter also 
showed a greater preference for clear-cut 
feedback (r = .34, p < .05). 

There is evidence that the operations in- 
tended to produce the autonomous and de- 
pendent sets did not achieve the desired ef- 
fects. Postexperimental questioning revealed 
that subjects had difficulty in remembering 
very much about the set-inducing “person- 
ality test.” ‘ 


Preference for Clear-Cut Feedback 


Since it was assumed on an a priori basis 
that during the first 25 trials subjects would 
be learning the feedback expectancies, only 
the number of clear-cut choices in the last 
125 trials was used in subsequent analyses. 

The mean number of clear-cut choices for 
each group are presented in Table 1, and the 
results of the 2 x 3 anova used to test the 
effects of set and permissiveness upon pref 
erence for clear-cut versus ambiguous feed- 
back is summarized in Table 2. 4 

The hypothesis that relative to subjects 19 
a stress condition, subjects in a permissive 
condition would exhibit a greater reference 
for clear-cut, in contrast to ambiguous, feed- 
back was confirmed (p< .025); however, 
there were no statistically significant differ- 
ences as a function of the three sets or the 
interaction of atmosphere and set. 

In order to make specific comparisons 
among the six means, the Duncan range test 
was employed. The only significant difference 
was between the permissive-no set groups a” 
the stress-autonomous groups (p < .05). 


a 
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` Self-Evaluation Scale 


Differences between the means of the six 
experimental groups on the self-evaluation 
scale were not statistically significant; how- 
ever, the average rating across all six self- 
evaluations was positively related to a high 
score on the total postexperimental question- 
naire (r= .36, p < .05). This relationship 
was particularly magnified for the subjects in 

. the permissive group (r= .68, p <.001). 

_. Furthermore, for these same subjects, high 
self-evaluations correlated significantly with 
‘preferring clear-cut feedback for all six trial 
blocks (r= .36, p< .05) and just missed 
significance when only the last five trial blocks 
were considered (7 = .34). These correlations 
were very low within the stress group. 


Discussion 


_ The results indicate that individuals who 

are faced with decision making under stress- 
ful conditions, in contrast to conditions of 
permissiveness, exhibit a greater preference 
for ambiguous evaluative feedback. It has 
beerf suggested that such information is less 
threatening to the individual because it is 
possible for him to interpret it as he desires. 
In the present study, the question of whether 
ambiguity was used for purposes of avoiding 
failure or for purposes of feigning success is 
not immediately answerable. However, one 
approach to this problem might be to have a 
subject indicate on each trial whether she was 
right or wrong, and thus a measure of the sub- 
ject’s perception of the ambiguous feedback 
. could be obtained. 

The results of this study contradict those of 
Smock (1955a) and Brim and Hoff (1957), 
who maintain that conditions defined as 
stressful or frustrating increase the individ- 
ual’s intolerance of ambiguity and his desire 
for certainty. However, as suggested in the 
introduction, the contradiction can be re- 
» . solved by careful examination of the defini- 
tion of the construct, intolerance of ambigu- 
ity. In the last analysis, the previous investi- 
gations are not directly comparable to the 
present one because in no previous study 
(with the possible exception of Hamilton’s, 
1957, study) has the actual preference for 
ambiguity or nonambiguity been investigated 
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and because this study concerns itself with 
ambiguity within a feedback context. 

In the present study further work needs to 
be done in specifying the particular organis- 
mic states produced by the experimental op- 
erations. While it was assumed that the oper- 
ations were stress provoking or anxiety 
arousing, alternative postulations such as hos- 
tility, lack of motivation, etc., remain as pos- 
sibilities. In particular, it is possible that 
stress-inducing operations could have aroused 
hostility instead of or in addition to anxiety. 
The subjects may then have been motivated 
to aggress against the experimenter by choos- 
ing the ambiguous dimension and thus pro- 
vide the experimenter with ambiguous data. 
Although this explanation cannot be ruled 
out as a factor that may have affected the 
responses of some subjects it generally attrib- 
utes to the subjects a greater sophistication 
than either past experience with this popula- 
tion or postexperimental questioning reveals. 

The implication of these results for educa- 
tion and understanding psychopathology are 
self-evident. Individuals in a stressful atmos- 
phere with potential (perceived or real) 
negative evaluation hanging over their heads 
should prefer engaging in behaviors that lead 
to ambiguous evaluations (e.g. last-minute 
cramming, keeping people at distance, etc.). 
Less obvious is the fact that the feedback 
preference model is in accord with Atkinson’s 
(1957) prediction that individuals with a 
high fear of failure will engage in behaviors 
that have very high or very low probabilities 
of success. Attempting the impossible or try- 
ing a “sure thing” are excellent ways to elicit 
feedback that is ambiguous with regard to 
ability. 

Although the analysis of variance failed to 
yield significant differences between the au- 
tonomous and dependent groups in the mean 
number of clear-cut feedback choices, within 
both the permissive and stress conditions the 
subjects given a set tended to prefer more 
ambiguous feedback than the subjects given 
no set. It appears that the particular set, 
“autonomy” or “dependency,” was irrelevant 
and that the set-inducing operation, per se, 
was an important factor. While even this 
dichotomy failed to reach significance, the 
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trend in the data is suggestive enough to 
justify interpretative attempts. The set-induc- 
ing operations may have led subjects to be- 
lieve that good diagnosticians were particular 
types of individuals and that, to perform well 
on a diagnostic task, a particular frame of 
reference would be necessary. If such an ex- 
pectation constituted additional stress for the 
subject, it may have contributed to the pref- 
erence for ambiguous feedback. 

In addition to the very real possibility 
that the negative result in the autonomy- 
dependency differentiation was due to a weak 
manipulation of this dimension, the ineffec- 
tiveness of the set-inducing operation may 
have been due to the fact that simply raising 
the autonomy-dependency issue to college 
students would tend to obscure differential 
attitudes, It is well known that in adolescence 
the severing of family ties and the develop- 
ment of independence are prime conflicts. 
Raising this issue via the “personality test” 
may have exacerbated a typical solution to 
this conflict, namely, exaggerated autonomy 
and thus induced a greater preference for 
ambiguous feedback, Of course, these are 
speculations, and only future investigation 
can make the meaning of this result clear. 

Several results concerning the self-evalua- 
tion measure bear amplification. It was found 
that for the subjects in the permissive condi- 
tion, but not for the subjects in the stress 
condition, high self-evaluations correlated 
positively with preferring clear-cut feedback. 
The latter correlation suggests that, given per- 
missive conditions, individuals with more self- 
confidence can more easily accept clear-cut 
feedback and, in fact, prefer it for realistic 
appraisal. Perhaps because these individuals 
value themselyes and are confident of their 
performance, the possibility of criticism or 
failure does not pose a threat and is more 
easily accepted, Under stress conditions, how- 
ever, the reactions to the stress might obscure 
whatever relationship there would be between 
high self-evaluation and preference for clear- 
cut feedback; thus, although the correlation 
between these two variables within the stress 
condition was in the same direction, it did 
not reach significance (r = .10). 
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OTHERS’ SELF-EVALUATIONS AND INTERACTION. ANTICIPATION | 
AS DETERMINANTS OF SELF-PRESENTATION * 
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Duke University 


Female Ss faced a partner who was either self-centered, self-derogating, or who 
evaluated herself in an average fashion. 4 of the Ss in each of these groups 
anticipated further interaction with the partner; 4 expected no further inter- 
action. All Ss then made a number of self-ratings which were to be sent to 
their partner. As predicted, Ss rating themselves for the self-centered partner 
became more positive in their self-ratings, and the self-derogating partner caused 
Ss to emphasize more negative self-characteristics. Contrary to prediction, the 
average partner caused Ss to emphasize more negative features of self when 
future interaction was anticipated. Such modifications did not seem to depend 


on conscious mediation. 


The information we present to others about 
ourselves is seldom selected at random. We 
constantly face the dilemma of choosing 
from a vast storehouse of self-knowledge the 
appropriate items for public display. One of 
the more crucial periods for such decision 
making is during the formative stage of a 
relationship. As Thibaut and Kelley (1959) 
have pointed out, the fate of any relation- 
ship depends to a great extent on the level 
of outcomes experienced by the respective 
members during the introductory stages. The 
present investigation was an attempt to ex- 
plore the ways in which the attributes of one 
member of a dyad and the interaction con- 
text can modify the self presented by a 
second member during the initial stages of a 
relationship. 

One principal dimension along which self- 
referent statements can vary is that of posi- 
tiveness, The importance of this dimension is 
well reflected in the abundance of psycholog- 
ical studies using the variable of self-esteem. 
By far the major portion of the literature on 
self-esteem has found it convenient to assume 
relatively stable feelings of self-regard within 
any individual. Thus many of the studies in 
this area have been devoted to demonstrating 
the differential reactions to various situations 
of persons deemed to be either high or low 
in self-esteem. 


* This article is based on a thesis submitted by 
the second author in partial fulfillment for the BA 
degree with distinction at Duke University. 

2 Now at Brandeis University. 
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On the other hand, the importance and to 
some extent the validity of a concept like í 
self-esteem depends largely on the consistency 
of the overt behavior of the individual. If a 
person is completely capricious in his ex- 
pressed feelings of self-regard, any single 
measure of self-regard would constitute a poor 
predictive device. And yet, we are all quite 
aware that persons do indeed vary in .what 
they say about themselves from one situation 
to another. Many such variations have been 
discussed by Goffman (1959); others have 
been more formally exposed in studies like 
those of Jourard and Lasakow (1958), and 
Katz (1942). There are two prominent ques- 
tions which such studies pose: What are the 
systematic determinants of such variations? 
Under what circumstances do these overt 
variations reflect changes in the subjective 
state of the person? The present study was 
devoted primarily to shedding light on the 
former of these questions. In doing so, how- 
ever, it was hoped that at least some evidence 
could be generated concerning the latter. 

There are a number of studies which havê | 
dealt with systematic changes in self-evalua- 
tions. Several investigations, for example, have 
explored the conditions under which success 
and failure affect a person’s evaluations of 
self (cf. Stotland & Zander, 1958; Vide- 
beck, 1960). From a different vantage point, 
Jones, Gergen, and Davis (1962) and Jones, 
Gergen, and Jones (1963) have demon- 
strated the effects of various interaction 
goals and status assignments on a person's 


s 


l 
E 


| public self-evaluation. In a further study 
(Gergen, 1965), the effects of social feed- 
back on the positiveness of self-ratings were 
_ explored under systematically varied con- 
ditions. In this study it was possible to 
distinguish between those conditions giving 
rise to public versus private self-evaluations. 
However, it also seems apparent that a 
person’s public self-evaluations are highly 
„ dependent on personal attributes of others in 
the same situation. For example, Davis (1962) 
demonstrated that the degree of interpersonal 
dominance displayed by one member of a 
dyad influenced the degree of dominance 
manifested by the other. In the present study 
the question was whether one can display 
the same social identity before another 
* who is seen to be self-centered and ego- 
tistical, as before another who is self-effacing 
or self-demeaning? To shed light on this 
„question subjects were exposed to a person 
who described herself in a very positive 
light, revealed both good and bad points, 
or who was very self-derogating. Each of 
these situations, it was felt, would pose for 
subjects a specific kind of problem in impres- 
sion management (Goffman, 1959) and these 
problems would be resolved by differential 
- self-evaluations. Before discussing the specific 
predictions a word must be said about the 
context of interaction. 
The personal attributes of another can 
seldom be considered in exclusion of the 
` circumstances in which they are encountered. 
Although there are a number of context 
factors of relevance in the present case, one 
of the more intriguing aspects of a social 
relationship has to do with the participants’ 
perceptions of how long the relationship 
will last. It has been noted, for example, 
that when interacting with a stranger a 
person is often more open and revealing 
than with a personal friend. Thibaut and 
Kelley (1959) have termed this the “stranger 
passant” phenomenon and have theorized 
3 that in a short-lived relationship the par- 
ticipants are not threatened by problems of 
dependency. The amount of anticipated in- 
teraction would thus seem to have some 
interesting implications for the present study. 
The solution to each of the problems men- 
tioned above should be differentially altered 
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as a result of the subjects’ perceptions of 
the duration of the relationship. 

We may now turn to a more complete 
description of the various problems and 
their probable resolutions. First of all, one 
major problem for a person finding himself 
faced with a highly positive other might be 
termed power restoration. Interacting with 
another who displays an impeccable person- 
ality can often be an intimidating experience. 
Such a person immediately usurps a position 
of seniority in a dyad and forces the other 
member into a usually undesirable, low- 
status position, Given little latitude of be- 
havior, as subjects in the present experiment 
were, the optimum solution to such a problem 
would seem to be to counter in kind, that 
is, to become more positive about oneself. 
This anticipated self-enhancement should, 
however, be most dramatic when further 
interaction is anticipated. When the possibility 
of future intimidation and dependency is 
nonexistent, the necessity for redressing a 
power imbalance should not be as great. 

When faced with another who demonstrates 
that he has at least a number of worthy 
attributes and enough honesty to admit 
shortcomings, a different type of problem 
emerges. It would seem that such a person 
should cause one to seek out his friendship 
and the problem can thus be termed one of 
acquaintance seeking. One method of becom- 
ing better acquainted in the present experi- 
ment would be also to demonstrate both good 
and bad features of self, that is, “be oneself.” 
However, as Thibaut and Kelley (1959) have 
suggested, when a long-term relationship is 
anticipated a person can ill afford to reveal 
weaknesses, It was thus predicted that sub- 
jects would be more positive about themselves 
when further interaction was anticipated than 
when no further interaction was expected. 

When another is encountered who is self- 
effacing, still a third problem is generated. 
The self-derogating other immediately places 
himself in a position of dependency. The 
stronger partner in such a dyad is soon 
faced with the major share of the respon- 
sibility for the welfare of the dyad. The 
weaker member often acts as a parasite, 
thriving at the expense of the personal 
resources of the other. The person encounter- 
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ing such another might be said to be faced 
with the problem of succorance avoidance. 
Before considering the solution to such a 
problem, it should also be noted that the 
person who seems to have almost no good 
qualities often elicits feelings of pity or 
sympathy. Such feelings greatly complicate 
the problem of succorance avoidance, for it is 
usually a very difficult matter to turn a 
cold shoulder on one who has admitted that 
you are everything and he is nothing. In 
such a situation the variation in interaction 
anticipation should play a very important 
role. If future dependency is expected, the 
optimum solution should be to avoid the 
self-derogating other at all costs. One seem- 
ingly effective avoidance tactic would be 
to emphasize as many differences between 
yourself and this person as possible. This 
leads to the prediction that the self-derogat- 
ing other should cause subjects expecting 
interaction to become more self-enhancing. 
However, when the relationship has no 
future, subjects should be free to express 
the pity felt for the low self-esteem other. 
One way of expressing such feelings might 
be to say, “Don’t feel so bad; when you 
get right down to it none of us are so good.” 
In the present study, such an expression 
would be manifested in a decreased emphasis 
on one’s positive features. 

To summarize, the major predictions of 
this study can be formally stated as follows: 

1. Under conditions of no anticipated 
interaction, the positiveness of self-presenta- 
tion will be a linear function of the perceived 
egotism of the other. A highly positive other 
will induce self-rating changes in the posi- 
tive direction and a self-demeaning other will 
produce changes in the negative direction. 

2. Under conditions of anticipated inter- 
action, the positiveness of self-presentation 
will be a curvilinear function of the other’s 
egotism. A highly positive and highly negative 
other will elicit positive self-presentation, and 
the medium other will produce little change. 

3. The positiveness of self-presentation 
under conditions of anticipated interaction 
will be displaced in a more positive direction 
than under conditions of no anticipated 
- interaction. 
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METHOD 
Subjects 


The subjects were 58 undergraduate fen 
enrolled in an introductory psychology course, Di 
the early stages of the course, all stude 
required to fill out a self-evaluation mi 
be described below. From this initial 
persons who scored within approximately the 
or lower 15% of the sample were elin 
from further consideration. Of the remaining 
58 subsequently participated in the experi 
Selection of subjects and assignment to 
mental conditions was carried out on a rand 
basis. One subject was excluded from the 
because of suspicion of the manipulation 
three others were randomly eliminated in o! 
equalize the cell frequencies. 


Self-Evaluation Measure 


The 30 items composing the positiveness of 
Presentation measure were taken from a mi 
developed by Dickoff (1961). Each item 
the form of a short self-descriptive phrase, | 
as “more emotionally mature than average,” 
considerate of others,” etc, Original items had b 
assigned scale values for their degree of positive 
by a group of independent raters. Fifteen of 
items were highly positive and 15 were con 
by the raters to represent negative attributes. 
instructions required a person to assign any n 
ber of points from 0 to 10 to each item accor 
to how representative he felt the item to bi 
himself. Either a large number of points a 
to positive items or a small number to 
items would thus be indicative of high self- 


Experimental Procedure 


The experimenter met with groups of 8-10 | 
jects who were not generally acquainted with 
other. He described the experiment as one d 
to explore the way in which two people 
know each other. After discussing some 
related topics, such as, types of statu 
tionships among members of a military hi 
subjects were told that they would each be 
to a separate booth. Once in the booth i 
would communicate in writing to a 
ostensibly one of the other members of the 
whose identity would be unknown. Such anon 
was said to eliminate the influence of prev 
acquaintance and to allow a written recor 
all communications to be obtained. 

The subjects were further told that once 
the booths, half of them would find a form 
self-evaluation measure and associated quesi 
on which they were to describe themselves. 
descriptions were then to be taken by the 
menter to a partner, a member of the other 
of the group. The other half of the group. 
actuality, the entire group) were to find 
zines in their booths which they could read 
waiting. After receiving the communication 


S 


* .of points assigned to positive and negative 
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second partner would fill out an identical set of 
materials which were then to be returned to the 
first communicator, 

Half of the experimental groups were then told 
that the conditions of anonymity would be broken 
neither during nor after the communication ex- 
change. Persons in these conditions, in other words, 
would never get to know the identity of the 
partner with whom they communicated (no-anticipa- 
tion condition), The remaining experimental groups 
were told that they would meet the partner after 
‘they had finished the communication exchange. 
Further, the experimenter expressed his expectation 
that the various pairs would become much better 
acquainted with each other during the semester, 
and that later in the semester they would be asked 
to participate in additional experimentation to- 
gether (anticipation condition). 

The experimenter then took each subject to a 
separate soundproof booth, where a new issue of 

a women’s magazine was found. After approxi- 
mately a 10-minute interval the experimenter 
brought the stimulus materials to the subjects. 
There were three different sets of stimulus ma- 
‘terials, one of which was delivered to each sub- 
ject on a random basis, Each set contained the 
self-evaluation measure and three handwritten 
paragraphs, ostensibly filled out by actual partners. 
Regarding the self-evaluation measure, the number 
of poipts assigned to items by the partners in 
the three relevant conditions was based on an 
assessment of the averages assigned to these items 
by the initial sample from which subjects were 
drawn. Whereas the average (medium-positive) 
partner assigned the same number of points to 
the items as the average student in the sample, the 
egotistical (high-positive) other was much more 
Positive than the average and the humble (low- 
positive) partner differed from the average by the 


. same amount but in the opposite direction. Care 
. was taken that the absolute or total number of 


Points assigned to all items by the fictitious partner 
was the same for all conditions, In addition, it 
seemed that high or low numbers of points as- 
signed to a particular item by the partner could 
introduce unwanted bias. Thus two sets of 15 dif- 
ferent numbers, one for positive and one for 
negative items, were constructed for each of the 
three sets of stimulus materials. Within each of 
these conditions the sets of numbers were then 
rotated through the items from subject to subject. 
Thus each subject received a different number of 
Points on the various items but the total number 
sae items 
within any condition remained the same. In order 
* to maintain as much constancy as possible, each 
particular configuration of digits for each of these 
conditions appeared twice, once in the anticipation 
condition and once in the no-anticipation condition. 

As noted, in addition to the self-evaluation 
Measure each subject received three paragraphs 
supposedly written by her partner. These para- 
graphs were in response to questions concerning 
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high-school or prep-school activities, relationships 
with peers, and future plans, The content of 
these paragraphs was also varied for the three 
partner types. In the high-positive condition, the 
partner said that she enjoyed high school, got 
along perfectly with others, and looked forward 
to the future. The partner in the medium- 
positive condition, however, felt that she enjoyed 
high school most of the time, got along with 
others fairly well, and had at least some doubts 
about the future. The low-positive partner felt 
very depressed about her high-school days, her 
relations with others, and her future. In spite of the 
differences in feeling tone, the attempt was made 
to keep various other aspects of the paragraphs as 
similar as possible. For example, each partner had 
participated in the very same activities with equal 
success. In addition, the total number of written 
words was constant across conditions and all para- 
graphs were written in the same hand. 

After the subjects had an ample opportunity to 
digest these materials, the experimenter reappeared, 
bringing with him a fresh set of materials identical 
to those filled out by the supposed partner. He 
again reminded each subject that these were to be 
sent to her partner. After a 10-15 minute interval 
the experimenter again returned to the booth and 
picked up the subject's self-ratings and asked her 
to make a set of ratings of her partner, ratings 
which the partner was not to see. These ratings 
were designed to assess the subject’s perceptions of 
the partner. After another 10-15 minute interval, 
the experimenter again returned and gave the sub- 
ject an additional questionnaire to fill out. This 
questionnaire dealt with such matters as the sub- 
ject’s awareness of shifts in self-presentation and 
other aspects of her behavior in the situation. 

Upon completion of the questionnaire each sub- 
ject was asked to return to the room in which the 
group had originally met. After all subjects had 
returned the experimenter attempted to assess sus- 
picion and revealed the rationale behind the ex- 
periment. All subjects were asked to cooperate by 
not divulging the details of the experiment to their 
classmates. 


RESULTS 


Validation of the Manipulations 


There was good independent evidence that 
the manipulation of the partner’s character- 
istics was effective. One of the “rating of 
partner” measures which each subject filled 
out after communication with her partner 
consisted of a series of antonyms. Each 
antonym was separated from its opposite by 
a 10-point scale, and 4 of the 18 antonyms 
(“self-centered versus humble,” “personally 
modest versus conceited,” etc.) were de- 
signed to tap perceived egotism of the part- 
ner, Combining each subject’s ratings on these 
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four items, perceived egotism scores were 
derived. An analysis of variance revealed 
that subjects in the high-positive condition 
saw the partner as significantly ($ < .001) 
more egotistical than subjects in the medium- 
positive condition, and these latter subjects 
saw their partner as significantly (p < .001) 
more egotistical than those in the low-posi- 
tive condition. 

The validation of interaction anticipation 
is more difficult to establish. Unfortunately the 
items on the postexperimental questionnaires 
could not be used to make a reasonable as- 
sessment of the effectiveness of this manipula- 
tion, However, in that the major results 
would be difficult to interpret without assum- 
ing that the instructions for the two condi- 
tions were perceived differently, it seems 
safe to assume that the subjects believed 
what they were told. 


Changes in the Positiveness of Self- 
Presentation 


As noted, positiveness of self-presentation 
was measured with a 30-item scale, of which 
15 items were stated in a positive manner 
and 15 in a negative. Although it might be 
expected that changes in the number of 
points assigned to negatively phrased items 
would be a mirror image of changes found 
in the positive items, there seemed to be 
good reason for analyzing the two sets of data 
separately. For example, the most obvious 
way a person might indicate that he feels 


TABLE 1 


ANALYSIS OF VARIANCE OF POSITIVENESS AS EXPRESSED 
on POSITIVE ITEMS 


Source df MS F 
Between subjects 53 | 338.75 
High, medium, low (B) 2 [1,936.15 | 8.13** 
Anticipation versus 
` no anticipation (C) 1 [1,365.33 | 5.73* 
BXC 2 | 562.33 | 2.36 
Error (b) 48 | 238.18 
Within subjects 54 | 131.30 
Before-after (A) 1 [1,776.33 | 30.05*** 
AXB 2 |1,220.11 | 20.64*** 
AC. 1 10.71 
AXBXC 2 12.70 
Error (w) 48 59.11 


*p <.05. 
p> <.01. 
weep < 001. 
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Fic. 1. Mean positive change on positively phrased 
items for all conditions. | 
himself to be a “good person” would be to 
make a direct statement to that effect and 
assign a large number of points to positive 
items. However, this same message can 
communicated on a more subtle level byf 
saying “I don’t feel that I am a bad sort of 
a person,” which would be analogous to 
assigning a small number of points to nega 
tive items. In other words, the two sets of | 
items might be differentially sensitive t0 
varying types of communication. 
Positiveness as expressed on positive items: 
Since the items had been administered twit 
once under relatively neutral and once under 
experimental. conditions, the effects of the 
manipulations could be gauged by comparing 
the two sets of scores. The group means 10! 
change in points assigned to positive item 
appear in Figure 1. A Type III (Lindquist 
1956) analysis of variance of the neutrat 
and experimental scores is summarized M 
Table 1. a 
Turning first to the effects of the various 
partners on subjects’ self-ratings in the no 
anticipation conditions, it can be seen that 
the prediction of a linear relationship is only 
partially verified. The significant A x B intel 
action term in the analysis of variance 
allowed orthogonal comparisons to be made 
between neutral and experimental scores 
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subjects exposed to the three types of part- 
ners. As predicted, subjects interacting with 
an egotisical other showed a significant ($ 


` <.001) increase in the number of points as- 


''the medium-positive partner 


“ness of subjects’ self-ratings slightly, 


signed to positive items. Further, as antici- 
pated, the average partner elicited less posi- 
tive change than the egotist. An orthogonal 
comparison between the experimental-neutral 
differences between the two conditions was 
significant beyond the .001 level. However, 
‘the low-positive partner failed to elicit any 
significant amount of change. à 

Assessing the changes produced by the 
three partners under conditions of anticipated 
interaction, it is seen that the curvilinear 
prediction was almost verified. The egotist 
elicited significant positive change (p< 001), 
h elicited sig- 
nificantly less positive change (p < 01), and 
the humble partner also boosted the positive- 
though 
not significantly. 

The prediction that self-presentation would 
become more positive under conditions of 
anticipated interaction than under condi- 
tions where no further interaction was antici- 
pated receives little support from these data. 
As can be seen, with each of these partners 
the means are in the predicted direction. 
However, the lack of a significant A X C 
interaction in the analysis of variance in- 
dicates that this difference is merely sug- 
gestive. 

Positiveness as expressed on negative items. 
The change means in points assigned to nega- 


TABLE 2 


ANALYSIS OF VARIANCE OF POSITIVENESS AS EXPRESSED 
ON NEGATIVE ITEMS 


Source df MS F 

Between subjects 53 | 492.54 
High, medium, low (B) 2 (1,956.46 | 4.67* 
Anticipation versus 

no anticipation (C) 1 [1,401.12 | 3.34 
BXC 2 | 341.45 
Error (b) 4g | 418.91 
Within subjects 54 99.90 
Before-after (A) 1 7.19 
AXB 2 (1,325.39 |25.30"* 
AXC 1 4.89 
AXBXC 2 | 108.18| 2.06 
*p <.05. 


#ip < 001. 
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Fic. 2. Mean positive change on negatively phrased 
items for all conditions. 


tive items appear in Figure 2. As can be 
seen, the pattern of change detected with 
this measure is appreciably different than 
that found using the positive items. Looking 
first at the changes found under conditions 
of no-anticipated interaction, it is seen that 
the hypothesis of a linear relation between 
partner’s egotism and subject positiveness is 
verified. Orthogonal comparisons based on 
the analysis of variance summarized in Table 
2 indicate that the highly positive partner 
produced a significant (p < 005) decrease 
jn the number of points assigned to negative 
items, the medium-positive partner had sig- 
nificantly less effect on subjects’ self-ratings 
p< .05), and the humble other produced 
a significant ($ < 005) increase in the 
number of points assigned to negative 
phrases. This latter increase was also sig- 
nificantly (p < -05) different from the change 
occurring in the medium-positive condition. 
Turning next to self-rating change under 
conditions of anticipated interaction, a rather 
surprising set of findings emerges. Although 
the highly positive other produced a sig- 
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nificant (p < .001) decrease in the number 
of points assigned to negative items, the 
medium-positive partner had exactly the 
opposite effect. Whereas it was predicted 
that subjects would become slightly defensive 
in this condition and thus decrease the 
number of points assigned to negative items, 
it was found that there was a significant 
(p < .01) increase. In other words, these 
subjects gave greater emphasis to their 
negative characteristics. The low-positive 
condition furnishes an additional surprise. 
Although it was predicted that subjects in 
this condition would decrease the number of 
points assigned to negative items, it is found 
that a significant ( < .01) increase occurred. 
Finally, when changes are compared under 
conditions of anticipated interaction versus 
no anticipation, it is again found that no 
significant differences emerge. Although the 
mean differences are in the expected direc- 
tion in both the high- and low-positive con- 
ditions, Figure 2 reveals that when exposed 
to the medium partner, subjects tend to be- 
come more negative under conditions of 
anticipated versus no-anticipated interaction. 
This difference, however, is just shy of 
reaching statistical significance (p < .10). 


Self-Presentation and Awareness 


The present experiment was based on the 
assumption that subjects in the various con- 
ditions were faced with varying problems of 
impression management. A secondary con- 
cern of this study was with the issue of 
whether the solutions to such problems are 
mediated by conscious awareness or are 
perhaps sufficiently habitual as to take place 
unconsciously. In order to deal with this 
question, two measures of awareness were 
utilized. The first was in the form of a 
question appearing on the postexperimental 
questionnaire. The question indicated that 
no one filled out the answers in the same way 
each time and asked whether the subject felt 
she was more or less positive during the 
communication exchange than she was 
earlier in the year. It was assumed that a 
subject who answered in the affirmative was 
likely to be aware that her self-ratings had 
been affected in some way by the manipula- 
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the way the subject answered a question 
appearing first at the end of the neutral 
testing and again on the postexperimental 
questionnaire. The question asked was: “To 
what extent do you feel that the self-rating 
just completed was truly representative of 
yourself?” and subjects indicated their an- 5 
swers on a 20-point scale. It was felt that 
any difference in these two ratings would be: 
likely to reflect awareness of shifts in self 
ratings from one testing session to the next. 
Combining these two measures, each sub- 
ject was placed into one of three groups. If 
a subject indicated that she was more or 
less positive in her self-ratings in the experi- 
ment than in the neutral session, and also 


tions. The second measure was derived from | 


indicated a difference in feelings of sincerity 
from one session to the next, she was placed 
in the high-awareness group. Any subject who 
demonstrated awareness on one measure but 
not on the other was placed in the medium- 
awareness group. Those who showed aware- 
ness on neither measure were designated low- 
awareness subjects. Because some subjects 
became more positive and others more nega- 
tive, and because positive and negative 
items were utilized differentially in the various 
conditions, the relationship between aware- 
ness and change was assessed by converting 
changes in either direction to absolute change 
scores, There were 13 subjects in the high- 
awareness group, 27 in the medium, and 12 
in the low-awareness group. 

Perhaps the most striking fact revealed 
by this analysis is that there appears to be | 
virtually no relationship between awareness 
and amount of change in self-ratings. The 
mean change for the low-awareness group 
for both positive and negative items is highly 
similar to that of both the medium- and 
high-awareness groups. Statistical analysis of 
these data revealed no significant difference 
in absolute change between any awareness | 
groups. In short, there seems to be no €v! 
dence that the changes in self-presentation 
which occurred in this experiment had to bè | 
mediated by conscious awareness. At least 
one implication of these results is that persons 
can vary markedly in the overt presentation” 
of self and yet feel equally as honest across 
situations. 
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Discussion 


That a person’s public self-evaluations are 
highly dependent on the way other persons 
present themselves and on the context in 
which interaction takes place receives sub- 
stantial support from the above findings. It 
seems clear that persons are highly selective 
concerning the information about self which 
they provide to others, and that such selection 
can be systematically explored in a laboratory 


` setting. The task remains, however, of deal- 


sing with certain irregularities in the data and 
of determining the extent to which the above 
findings can be accounted for by the rationale 
presented in the introduction. 
With regard to the subjects who faced 


. the egotistical other, it was predicted that 


there would be an increase in the positive- 
ness of self-ratings, and that this increase 
would be greater when further interaction 
was anticipated. These predictions were based 
on a power restoration rationale with the 
additional specification that when further 
interaction was anticipated, the need for 
restbring power would be enhanced. The re- 
sults fully confirmed the first of these predic- 
tions. However, with the exception of the 
fact that the means were in the expected 
direction, there was no reliable evidence 
that the variation in interaction anticipation 
had any differential effect. It remains a 
question as to whether such shifts in self- 
presentation can be attributed to the power 
restoration rationale. At least one major 
competing explanation would be that the 
highly positive other was seen as more at- 
tractive and that boosting the attractiveness 
of one’s self-ratings would be a way of 
ingratiating oneself with this person. There 
is some evidence bearing on these explana- 
tions. First of all it will be recalled that all 
subjects made a number of ratings of their 
supposed partner. Conceiving of power as 
one’s ability to drive another person through 
a range of outcomes (Thibaut & Kelley, 
1959), a person who is seen as more power- 
ful would also be seen as less dependent and 
more self-sufficient. Four antonyms on the 
rating list, for example, “independent versus 
dependent,” “reliant on others versus self- 
reliant,” had been included to reflect this 
dimension of perceived dependency. By sum- 
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ming ratings of the partner on these anto- 
nyms, perceived dependency scores were 
derived for all subjects. The means and 
standard deviations for the various conditions 
are contained in Table 3. As can be seen, the 
highly positive partner was seen as more 
independent than the partner in either of 
the remaining conditions. An analysis of 
variance revealed that the difference between 
the high- and medium-positive conditions 
was significant (p < .001). Although by no 
means conclusive, such evidence would at 
least be consistent with the power restoration 
rationale. 

If the social attractiveness of the highly 
positive other were the cause of the subjects’ 
self-enhancement, one would expect this also 
to be reflected in the ratings of the partner. 
The social attractiveness cluster was formed 
by antonyms such as “likable versus not 
particularly likable,” “unattractive person- 
ality versus attractive personality,” etc. The 
means and standard deviations of the com- 
bined social attractiveness scores for the 
various conditions can also be found in Table 
3. There is almost no difference between 
the means found in the high- and medium- 
positive conditions. The evidence thus does 
not seem to indicate that subjects in the 
high-positive conditions changed their rat- 
ings in order to be ingratiating. It should 
also be noted that one could derive the re- 
sults in the high-positive conditions from 
Festinger’s (1954) theory of social compari- 
son. These subjects could have simply been 
attempting to reduce the discrepancy be- 
tween their own and their partner’s self- 
evaluations. However, this theory would be 
of little aid in clarifying the total complex 
of results in the experiment. Findings in both 
the medium- and low-positive conditions are 
quite contrary to a social comparison predic- 
tion. 

Based on a rationale of acquaintance seek- 
ing, it was hypothesized that little self-rating 
change would be produced in the medium- 
positive condition except when interaction 
was anticipated. In this latter instance it was 
predicted that the anticipation group would 
be more positive than the no-anticipation 
subjects. The results proved to be highly 
interesting. With positively phrased items, 
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TABLE 3 
MEANS AND STANDARD DEVIATIONS OF PERCEIVED DEPENDENCY AND SOCIAL ATTRACTIVENESS SCORES 
Anticipation No anticipation 
i; Social 
Dependency PR nA Dependency attractiveness 
PEREN 33.33 32.11 33.66 
SD 5.70 3.45 3.67 
ea 25.11 27.55 25.11 30.00 
SD 6.16 5.73 5.75 6.77 
aaa 14.78 21.78 14.11 26.33 
SD 5.78 4.53 5.09 7.73 


no significant change in self-ratings was 
found either within or between groups. How- 
ever, when the negative items were consid- 
ered it was found that subjects expecting 
future interaction became significantly more 
negative in their self-ratings, whereas sub- 
jects who were shielded by anonymity pre- 
sented essentially the same self-picture as 
they had at the neutral testing. 

This pattern of results raises several is- 
sues, First of all, in order to obtain the re- 
sults which were initially predicted, certain 
preconditions would have to be established. 
For example, subjects in the medium-positive 
no-anticipation condition should have felt 
least constrained in communicating an honest 
self-picture, Meeting someone similar to one- 
self in a relationship with no future, it was 
felt, would give rise to the feeling of freedom 
to reveal one’s “true self.” Evidence bearing 
on this issue came from questions on the 
Postexperimental questionnaire which dealt 
with the subjects’ feelings of ease in com- 
municating with their partner, On a 7-point 
scale subjects rated how much their partner 
“caused me to want to tell her about myself” 
and how difficult it was to communicate with 
their partner. On both of these measures sub- 
jects in this condition indicated greater ease 
of communication than subjects in any of 
the other experimental conditions, A com- 
Parison of the mean for this condition with 
the combined means for the remainder of 
the conditions was just shy of Teaching sig- 
nificance (p < .10) for both questions, Given 
that subjects in this condition did feel fewer 
‘constraints in communicating, vis-à-vis the 


fact that these subjects were the only ones in 
the experiment who did not alter their rat- 
ings significantly, what conclusion might be 
drawn? The most likely conclusion would 
seem to be that subjects’ ratings in the neu- 
tral or group testing session represented a 
rather honest self-appraisal (although many 
have been led to doubt that students give 
an honest self-picture under such circum- 
stances). Subjects who subsequently found 
themselves in a situation which allowed hon- 
esty, that is, the medium-positive no-antici- 
pation condition, simply repeated the self- 
ratings made in the earlier circumstance. 
The question remains, however, as to why 
the subjects meeting someone similar to 
themselves with whom they expected future 
interaction became more self-derogating. An 
examination of the possibility that the mean 
tatings obtained in the neutral situation 
from these subjects might have differed by 
chance from the points assigned to these 
items by the partner revealed no evidence of 
artifact, Self-evaluation change scores (nega- 
tive items) were also correlated with each of 
the ratings of the partner made by subjects 
in this condition. The only significant cor- 
relation emerging was dramatic, Self-rating 
ge in the negative direction in this con- 
dition was correlated .66 (p < .005) with 
ratings of the partner on the egotism-humil- 
ity dimension. In other words, the more self- 
centered, egotistical, etc., the partner was 
seen, the more the subjects altered their 
self-ratings in a negative direction., Further- 
more, this correlation is actually in the nega- 


tive direction, though not significant, in the 


~ 
K 


" medium-positive no-anticipation condition. 
l This finding raises interesting possibilities. 


" First of all it is important to notice that 


both ends of the egotism rating (see Results 
section) tend to have a pejorative flavor. A 
high score would indicate that the partner 


"was seen as self-centered, etc, and a low 


score would mean that the partner was seen 
as having feelings of inferiority, etc. It will 


d also be recalled, subjects in the medium- 


positive condition used neither extreme of 
the scale in rating their partner. The impli- 
cation would seem to be that when another 
person is seen to be neither too self-centered 
nor too self-derogating, and future interac- 
tion is anticipated, the tendency is to com- 


* plement the behavior of this other person. If 


the other is seen to be rather positive about 
self, the resulting tendency is to be slightly 
self-effacing and vice versa. Such a finding 
raises an interesting question vis-a-vis earlier 
research showing that dyads in which the 
members’ personalities complement each 
other tend to be compatible (cf. Schutz, 
1958; Winch, 1955). Such studies have gen- 
erally assumed that complementarity can be 
determined by an assessment of the pre- 
established personalities of the prospective 
members. The present study raises the possi- 
bility that when acquaintance seeking is a 
prominent issue in a dyad, the members may 
alter their appearance in order to comple- 
ment each other. 

In the low-positive conditions, it was rea- 
soned that when no interaction was antici- 
pated, subjects would be free to express 
sympathy and that such an expression would 
take the form of self-derogation. However, 
when interaction is anticipated, avoidance 
of the relationship would become more sali- 
ent, and subjects should tend to become more 
self-enhancing. The findings, however, pre- 
sented a somewhat more complicated picture. 
When no further interaction was expected 
subjects did indeed become more self-dero- 
gating on the negatively phrased items. Con- 
trary to expectations, the anticipation-condi- 
tion subjects demonstrated a similar tend- 
ency, On positively phrased items, however, 
neither group showed any significant change. 

Before interpreting this pattern of results, 
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it is first useful to determine whether the 
humble partner was felt to be more pitiful, 
more dependent, and more to be avoided. 
Data were available concerning all of these 
dimensions. First of all, subjects were asked 
to rate on a 7-point scale the degree they 
felt their partner should be pitied. Subjects 
interacting with the humble other rated her 
as significantly (p < .001) more to be pitied 
than subjects dealing with either of the other 
two partner types. Regarding dependency, it 
will be recalled that on the perceived de- 
pendency measure discussed above (see Ta- 
ble 3) the low-positive partner was seen as 
most dependent. A comparison of difference 
between the medium- and low-positive con- 
ditions was significant beyond the .001 level. 
Finally, evidence for avoidance of the self- 
derogating other would seem to be indicated 
by the social attractiveness ratings found in 
Table 3. An analysis of variance revealed 
that the low-positive partner was seen as less 
socially attractive ( < .05) than either of 
the other two partner types. It is also the 
case that for the low-positive conditions at- 
tractiveness and dependency are negatively 
related; this correlation is significant beyond 
the .05 level in the anticipation condition. 
In understanding the results it should first 
be noted that, as in the high-positive condi- 
tions, the manipulation of anticipated inter- 
action produced no significant effects. Con- 
sidering the extremity of the self-ratings 
made by the high- and low-positive partners, 
it seems quite likely that the effectiveness of 
this manipulation was all but obscured, Since 
the data in both of the conditions are highly 
suggestive, a more powerful manipulation of 
this variable seems in order in further work. 
If it is assumed that the two low-positive 
groups reacted in much the same manner, 
how might the present results be interpreted? 
First of all, if the rationale is correct it would 
seem to be the case that feelings of pity for 
the partner elicited the increased emphasis of 
personal shortcomings. Further support for 
this notion comes from the correlation be- 
tween self-rating change and ratings of pity. 
Combining both groups, the correlation is .53 
(p < .025), indicating that those who saw 
the partner as more to be pitied revealed 
more personal shortcomings. One might thus 
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suspect that subjects may have used the 
negatively phrased items to express pity and 
the positively phrased items to avoid the 
partner’s dependency. However, the relevant 
correlation between social attractiveness 
ratings and change on positive items was in 
the predicted direction but not significant 
(r= —.16). 

Tt should finally be asked why positively 
and negatively phrased items should be used 
differentially, both in these conditions as well 
as in the medium-positive anticipation condi- 
tion. At least one explanation would be that 
the differential usage depends entirely on 
the fact that the positive items dealt with 
different content areas than the negatively 
phrased items. In order to assess whether 
the experimental effects were dependent on 
specific item content, a Kuder-Richardson 
reliability test was conducted on the change 
scores for both groups of items. The result- 
ing reliability coefficients were within ac- 
ceptable standards (.85 for positive items and 
.77 for negative items), and indicate that the 
effects of the manipulations were general in 
Scope rather than dependent on specific item 
content, 

A more important source of explanation 
for differential item usage in these conditions 
would seem to have to do with the subjec- 
tive meaning attached to the use of positive 
and negative phrases about the self, In the 
medium-positive anticipation condition a 
paraphrase of a subject’s thought might be, 
“PII show you I’m human by revealing some 
of my negative attributes, but I will retain 
my positive characteristics in order to show 
you that I am attractive.” In the low-positive 
conditions, such a Paraphrase might be, “I 
feel sorry for you so T will show you that I 
too have difficulties, but T will separate my- 
self from you by showing you that I have 
many good attributes,” In that the Tesults for 
the „two low-positive conditions and the 
medium. positive anticipation condition were 
all similar, this approach would suggest an 
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interesting possibility for further research: 
depending on the social context, similar pres- 
entations may be used to communicate quite 
discrepant information about the self. 
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SOME ASPECTS OF OBSERVED AGGRESSION * 


LEONARD BERKOWITZ 


University of Wisconsin 


In 3 experiments Ss, male college students, were either angered or treated in a 
neutral fashion by a person who had been labeled either as a college boxer or 
a speech major, and they were then shown a short film. Exps. I and II were 
designed to test the thesis that the anger instigator would evoke the strongest 
overt hostility from the frustrated Ss when he had been introduced as a boxer 
and they had seen a prize-fight film; by associating this person with the ag- 
gressive scene, the “boxer” label had presumably heightened his cue value for 
aggressive responses. Exp. I, employing questionnaire ratings as the hostility 
measure, confirmed this theoretical expectation. In Exp. II, using electric shocks 
as the aggressive response, however, there was also an indication that the label 
“boxer” could have strengthened the person’s cue value for aggression regard- 
less of the nature of the film witnessed by Ss. The findings in Exp. III 
confirmed the results obtained in earlier investigations by showing that the 
angered Ss’ inhibitions against aggression varied with the apparent justification 


for the observed aggression. 


The widespread publication of news stories 
about murders often produces additional 
horrors. People having no actual connection 
with the tragedy are sometimes threatened 
or even attacked by others who only read 
or heard about the killing. Why does the 
publicized violence lead so often to still more 
aggression? What effects do the news stories 
have upon their readers? 

Research into the consequences of movie 
violence can help answer such questions. Ac- 
cording to these investigations under certain 
conditions observed aggression apparently 
serves as a cue activating previously acquired 
aggressiveness habits (cf. Berkowitz, 1962, 
Ch. 9). This is suggested, for example, by 
the findings in a study by Walters, Thomas, 


` and Acker (1962). Male adults who had wit- 


nessed a filmed knife-fight scene generally 
administered more severe punishment to a 
nearby person soon afterwards (by means of 
electric shocks) than did comparable people 
who had seen a more innocuous movie. 

But there may be other reactions as well. 


1The research reported here was conducted under 
Grant M1540, from the National Institute of Men- 
tal Health, United States Public Health Service, 
and Grant G-23988, from the National Science 
Foundation, I would like to acknowledge my in- 
debtedness to Thomas Creer, Frederick DeBoer, and 
Mark Heironimus who served superbly as the ex- 
Perimenters and/or confederates. 


If the audience regards the depicted aggres- 
sion as being unwarranted or morally wrong, 
inhibitions will also be aroused. These re- 
straints will weaken the strength of the re- 
sulting aggressive responses, or may even 
cause the audience members to “Jean over 
backwards” to avoid displaying any aggres- 
sion at all. (Many people throughout the 
country exhibited this latter type of reaction 
to the news of President Kennedy’s death 
when they condemned extremists and hate 
mongers.) In two experiments (Berkowitz, 
Corwin, & Heironimus, 1963; Berkowitz & 
Rawlings, 1963) deliberately provoked men 
who then saw a case of “justified” aggres- 
sion—in which a film villain received a beat- 
ing—directed stronger hostility toward the 
person who had tormented them than did 
similarly frustrated men who had witnessed 
“Jess justified” fantasy aggression, Events 
portrayed in the mass media evidently can 
influence the viewers’ inhibitions against 
their own aggressive desires, momentarily at 
least, perhaps through altering their judg- 
ment of the propriety of aggressive actions 
for this period. 

There is some question, however, as to 
just what is the role of the observed hostility 
(cf, Bandura & Walters, 1963). Berkowitz 
(1962, 1964) stresses both the eliciting func- 
tion of the observed action as well as dis- 
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inhibitory effects. He suggests that depicted 
aggression increases the probability of later 
hostility as a direct function of such factors 
as: the strength of the observer’s aggressive- 
ness habits; the degree of association between 
the witnessed situation and the situations in 
which the hostile habits were learned, and 
the later setting; and the intensity of the 
guilt and/or aggression-anxiety also aroused 
by the observed violence (1962, p. 238). In 
more concrete terms, we might say that the 
aggressiveness habits activated by witnessed 
hostility may be only in “low gear,” so to 
speak. Other appropriate, aggression-evoking 
cues must be present before the witnessed 
violence can lead to strong aggressive re- 
sponses by the observer. These cues are 
stimuli in the postobservation situation 
which have some association with the de- 
picted event. Thus, a person who sees a 
brutal fight may not himself display any 
detectable aggression immediately after- 
wards, even if his inhibitions are relatively 
weak, unless he encounters stimuli having 
some connection with the fight. 

Applying this theoretical argument to the 
problem of observed aggression, we would 
certainly expect recently angered and non- 
restrained people to display comparatively 
strong hostility toward their frustrater; his 
presence (if only in their thoughts) serves as 
a cue evoking aggressive responses from them 
(Berkowitz, 1964). But those people who 
had seen a noninhibiting, violent movie hay- 
ing some association with their tormentor 
should direct stronger aggression against him 
than similarly provoked people who were 
also shown the aggressive movie but who 
did not connect the film with their frustrater, 
or who saw a nonarousing, neutral film, 
Observed violence as well as frustrations can 
create an aggressive predisposition, Objects 
having an association with the sources of 
both predispositions should evoke stronger 
aggressive responses than objects which are 
connected with only one of these sources, 

t This analysis does not rule out the possi- 
bility of a hostility catharsis, but it does 
restrict the scope of this notion. The an- 
gered person often has a very specific goal: 
to inflict injury on his frustrater. Attacks 
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on innocent bystanders whom he does not 
connect with his tormentor or observing 
others commit acts of aggression presumably 
will not represent the attainment of this 
specific aggressive goal. The angered indi- 
vidual achieves gratifications through per- 
forming quite specific aggressive actions. But 
more than this, a person possessing an acti- 
vated aggressive response sequence will be 
frustrated and will experience tension if there 
is a delay in attacking the source of his 
arousal. Later, when he does have an op- 
portunity to aggress, he may feel a pleasing 
tension reduction (cf. Berkowitz, 1964). 

But, suppose someone has been angered 
and then sees an aggressive movie which has 
entirely no association with his tormentor. 
Giving him a socially sanctioned opportun- 
ity to hurt his frustrater should not be com- 
pletely satisfying (in the sense of producing 
a substantial tension reduction). While he 
may “complete” the anger arousal-injury 
sequence, there should still be some addi- 
tional discomfort because the aggressive re- 
sponse chains that had been activated by the 
hostile movie are still in operation. Goal re- 
sponses for these movie-instigated aggressive 
sequences had not been performed; there 
were no attacks on people associated with the 
aggressive movie, Putting this in other words, 
the absence of a suitable external aggression- 
evoking cue (objects associated with the 
movie) means that the injurious goal re- 
Sponses completing the movie-instigated ag- 
gressive response chain have not been elic- 
ited, at least in sufficient strength. The ag- 
gressive response sequence remains in opera- 
tion to produce the felt discomfort. 

The present paper is a report of three 
experiments that were designed to test the 
above formulation. Experiment I was a pilot 
experiment inquiring whether the available 
target’s association with an aggressive film 
scene affected the amount of hostility that 
was subsequently directed against him. Es- 
sentially the same question was asked in 
Experiment II which employed a different, 
ana more direct, measure of aggressive be- 
havior. Experiment III was concerned with 
an alternative explanation of the findings 
obtained in Experiments I and TI. 


| 


OBSERVED AGGRESSION 


r EXPERIMENT I? 


._. There are a number of ways in which the 
observed aggressive event may be connected 
4 psychologically with people who are not ac- 
tually involved in the scene. In the research 
reported here this association is varied by 
means of verbal labels applied to a person 
. the subject will encounter after he sees an 
aggressive movie. These labels place the 
+ stimulus person either in the same role cate- 
] 
k: 


i 


‘gory or in a different one from that occupied 
by the protagonist in the observed event. ` 


Method 


Subjects. The subjects were male college students 
who were recruited from a number of classes out- 
side of the psychology department for a study of 

~-"“Dhysiological reactions to various stimuli,” and 

were paid for the time they served. The sample 
consists of 42 men divided evenly among six experi- 

„ mental conditions. 

Experimental conditions. The subjects were sched- 

y uled singly. When each person arrived in the labora- 
tory he was met by one of the experimenters (the 
“neutral experimenter”) and was told that the ses- 
sion would involve two separate and unrelated 
studies which were being carried out simultaneously. 
He, MH, was interested in studying physiological 
reactions, while the other experimenter was (sup- 
posedly) concerned with student attitudes. Just 
what attitudes this other experimenter was said to 
be interested in, however, varied with the condi- 
tions. 

In three of the six conditions (boxer-role treat- 
ment) the other experimenter was said to be a 
graduate student in physical education and a “for- 
mer college boxer” who was interested in student 
attitudes toward prize fighting. The remaining sub- 
jects were informed that the other experimenter was 
+ a graduate student in speech who wanted to obtain 

‘student reactions to Hollywood movies (speech- 
major-role condition). MH, the neutral experi- 
menter, took a preliminary blood pressure reading, 
introduced the second experimenter (FD), who had 
Just entered, and then left the room. 

FD, the “critical experimenter,” administered a 
brief questionnaire whose content was appropriate 
to his supposed area of concern, For two-thirds of 
the subjects he acted in a patronizing and insulting 
manner as the student filled out the form (angered 

' condition), while the remaining men were treated 
_, Ù a neutral fashion (nonangered condition). 

When the subject had finished answering the 
questions the critical experimenter left, and MH 
— 

? This preliminary investigation was carried out 
under Grant M1540 (see Footnote 1). Mark Hei- 
ronimus was the “neutral” experimenter and Fred- 
erick DeBoer served as the “critical” experimenter. 
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returned to obtain another measure of the subject’s 
blood pressure level. Following this, the neutral 
experimenter said he would now show the subject 
a brief film. Half of the angered men and all of the 
nonprovoked subjects were informed they would 
see an excerpt from the movie (about a prize- 
fighter), Champion (aggressive-film condition). To 
help them understand the scene better, they were 
told the experimenter would give them a synopsis 
of the plot. The story outline used was the “justi- 
fied fantasy aggression” context which, according to 
previous research in this program (Berkowitz et al., 
1963; Berkowitz & Rawlings, 1963), serves to 
lower inhibitions against aggressive responses. The 
other half of the subjects in the angered condition 
saw a clip from a movie about the travels of Marco 
Polo (neutral-film condition). Both scenes lasted 
approximately 7 minutes. 

As soon as the film was over, the neutral experi- 
menter asked the subject to fill out a short ques- 
tionnaire. He said the “Dean’s office” wanted to 
assess students’ reactions to the research that was 
being conducted on the campus. The subject was to 
evaluate MH and FD on separate forms as part of 
this survey of student opinions. The forms (which 
did not ask for the subject’s name) were to be 
placed in sealed envelopes and sent to the Dean’s 
office. The experimenter assured the subject that his 
responses would be kept confidential and that only 
the Dean’s office would know what he had said. 

When the ratings were made the experimenter 
explained the deceptions and procedures that had 
been employed and the purpose of the study. The 
subject was then asked not to talk about the ex- 
periment for the remainder of the semester and was 
dismissed. 

Aggression measures. The dependent variables 
employed in this experiment were the ratings made 
of each experimenter on two items in the final 
questionnaire. One item read, “I would like to have 
Mr. _____. [either MH’s or FD’s name was pen- 
ciled in on the line provided] for a quiz instructor,” 
while the second item read, “If I were going to be 
in another experiment, I would like to serve with 
Mr. _____.” For both questions the subject was to 
make a mark somewhere on a 5-inch line, with 
both lines being anchored by the phrases “Not at 
all” and “Very much.” (The “good” end of the 
scale was rotated.) Since unfavorable ratings con- 
ceivably might hurt the experimenters’ graduate 
careers, poor ratings were here taken as expressions 
of hostility. Scores on the two items were summed 
together to yield a single measure of the intensity 
of the aggression directed against each experimenter. 


Results 


An analysis of variance of the aggression 
scores yielded a significant interaction of 
Experimental Conditions x Experimenter (F 
= 4.55, df = 5/36, p < 01). Table 1 shows 
the results of the Duncan multiple range . 
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TABLE 1 
MEAN SCORES ON THE QUESTIONNAIRE MEASURES OF HOSTILITY TOWARD THE Two EXPERIMENTERS 


Ratings of Angered by critical experimenter Nonangered 
Boxing film Neutral film Boxing film 
Critical experimenter’s role: Boxer Speech major Boxer Speech mayor Boxer | Speech major 
af ; z 00 10.71, 
critical experimenter 21.42, 15.29 bea 16,00p. 17.43, 11.00, $ 
Neutral Ene areal 9.145 10.10, 11.716 10.43, 12:14.ag 


— i perimenters were subjected to one “repeated measures” analysis of variance. Cells having a 
Roa ‘in Lhe ratinus or ethanty different at the .05 level. The higher the score the greater the expressed unfriendliness 


toward the given experimenter, Each mean is based on 7 cases, 


test of the differences among the 12 means. 
As can be seen in the table, the theoretical 
expectation is confirmed. The provoked sub- 
jects who were shown the aggressive film di- 
rected the strongest aggression against their 
tormentor when this individual was associ- 
ated with the aggressive film by means of 
the verbal label, “boxer.” The importance 
of the available target’s connection with the 
aggression-instigating scene is also suggested 
by the data for the neutral experimenter. 
Although none of the differences are statisti- 
cally significant, the neutral person was 
rated most unfavorably by the subjects in 
the angered-boxing film-boxer role condition. 
For these men there presumably was some 
indirect association between the neutral ex- 
perimenter and both instigational sources: 
the critical experimenter and the aggres- 
sive film. Thus, this person had the stimulus 
properties enabling him to evoke relatively 
intense aggressive responses from them. 
One final observation. We might also note 
that the verbal label, boxer, applied to the 
critical experimenter had no discriminable 
effect when the men had been angered but 
had been shown the neutral film, or, they 
had not been provoked. In other words, the 
critical experimenter as a boxer did not elici 
a significant amount of aggression from the 
subjects witnessing the boxing movie when 
these men had not been angered. The aggres- 
sive scene in itself may have been only a 
weak activator of aggressive responses, Any 
aggressiveness habits set into motion by the 
film were too weak to lead to overt hostility 
unless the men were predisposed to act ag- 
gressively because of their anger arousal. 


EXPERIMENT II’ 


Because of the success of the preliminary 
experiment, a second and larger study was 
conducted. This investigation, employing a 
direct measure of aggression instead of ques- 
tionnaire ratings, followed essentially the 
same procedure as that used in Experiment 
I, but this time in a complete 2 x 2 x2 
factorial design, 


Method 


Subjects. There are 11 men in each of thé eight 
conditions in this experiment. Unlike the earlier 
study, all of the subjects were recruited from in- 
troductory classes in psychology, and all volun- 
teered in order to earn points counting toward 
their final course grade. 

Experimental conditions. As before, the subjects 
were scheduled singly for the “study of physio- 
logical reactions to various’ tasks.” When each sub- 
ject arrived, the experimenter explained the (05 
tensible) purpose of the investigation and then 
took the first blood pressure reading. There was 
another individual also in the room, supposedly 
another subject but actually the experimenter’s 
confederate. The experimenter asked each person t0 
introduce himself, turning first to the subject and 
then to the confederate. The remarks made by the 
confederate varied with the experimental condition. 
In the boxer-role treatment the confederate said he 
was a physical education major who was interested 
in college boxing, while in the other cases he said 
he „was majoring in speech (speech-major-role con- 
dition). A brief interchange between the experi- 
menter and the confederate reinforced the confed- 
erate’s self-designation. 


Their first task, the experimenter said, was t0 | 
work for 5 minutes on a brief intelligence test. He 
work for 5 l 


This experiment as well as the following 07° 
were conducted under Grant G-23988 (see Foot- 
note 1). Frederick DeBoer was the experimenter in 
both of these investigations, while Thomas Cree 
was the confederate, 
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gave each man the necessary materials and then 
left the room. While he was gone the confederate 
“either angered the subject by disparaging his per- 
- formance and questioning his motivation for being 
in the study (angered condition), or acted in a 
neutral fashion toward him (nonangered group). 
Following this, the experimenter returned to take 
a second blood pressure measure from each person. 
He then introduced the next situation: The two 
people would watch a short film clip. Half of the 
subjects were told they would see a scene from the 
* movie, Champion. These people were shown the 
‘same scene used in the aggessive-film condition in 
Experiment I, and again, after they had been 
fiven the “justified aggression” plot summary. The 
remaining subjects witnessed an equally long film 
clip about English canal boats‘ (neutral-film con- 
dition), When the 7-minute movie ended the experi- 
menter took another set of blood pressure readings 
from the two men, and administered a brief ques- 
„tionnaire. assessing reactions to the film. 
Immediately after this the experimenter provided 
the subject with a socially sanctioned opportunity 
to aggress against the experimenter’s confederate. 
» He informed the two men that they now would 
have two different tasks. One of them (and the 
experimenter indicated this would be the confed- 
erate) would have to design “an original and imag- 
inative floor plan for a house” knowing that he 
would, receive a number of electric shocks if he did 
not do a good job. This was a stressful situation, 
_ the experimenter explained, and he would measure 
the physiological reactions to the stress. The experi- 
menter and the confederate then left, supposedly 
to go to another room where the confederate would 
work on his assigned job. About 2.5 minutes later 
the experimenter returned with a floor plan (ac- 
tually constructed previously to be standard for all 
Subjects) and asked the subject to evaluate its 
imaginativeness. He was to give the other person 
a shock by pressing a nearby telegraph key if he 
thought the plan was very good and more than one 
Shock if it was bad. Ten shocks, the maximum that 
1m 
*The neutral film used in the earlier study was 
Not available at this time. 
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could be administered, meant “very bad.” The sub- 
ject was assured that the shocks would not do any 
physical harm. In order to lower any restraints felt 
by the subject because of his presence, the experi- 
menter left the room just as the subject was about 
to begin giving the shocks. 

As soon as his recording instrument in the next 
room told him that no more shocks were being 
transmitted, the experimenter returned to the sub- 
ject, obtained another blood pressure reading, and 
then administered the final questionnaire, ostensibly 
because the subject’s physiological reactions might 
have been affected by his mood and attitudes. When 
this last form was completed the experimenter 
explained the deceptions that had been practiced 
and the reasons for them. Finally, as a test of the 
effectiveness of the labels applied to the confed- 
erate, the subjects were asked to indicate on an- 
other short questionnaire to what extent they had 
thought of the confederate as a physical education 
major and college boxer or as a speech major during 
the experiment. 

Dependent variables. The major dependent varia- 
bles had to do with the volume of aggression di- 
rected against the experimenter’s confederate. The 
number and duration (in thousandths of a minute) 
of the shocks given by each subject were electrically 
recorded in the next room. Since these scores turned 
out to be severely skewed, a vx+Vx-+1 trans- 
formation was employed to facilitate the statistical 
analysis. 

The questions in the Movie Questionnaire admin- 
istered immediately after the aggressive film clip 
was shown were primarily intended to test whether 
the “justified fantasy aggression” manipulation had 
worked in the aggressive-film conditions. But in 
addition, two items given to all groups sought to 
determine how much the subject had enjoyed the 
movie scene. The Final Questionnaire, completed 
just after the subject gave the confederate electric 
shocks, assessed the subject’s residual attitudes 
toward the confederate and the experiment. For all 
items the subject was to place a check mark at the 
appropriate place on a linear rating scale approxi- 
mately 5 inches long and marked off in quarter-inch 
units. 


TABLE 2 


Reportep Extent To Wicu SuByects Hap THOUGHT OF CONFEDERATE AS EITHER BOXER OR 
SPEECH Major, Experment IT 


Angered Nonangered 
. Thought of confederate Boxer role Speech-major role Boxer role Speech-major role 
Ae Aes bgorenive Neve Bese areal Axarcmstye ST 
tigate ue | a ie | e | Sake | Ra e | n 


Ce Wols—The higher the score the more the 


ing a subscript in common are not sign 


jects said they had thought of the confederate as occupying the designated role. 
antiy different at the .05 level. Each mean is based on 11 cases. 
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TABLE 3 
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MEAN VOLUME OF AGGRESSION AND MANIFESTATIONS OF TENSION IN EXPERIMENT IT 


Angered. Nonangered 
Boxer role Speech-major role Boxer role 
i i ggressive al 
pa eee (tin || ine | neem Naa | ae? | ae 
Tan 
pier earned ABTrtol 3.92. | 4160 
5.35, 4.94, 4.90, 4.78, 4.42abo „3 Tabo 2a t 

2. eto 16.564 11.47, 11.90, 11.16y¢| 10.15,. | 12.04, 6.67. 8.15 + 
Attraction toward experiment $ $ 5968 

£ vi ‘iment 4.9. 5.65, 5.54oa 4.05, 3.95 5.374. 3.57. 6 

2 “My fiends ee S76, | oa] esos 5.29} 438, | 5.90 | 5.0%, | 6.75, 


j = FI i yses of variance were 
— been subjected to a Vz + Vx +1 transformation. Separate analyses o ar 
CS Anedda Calis asias subscript in common are not significantly different by Duncan multiple range test at the 
.05 level. Each mean is based on 11 cases, A high score is always "bad. 


Results 


Effectiveness of labeling. First, did the 
subjects regard the confederate as a college 
boxer in the boxer-role groups, and as a 
speech major in the other conditions? Re- 
sponses to the relevant questionnaire items 
—administered after the subjects knew the 
nature of the deceptions, it will be recalled— 
Suggest that the labeling had indeed been 
effective. The page given to the subject was 
headed by the phrase, “To what extent did 
you think of [the confederate] during the 
experiment as each of the following .. . .” 
One item was “as a college boxer,” while 
another was “as a student majoring in 
speech,” 

The Duncan multiple range tests of the 
condition means are summarized in Table 2. 
Assuming the subjects’ ratings reflect how 
they had actually thought of the confederate, 
it is clear from the table that the labeling 
had generally created the intended role per- 
ceptions, Closer inspection of the Means sug- 
gests, furthermore, that in answering the 
items the subjects were doing more than just 
repeating the information given them. Note 
that the subjects who had been angered by 
the confederate had thought of him more as 
a boxer and less as a speech major than the 
nonprovoked subjects, regardless of the role 
attributed to him. The recollection of the 
confederate’s role apparently depended 
somewhat upon his behavior, His insulting 
action Presumably was compatible with be. 


x , 
| Speech-major role 
' 


ing a college boxer and not with being a= 


speech major and, thus, heightened the im- 
pact of the boxer label, but weakened the 
effect of the speech-major designation. If 
this was the case, the labeling may well have 
been salient for the subjects during the course 
of the experiment as well as at the end of 
the session, 

Aggressive behavior. The top of Table 3 
summarizes the number and duration of the 
electric shocks given to the confederate in 
each condition. On first glance the data ap- 
pear to confirm the theoretical expectations. 
Looking first at the angered conditions, the 
results parallel those obtained in the pre- 
liminary study, Neither the film alone nor 
the label applied to the confederate alone 
had influenced the strength of the aggression 
directed against this frustrater. This insult- 
ing person received the strongest attacks 
when the men had seen the aggressive movie 
and he was associated with the boxing film 
by being labeled a “boxer.” This label ap- 
parently caused him to evoke additional 
aggressive responses from those subjects who 
had witnessed the violent scene and who, 
because of their anger, were “primed” to act 
aggressively, 


As encouraging as these results are, how- — 


ever, some caution must be introduced be- 
cause of the findings in the nonangered 
groups. Here in both film conditions the 
identification of the confederate as a college 
boxer had affected the strength of the at- 


tacks upon him. He received more and longe! 


5 


| 


> 
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shocks when he was labeled as a boxer 


s rather than as a speech major—regardless of 


<= 


> 


the nature of the film. There are at least two 
possible explanations for this finding. For 
one thing, the boxer’s associaton with fight- 
ing could have increased his cue value for 
aggressive responses. Then too, the subjects 
could have had a negative attitude toward 
boxers in general. The confederate could 
have been disliked to some extent because he 


‘supposedly was a boxer, and this unpleasant 


quality could have caused him to pull ag- 
gression from the men (Berkowitz & Green, 
1962). We do not know, of course, to what 
degree this conjectured general aggression 
cue value had affected the hostility exhibited 


by the angered subjects. However, since the 


insulted people who had seen the neutral film 
gave reliably fewer and shorter shocks to the 


_ boxer than did the provoked boxing film 


group, reactions to the label boxer alone can- 
not entirely account for the findings in the 
angered condition; the nature of the wit- 
nessed scene was also important. 
Manifestations of tension. We now come to 
a crucial question: Just what was the effect 
of the observed event? Did it lower inhibi- 
tions, elicit aggressive responses, or do both? 
The questionnaire measures of how much the 
subjects reported having enjoyed the experi- 
ment can provide an answer. In an earlier 
study (Berkowitz et al., 1963) the writer 
found that angered men with strong re- 
Straints against aggression expressed more 
unfavorable attitudes toward the experiment 


than did similarly provoked subjects having 


weaker inhibitions against aggression. The 
strong internal restraints apparently had 
Created internal tension, or at least had inter- 
feted with the achievement of a pleasant 
tension reduction. We can formulate a more 
general proposition. If the presence of 
4ggression-evoking cues gives rise to an 


-activated aggressive response sequence, an 


_ inability to complete this sequence by in- 


flicting sufficient injury on the aggression 
target will lead to feelings of tension and, 
Possibly, to indirect expression of hostility 
(cf, Berkowitz, 1964). 

M In the present experiment the two items 
ttectly assessing degree of liking for the 
Movie showed only that the prize-fight scene 
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was significantly more enjoyable than the 
canal-boat film. However, the questions deal- 
ing with the subjects’ attitude toward the 
experiment yielded more interesting results. 
They suggest that the aggressive film had 
indeed elicited aggressive responses for the 
angered men. These data are summarized in 
the bottom half of Table 3. The first item 
contained the statement, “I found the ex- 
periment conducted by [the experimenter’s 
name]... ,” while the second item read, 
“My friends would be likely to find the 
experiment conducted by [the experimenter’s 
name] . . . .” For both items the rating line 
was anchored by the phrases “Not at all 
interesting” and “Very interesting,” with the 
scale ends for these phrases being varied. 
In both cases the score was the number of 
quarter-inch units towards the “no interest” 
end of the scale. 

Beginning with the nonangered condition, 
we can see that the subjects shown the prize- 
fight film found the experiment more inter- 
esting than the subjects shown the neutral, 
canal-boat movie. This was also true for the 
men who had been angered by the confed- 
erate in the boxer role, In the case of the 
angered subjects confronted by the speech 
major, however, there is a reversal to this 
pattern. People witnessing the aggressive 
scene reported their experience as being sig- 
nificantly Zess enjoyable than did the compa- 
rable subjects shown the neutral film. 

It may be that the low level of interest 
expressed by the men in the angered-speech 
major-aggressive film condition reflects a 
high level of felt tension. The aggressive 
scene could have instigated fairly strong 
internal aggressive responses within these 
men because the insulting treatment they 
had received had created a readiness for 
aggression. However, these internal responses 
had not culminated in open attacks on the 
confederate. The speech major was not 
strongly enough associated with the source 
of this particular instigation (the film) to 
evoke a high volume of open aggression. 
Internal aggressive response sequences had 
been put into operation, so to speak, by 
the aggressive scene, but since there had not 
been an appropriate cue object which could 
elicit strong attacks, a sufficient number of 
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goal responses had not been made, that is, 
completion had not been attained. Conse- 
quently, the men felt some persisting dis- 
comfort, blamed it on the experiment, and 
reported the study as being relatively 
uninteresting. 

In the case of the speech major-angered 
men who had seen the neutral film, on the 
other hand, a much higher proportion of 
their activated aggressive response sequences 
had presumably achieved completion. Ag- 
gressive sequences had been instigated in 
them only by giving them the opportunity 
to attack their tormentor. When they took 
advantage of this opportunity and inflicted 
injury on the confederate they performed the 
aggressive goal responses terminating these 
instigated response chains, and experienced a 
pleasing tension reduction.* 

To summarize, I have suggested that both 
the aggressive film scene and the boxer label 
had served as cues evoking relatively strong 
implicit aggressive response chains in the 
angered men. Nevertheless, the strong at- 
tacks upón the confederate in the angered- 
boxer-aggressive film group may also be 
traced, at least in part, to the association 
of the boxer with the prize-fight scene, 


EXPERIMENT III 


In addition to eliciting aggressive reac- 
tions, the aggressive film could well have 
weakened the subjects’ restraints against the 
display of hostility, As mentioned above, the 
plot summary used in introducing the prize- 
fight scene had been shown in earlier research 


to lessen inhibitions against aggression. 
Experiment III sought to determine prima- 


rily whether the justified aggression context 


ë This line of reasoning in essence assumes that a 
thwarting, a situation in which a specific response 
pattern is activated but cannot reach completion, 
creates a heightened arousal state. Such an argu- 
ment, of course, is similar to many recent discus- 
sions of a frustration-induced drive state, 

êIt is unlikely that the greater discomfort ex- 
pressed by the neutral-film men who had been pro- 
voked by the boxer stems from fear of this person, 
Considering the angered people who had watched 
the prize-fight scene, those dealing with the boxer 
expressed Jess tension than the subjects confronted 
by the speech major, even though the former 
subjects had given more shocks, 
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had had a similar disinhibiting effect on t 
electric shocks, 


Method 


Forty-four additional men were recruited the 
semester following the same procedure empli 
in Experiment II. The experimental method 
identical to that used in the early study with 
exception: All subjects were shown the prize- 
scene, and all were given the less justified fant 
aggression story introduction which depicted 
movie protagonist (who was to receive a bes 
in the scene) in a relatively sympathetic light, 
data obtained from these 44 subjects were 
pared with the data from the 44 subjects 
Experiment II who had witnessed the 
fantasy aggression, using a 2 X 2 X 2 factorial 
sign (anger arousal or not, confederate role, 
degree of justification of the witnessed aggressi 


Results 


Efectiveness of manipulations. Justifica 
tion of fantasy aggression. An item in 
short questionnaire administered immedia 
after the movie clip was shown (and be 
the subjects were given the opportunity to 
shock the confederate) attests to the st 
of the justification information. This i 
asked each subject in the aggressive-fi 
groups to rate the extent to which he 
sympathized with the movie protagonist 
(played by Kirk Douglas). There was” 
highly significant main effect (F = 23.97, 
= 1/80, p= 001), with the people who 
been told the Champion was an unprinc 
scoundrel expressing considerably less si 
pathy for him than the subjects in the ol 


reported thinking of the confederate as mt 
of a boxer and as less of a speech major 
the subjects in the other group; there ¥ 
a significant main effect (at less than 
01 level) for confederate role on both it ] 
and in both cases the justified-less justifie 
fantasy aggression manipulation did 
interact significantly with the role varia 

Aggressive behavior, Shock measu! 
Table 4 reveals the mean aggression sco 
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TABLE 4 
MEAN AGGRESSION DIRECTED AGAINST CONFEDERATE, EXPERIMENT IIT 
Angered Nonangered 
Boxer role Speech-major role Boxer role Speech-major role 
Fantasy aggression 

Justified | jutea | Justified justifea | Justified jug | Justified jute 

“4. Number of shocks 5.354 5.03, 4.90, 3.874 4.42, 4.44, 3.925 3.750 
2. Shock duration 16.56, 12.42, 11.85, 9,240» 10.15a» 11,32, 6.675 10.76, 

3. Hostility ratings 58.55p 71.45. 64.640 66.55p0 | 37.270 42.91, 43.000 41.73, 


Note,—Separate analyses were performed for each measure. The shock scores on the first two lines had been subjected to a 


Ve + Vz $1 


‘ormation. Cells having a subscript in common are not significantly different at the .05 level by Duncan multiple 


range test, The higher the score the stronger the expressed aggression, Each mean is based on 11 cases. 


in each condition. Looking first at the non- 
angered groups, we can see that the 
justified-aggression context had not produced 
_ reliably stronger attacks upon the confed- 
erate than had the less-justified-aggression 

story. Just as in the earlier studies, it was 
only when the men had been angered im- 
mediately before that the predicted differ- 
ences arose. The insulted people who had 
witnessed the warranted film aggression sub- 
sequently directed significantly more shocks 
against their tormentor, regardless of the 
label applied to him, than did the subjects 
shown the less justified aggression. The dif- 
ferences were in the same direction for the 
shock duration measure, but was reliable 
only when the insulting confederate had been 
labeled a boxer, Clearly, then, the context 
of the observed violent event had affected 
, the strength of the provoked subjects’ inhibi- 
tions against aggression, At least part of the 

high volume of overt aggression against the 

Confederate in the preceding experiment can 

therefore be attributed to the justified- 

fantasy-ageression introduction to the film 

scene, 

Considering all the mechanisms that con- 

_ æivably could have influenced the adminis- 
tration of electric shocks, it is not surprising 
that the people in the angered-boxer role- 
Justified fantasy aggression group sent signifi- 
cantly more and longer shocks to the con- 
federate than did the subjects in any of the 
other conditions. However, here too we can- 
hot be sure that the boxer in this group had 
evoked a high amount of aggression because 


the angered men had associated him with 
the boxing film. Regardless of the level of 
justification provided in the story, the con- 
federate elicited stronger aggression when he 
was identified as a boxer rather than a 
speech major. The boxer label apparently 
caused the confederate to serve as a cue for 
aggressive responses. But, we also want to 
know, to what extent was this cue value 
affected by the label-mediated connection 
between the boxer and the boxing film? Did 
this aggressive scene set an aggressive re- 
sponse sequence in operation so that external 
cues associated with this scene evoked strong 
aggressive actions? 

Subsequent hostility ratings. After the 
subjects had “evaluated” the confederate’s 
problem solution by means of the electric 
shocks, the reader will recall, they completed 
a questionnaire, Embedded in a section ask- 
ing the subjects to give their impressions of 
the confederate’s personality, five items as- 
sessed their final hostility toward the con- 
federate. These items included such questions 
as, “Would you admit him into your circle 
of friends?” and “Would you like this indi- 
vidual as a roommate?” The 22-point rating 
scales on which the subjects were to answer 
were anchored by the phrases, “Definitely 
yes” and “Definitely no,” with the direction 
of a favorable response being varied. The 
scores were summed over the five items, 
with a high score meaning strong expressed 
hostility. 

In Experiment II this final hostility index 
had yielded significant differences only be- 
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tween the angered and nonangered conditions 
as a whole. In the present experiment, how- 
ever, more interesting results were obtained 
with this measure, probably because the in- 
troduction of the less-justified-aggression 
context had greatly increased the subjects’ 
restraints against aggression, These findings 
are summarized in Row 3 of Table 4. 

The men in the angered condition as a 
whole were significantly more unfriendly 
toward the confederate on this final index 
than were the combined nonangered subjects. 
Above and beyond this difference, however, 
note the scores for the people who had been 
provoked by the boxer; those who had wit- 
nessed the less warranted fantasy aggression 
expressed reliably more hostility toward the 
confederate on this final measure than the 
subjects who had observed the justified ag- 
gression. Two trends seem to account for 
this difference, For one thing, some people 
had suffered an additional frustration. By 
increasing the angered subjects’ inhibitions 
against direct aggression with electric shocks, 
the less warranted fantasy aggression had 
probably heightened the subjects’ arousal 
state. As a consequence, when the men had 
another opportunity to attack their tor- 
mentor immediately afterwards (and in a 
relatively disguised form), their aggressive 
responses were strengthened. But in addition 
to this strengthening of subsequent indirect 
aggression in the inhibited group, there ap- 
parently was something of a decrease in the 
level of final hostility displayed by the less 
restrained subjects. Their earlier strong at- 
tacks on the boxer represented the com- 
pletion of a substantial Proportion of the 
instigated aggressive Tesponse sequences 
operating within them, and this could have 
produced a weakening in the following 
expressions of hostility, 


Discussion 


_ Two parts of the writer’s theoretical analy- 
sis are clearly confirmed by the above results. 
The findings show that the film context can 
affect the observer’s inhibitions against ag- 
gression and that the people encountered 
soon afterwards vary in the extent to which 
they can evoke aggressive responses from the 
observer, However, the present evidence does 
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not unequivocally demonstrate that aggres- 


sive scenes can in themselves elicit aggressive 
responses or that external objects associated 
with the witnessed violence 
strongest overt hostility. 

There are only suggestive findings in this 
paper that point to the eliciting effects of 
observed aggression., Further, these results 
arose only under certain conditions: after the 
men had been angered. Contrary to the dif- 
ferences existing in all of the other condi- 
tiohs, the subjects who had been provoked 
by the speech major manifested stronger 
signs of tension after viewing the prize fight 
than after seeing the neutral film, The ag- 
gressive scene may have evoked fairly strong 
internal aggressive responses in these people 


since they had just been angered and were™ 


primed to behave aggressively. However, I 
have suggested, since ‘the speech major was 
not associated with the film source of this 
particular instigation, he did not have enough 
cue value to elicit the aggressive goal re- 
sponses which would have terminated these 
film-induced response sequences, The persist- 
ing response tendencies were presumably 
experienced as discomfort. 

Although this interpretation obviously rests 
on a series of conjectures and assumptions, 
there seems to be little doubt that the label 
boxer generally heightened the confederate’s 
cue value for aggressive responses. We do 
not know, of course, just why this should be. 
I have hypothesized that the aggression cue 
value could have arisen from either or both 
a Negative attitude toward college boxers and 
the association of boxers with fighting and 
with aggression generally. 

Experiment I reported in this paper did 
not reveal any tendency for the boxer to 
evoke more open hostility than the speech 
major. It may be that the questionnaire 
measure employed in this study did not 
detect the aggressive responses elicited by 
the boxer in the angered-neutral film or non- 
angered-boxing film groups. But, since the 
obtained differences were often in the wrong 
direction, this seems unlikely. My own guess 
is that the difference between the first two 
experiments stem, in part at least, from the 
characteristics of the person who played the 
tole of the boxer. The confederate employed 


w! 


evokes the ` 


[ 
i 


j 


OBSERVED AGGRESSION 


in Experiment II looked very much as if he 


a „might be a boxer; his features and body 


build may have strengthened the effects of 
the label, giving him the general ability to 
elicit overt aggressive responses. In Experi- 
ment I, however, the person who was the 
critical experimenter did not have this clear 
boxerlike quality. Perhaps it was only when 
this person was strongly associated with the 


* aggressive film by virtue of his expressed 


‘interest in the movie and the boxing ques- 
„tionnaire that he could evoke the inténse 
open hostility. 
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CATEGORIZING BEHAVIOR AS A FUNCTION 
OF MEANING DOMAIN 
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Ss categorized statements from each of 3 meaning domains considered to be 
ordered from peripheral to personally relevant: objects, war, self. A greater 
number of categories is used for objects than for the other domains, and the 
greatest concentration of items appears in self. S is considered to structure a 
meaning domain by using a 2-step process: he differentiates relatively nonsig- 
nificant domains from others by determining the number of categories to be 
used, and he protects himself from the consequences of overinclusiveness in im- 
portant categories within significant domains by concentrating items in other 
categories. Women use more categories than do men. Interdomain r’s revealed 
intra-S consistency across domains of number of categories used. 


In recent years the concept “cognitive 
structure,” the organization of meanings, has 
received ever increasing attention. Chein 
(1962) and Scheerer (1954), for example, 
have been concerned with the general role of 
cognitive concepts in the development of 
psychological theories, Brehm and Cohen 
(1962), Festinger (1957), Heider (1958), 
Kelly (1955), Peak (1958), and Witkin, 
Dyk, Faterson, Goodenough, and Karp 
(1962) have concerned themselves directly 
with formal properties of cognitive struc- 
tures, with the relationships among different 
sets of meanings, and with the behavioral 
correlates of cognitive structures, Bruner, 
Goodnow, and Austin (1956) and Hunt 
(1962) have studied the ways in which mean- 
ing categories (concepts) are formed, and 
Sherif and Hovland (1961), Gardner (1953), 
and Gardner and Schoen (1962) have 
brought into focus the process of dividing a 
meaning domain into a set of categories. The 
Present study is concerned with the last- 
mentioned process, 

1I wish to express m; reciati 
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Hovland and Sherif (1952) and Sherif 
and Hovland (1953) demonstrated that there 
is a relationship between an individual's 
stand on an issue and the way in which he 
categorizes items related to the issue. The 
more extreme the position is, the more likely 
is the person to use a small number of cate- 
gories, to use the extreme categories, and to 
place the greatest number of items inta the 
category furthest from his position. (For 4 
discussion of the significance of differential 
concentration of items for scaling procedures 
in general, see Sherif & Hovland, 1961.) 
Independently of the work by Sherif and 
Hovland, and following the reasoning which 
led Klein (1949) and Holzman and Klein 
(1950) to posit the existence of perceptual 
attitudes, Gardner (1953; Gardner, Holzman, 
Klein, Linton, & Spence, 1959; Gard- 
ner, Jackson, & Messick, 1960) studied cate- 
gorizing behavior as a cognitive control. 
“Cognitive control” refers to a relatively 
transsituational mode of dealing with stimu- 
lus events. This mode is seen as a process 
which serves as one of the mediators between 
drive States and preceptual and judgmental 
functions, Over time, they become quasi- 
autonomous and function relatively inde- 
Pendently of momentary drive states. 

As Gardner and his colleagues dealt with 
categorizing behavior, the major emphasis 
was placed upon equivalence range (a revision 
of their position appears in Gardner & 
Schoen, 1962), “Equivalence range” refers to 
the range of stimuli placed into a category; 
for a fixed number of stimuli, equivalence 


—s — 
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range is inversely proportional to the number 

a of categories. Individual differences in num- 

"per of categories used for a particular realm 
of stimulus events are taken as a reflection 
of differences in “inclusiveness of conceptual 
realms” (Gardner et al., 1960). A number 
of studies have been conducted which at- 
tempt to discover the degree to which the 
control equivalence range is related to other 
functions (Gardner, 1953; Gardner et al., 
1959, 1960; Gardner & Schoen, 1962; Petti- 
grew, 1958; Sloane, 1959; Wallach & Caron, 
1959). 

In the sets of studies by Sherif and by 
Gardner, the subject’s task is to categorize a 
set of items (statements, objects, photographs, 
and so on), Sherif and Hovland (1961) refer 
‘to this activity as an “own category” tech- 
nique; Gardner refers to it as a “sorting test” 
to determine equivalence range. One of 

` Sherif’s concerns has been with “latitudes of 
acceptance and rejection” (Sherif & Hovland, 
1961). These latitudes are equivalence ranges 
in the parts of a scale that include the sub- 
ject’s» position (latitude of acceptance) and 
the position furthest from the subject’s 
(latitude of rejection).? Gardner, on the 
other hand, has not dealt with intrasubject, 
intradomain differences in equivalence range 
across a set of categories. One reason for the 
difference in approach may be that, typically, 
Sherif has used items that are ordered along 
a clear-cut dimension—favorableness of an 
item to Negroes, for example—while Gard- 
ner’s items are subject to a greater degree 
„of idiosyncratic ordering. Sherif’s items may 
be referred to an outside criterion, and the 
term “extreme position” takes on immedi- 
ate meaning, but this is not true for 
Gardner’s items. The studies of these investi- 
gators are embedded in markedly different 
Conceptual systems, and there is a difference 
in emphasis which they would place upon 
the generality of categorizing behavior. This 
difference is not sharp (since both of them 
` would agree that categorizing performance is 
7A “latitude of acceptance . . . is defined opera- 
tionally as the range of the positions on an issue 
that an individual considers acceptable to him 
pisinding the one ‘most acceptable’ to him). The 

ude of rejection consists of the positions he 
finds objectionable (including the one ‘most objec- 
tionable’ to him) [Sherif & Hovland, 1961, p. 1291.” 
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a function of the person and of the specific 
items), but one would expect somewhat more 
generality from Gardner’s position than from 
Sherif’s. A question which is relevant to both 
positions is: What is the relationship between 
categorizing behavior with respect to a set 
of relatively neutral objects and such be- 
havior with respect to a set of statements 
dealing with a social issue? 

In considering the relationship between the 
organizing (categorizing behavior) in two 
different meaning domains, one is led back to 
the nature of cognitive structures. Cate- 
gorizing behavior involves the relatively basic 
function of dividing a meaning domain into 
subsections, This division becomes an in- 
tegral part of (and probably also is affected 
by) any hierarchical ordering which exists 
within the domain. Across domains, too, one 
would expect to find a hierarchical ordering, 
so that some domains would affect others 
more than they would be affected by them. 
Some domains would be more personally sig- 
nificant than would others. Witkin et al.. 
(1962), for instance, present the point of 
view that the degree to which an individual 
differentiates his behavioral world is depend- 
ent upon (or is related to) the way in which 
he differentiates himself. Given this point of 
view, another question becomes pertinent: 
To what degree is categorizing of a set of 
items which refer to the self related to cate- 
gorizing behavior in other meaning domains? 

In addition, in almost a self-evident man- 
ner, it is expected that the more peripheral 
the domain the weaker is the structure 
within the domain; therefore there should 
be a larger number of categories used in 
more peripheral domains. 

Given items which sample each of three 
meaning domains—relatively neutral objects, 
reactions to nuclear war, and self-statements 
—two questions are raised: 

1. Is there consistency of categorizing be- 
havior across domains; that is, are there 
significant interdomain correlations? 

2. Does categorizing behavior change as a 
function of the domain? If the domains con- 
sidered above are ordered objects, war, self 
(from least to most personally significant), 
subjects should use the largest number of 
categories for objects. 
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TABLE 1 


TNTERDOMAIN CORRELATION COEFFICIENTS FOR 
Men, WOMEN, AND TOTAL 


Wi Total 

a Eio) T) (n = 36) 
Domain | War | Self | War | Self | War | Self 
Objects | 44 | .20 40 | .59* | 47** | .47** 
Wat .80** AS .66** 
* p = 05. 
eal sh 

METHOD 


The 36 subjects were juniors, seniors, and non- 
psychology graduate students at the University of 
Oklahoma who volunteered to participate in the 
study; 19 of them were men. None of them was 
familiar with the techniques used or with the 
purpose of the study. Each subject spent about an 
hour working on three sorting tasks. The order of 
presentation was counterbalanced so that each of 
the six permutations of order of task was used, 
six subjects being assigned randomly to each 
permutation. 

The material for each of the tasks consisted of a 
number of statements, each one typed on a 4 X 6 
„inch card. Three sets of cards were used: a set of 
73 verbal descriptions of the objects used by 
Gardner (1953), a set of 92 self-referring statements 
developed by Butler and Haigh for a Q sort 
(Rogers & Dymond, 1954, pp. 79, 275-277, 388- 
389), and a set of 36 statements used to determine 
attitudes toward nuclear war.? These sets of state- 
ments and the categorizing tasks associated with 
them will be referred to as “objects,” “self,” and 
“war,” respectively. 

Before presenting the tasks to the subject, the 
experimenter attempted to establish a warm, per- 
missive atmosphere and to remove any kind of 
time pressure. The same set of instructions, modified 
appropriately for nature of the material and for 
order of presentation, was used for each of the 
tasks, The instructions (Gardner, 1953) were: 


You will be given a number of tasks to per- 
form. There are neither right nor wrong ways 
to do them, so you do them in ways that are 
comfortable for you, Although some of them 
may seem similar to each other, you regard each 
task as though it were the only one you are to 
perform. There is no timing of the tasks, so you 
can take as much time as you need. 

First of all [Again], I want you to know that 
there is no answer to this task. Everyone does 
it in his own way. I want you to do it in the 
way that seems most natural, most logical, and 
most comfortable to you. The instructions are 


3I wish to express my appreciation to David K. 
Berlo, Chairman of the Department of Communica- 
tion, Michigan State University, for making the war 
items available to me. 
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simply to put together into groups the statements 
which seem to you to belong together. You may 


as you like, so long as the statements in each 
group belong together for one particular reason. 
If, after you have thought about all the state- 
ments, a few do not seem to belong with any of 
the others, you may put those statements into 
groups by themselves, Please sort all the state- 
ments. 


RESULTS 


The interdomain correlation coefficients 
reported below are measures of the relation- 
ship between the number of categories used 
in a domain and the number of categories 
used in some other one. Sex differences with 
respect to correlations have not been found 
consistently (Gardner et al., 1959), but con- 
sistent sex differences with respect to equiva~ 
lence range have been reported (Pettigrew, 
1958; Wallach & Caron, 1959). These dif- 
ferences will be preserved in the analysis of 
the data. 

Question 1, Is there consistency of cate- 
gorizing behavior across domains? 

The answer to this question may be found 
in Table 1, For men, only the correlation 
between self and war differs significantly 
from zero, It is significantly greater than 
the correlations between self and objects 
(p <.01), but it is not greater than the 
objects-war correlations, For women, only 
the correlation between self and objects dif- 
fers from zero. No significant differences 
exist among the three correlation coefficients 
for women. All of these r’s are too unstable 
(men, df = 17; women, df = 15) to warrant 
further consideration. 

Since there are no significant men-women 
differences between corresponding correlation 
coefficients, total correlation coefficients at 
presented, For total, all three are signifi- 


TABLE 2 


DISTRIBUTION OF MEANS OVER SEX AND 
MEANING DOMAIN 


H (In Birs) 
Meaning domain 
Sex Total 
Objects War Self 
Men 2.97 2.03 1.82 2.28 
Women | 3.59 3.02 2.87 3.16 
Total | 3.26 2.50 2.32 2.69 


have as many or as few statements in a may} 


+ 
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TABLE 3 
ANALYSIS OF VARIANCE: H (in Bits) 


Source df SS MS F 
Sex 1 | 21,01 | 21.01 | 17.85*** 
Subjects within sex 34 | 40,02} 1.18 
Meaning domain 2| 18.15| 9,07 |24.98*** 
Domain X sex 2| 106| .53| 1.45 
Domain X subject 
within sex 24.69| 36 
Total 107 | 104.93 
kp = 001, 


cantly different from zero, and there are no 

significant differences among them (the dif- 

ference between r= .66 and r= .47 yields 
t= 1.45, df = 33, p = 15). Using these re- 
sults, the answer to the first question is that 

‘(for the meaning domains sampled here) 

categorizing behavior is consistent across 
_ domains. 

Question 2. Does categorizing behavior 
change as a function of the meaning domain? 

Consider a situation in which one tries to 
predict the category into which the subject 
will place an item chosen at random. If the 
subject uses only one category into which 
to place such items in a domain, his behavior 
is completely predictable—there is minimum 
uncertainty, maximum structure. If he uses 
a very large number of categories, and dis- 
tributes the items equally over all categories, 
there is a high degree of uncertainty and a 
low degree of structure. Suppose, Now, that 
the number of categories which the subject 
uses is fixed. Then the degree of uncertainty 
(and, inversely, the degree of structure) 
changes as a function of the distribution of 
items over categories; that is, uncertainty is 
maximal when the distribution is rectangular, 
and the degree of uncertainty decreases as 
the distribution departs from rectangularity. 
Therefore, the degree of structure in the sub- 
ject’s categorial system is a function of the 
„number of categories he uses and of the 
tectangularity of the distribution of items 
over categories (Attneave, 1959; Garner, 
1962), 

H (the measure of uncertainty in bits) 
and its components, number of categories 
and H/Hyaz, will be used to indicate the 
degree of structure in a domain. Degree of 
structure varies inversely with all three 
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measures. H/Hmax indicates the amount of 
uncertainty in the subject’s response set rela- 
tive to the amount of uncertainty which 
would exist if the subject used all of his cate- 
gories equally often; therefore, it may be 
used to indicate degree of rectangularity of 
the distribution of items over categories.* 
(See Garner, 1962). 

Tables 2 and 3 contain the Sex X Domain 
distribution of mean H and the summary of 
the analysis of variance. Tables 4 and 5 con- 
tain the Sex X Domain distribution of mean 
number of categories and the summary of the 
analysis of variance. For both measures, sex 
and domain effects are significant. There are 
higher H values and a greater number of 
categories for women than for men, indi- 
cating a lower degree of structure which is 
attributable to a larger number of categories 
for women than for men. Application of the 
Duncan multiple range test (Edwards, 1960) 
to the domain means reveals that for both 
measures there is a higher mean for objects 
than for war or self (e = .01), but there is 
no difference between the means for war and 
self (e = .05). That the number-of-categories 
differences cannot be accounted for by the 
numbers of items used (objects = 73, wat 


4Sherif and Hovland (1953) developed a scale 
which permitted them to combine number of cate- 
gories and the concentration of items in extreme 
categories. While the 3-point scale served their 
purposes very well, there are obvious advantages 
to having a scale with a larger number of values. 
The measure H also combines number of categories 
and concentration of items, and it takes on nearly 
continuous values, H/Hymsx provides the same ad- 
vantage for concentration of items alone. Even 
under unrestricted sorting conditions, where there 


groups along 
polar categories, these measures yield useful results. 


TABLE 4 


DISTRIBUTION OF MEANS OVER SEX AND 
MEANING DOMAIN 
NUMBER OF CATEGORIES 


Meaning domain 


Sex ee rR E O Total 
Objects War Self 

Men 13.00 5.42 5.53 7.98 

Women 17.82 10.18 11.71 13,24 

Total 15.28 7.67 8.44 10.46 
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TABLE 5 
ANALYSIS OF VARIANCE: NUMBER OF CATEGORIES 


Source df ‘SS MS F 
Sex 1 | 742.69 | 742.69 | 11.46*** 
Subjects within sex | 34 | 2,203.49 | 64.81 
Meaning domain 2 | 1,262.74 | 631.37 |24.50*** 


Domain X sex 2 11.58| 5.79 
Domain X subject 
within sex 68 | 1,752.35 | 25.77 
Total 107 | 5,972.85 
rep = 001. 


= 36, self = 92) is indicated by the fact 
that objects has the highest mean but an 
intermediate number of items. 

Tables 6 and 7 contain the Sex X Domain 
distribution of mean H/Hmax and the sum- 
mary of the analysis of variance. There is 
no significant sex effect but, again, there is a 
significant domain effect. The Duncan multi- 
ple range test reveals that self has a lower 
relative-uncertainty mean than does either of 
the other domains (a= .01), but that war 
and objects do not differ from each other 
(a = .05) 5 


5A rough indication of the effect of the differen- 
tial concentration of items is given by the mean 
number of categories used (Table 4) and the mean 
amount of information contained in the use of 
these categories (Table 2). The mean number of 
categories used is about 10, and the mean amount 
of information contained in the subject’s responses 
is 2.7 bits. The number of catgeories associated with 
2.7 bits under an ideal coding system is 6.5. The 
differential use of categories, then, reduces the ef- 
fective number of categories by 3.5. That this is 
only a rough approximation, and one which may 
be an overestimate of the effect, is shown by the 
fact that the mean relative uncertainty is .89 
(Table 6). Taking .89 of 10 results in a reduction 
of only 1.1 categories rather than of 3.5. It seems 
clear, however, that the differential concentration 
of items does reduce the effective number of cate- 
gories used by 10-35%. 


TABLE 6 


DISTRIBUTION OF MEANS OVER SEX AND 
MEANING DOMAIN 
H/H max 


Meaning domain 
Sex 


Total 
Objects War Self 
Men 90 90 84 
Women | <89 93 85 39 
Total | .90 ‘91 84 89 
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The answer to Question 2 now may be 
stated as: Categorizing behavior does change j 
as function of meaning domain. If the do- . 
mains are ordered objects, war, self as from ` 
least to most personally relevant, the least 
relevant domain (objects) has the greatest + 
uncertainty (lowest degree of structure) 
associated with it. Since the measure of un- 
certainty is a function of the number of cate- 
gories used and of the concentration of items — 
in the categories, two substatements may be 
måde: The largest number of categories is — 
associated with the least significant domain | 
(objects), and the most unequal distribution 
of items over the categories used is associated | 
with the most significant domain (self).* 


TABLE 7 Bal 
ANALYSIS OF VARIANCE: H/H max 


Source df | SS | MS| F 


Sex 1 |.004 | .004 
Subjects within sex 34 | .355 | .010 Hi 
Meaning domain 2 | .092 | .046 | 6.24" 
Domain X sex 2 | .007 | .003 
Domain X subject within sex | 68 | .505 | .007 |, 

Total 107 | .963 

** p = 01, 

Discussion 


Sex differences. Although minor differences 
appear with respect to cross-domain correla- 
tion coefficients, an r X r comparison across 


ê The data presented above may be accounted for 
by showing that different degrees of item homogene- 
ity characterize different domains. If there welt 
increasing homogeneity (independent of “signifi- 
cance”) from objects to war to self, then one would: 
expect the lowest degree of structure in objects and 
the highest degree in self. No attempt was made to 
control this variable because I do not believe that 
such control is appropriate. One of the consequences 
which follows from acceptance of a conceptus 
system which is based on the propostion that am 
individual develops his own view of the world is 
that, for many regions of activity, “reality” takes 
on an idiosyncratic cast. The further one goes from 
simple events and from achievement criteria, the 
more difficult it is to say how many categories , 
there “really” are unless one also specifies the cate- 
gorizer. Sherif and Hovland (1961) have show? 
that categorizers differ with respect to relatively 
clear-cut issues. As more and more uniquely deter- 
mined categorial systems are studied, one C4? 
specify the properties of events in different meaning 
domains only by sampling categorizers, procedures 
and items (Brunswik, 1956). 
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sexes yields no significant differences. No 


= consistent body of information exists with 


which to compare these results. 

Women exhibit a lower degree of structure 
(have higher H values) than do men, and 
this difference may be accounted for specifi- 
cally by the fact that women use a greater 
number of categories than do men; they do 
not concentrate the items over categories 
more than do men. The finding that women 
‘use more categories than do men apparently 
is consistent with the results of studies by 
Pettigrew (1958) and by Wallach and Caron 
(1959). These authors found that females 
used smaller category widths on the Petti- 
grew Category Width Scale than did males, 
and that girls tolerated less deviation from 
a standard on a similarity-judgment task 
(Wallach & Caron, 1959). Wallach and 


< Caron account for these results by referring 


Sed eel 


‘ficients among objects, 
, Support to the positions of 


them to differential sex training which results 
in greater independence of external standards 
for males than for females. Pettigrew also 
Suggests that women may have less tolerance 
for etrors of overinclusiveness than do men. 
A question may be raised as to the compa- 
tability of underlying factors related to 
category-width performance and to sorting 
performance, Further study is needed to an- 
Swer such a question and to determine 
whether the same child-rearing practices 
affect both kinds of behavior. 

Interdomain consistency. The total r’s re- 
ported in Table 1 show clearly that such 
Consistency exists; however, a strong note of 
‘Caution is required. Sloane (1959) offers evi- 
dence that there may be a factor which is 
Specific to sorting operations. In order to 
make a definitive statement about interdo- 
main consistency, it is necessary to use other 
Sets of operations to measure the categorizing 
process (Campbell & Fiske, 1959). 

The magnitude of the correlation coef- 
war, and self lends 
both Sherif and 
Gardner, Since variation in any one domain 
accounts for only .22 to .46 of the variance 
Of some other domain, a large proportion of 
the variance in a domain is specific to that 
domain (or to factors other than the ones 
Considered here). Since all of the correlations 
are reliably different from zero, it is equally 
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clear that a general form of behavior is 
involved, Hovland and Sherif (1952) and 
Sherif and Hovland (1953) have shown that 
categorizing may be used to ascertain an 
individual’s organization of a particular do- 
main, The data presented here indicate that 
the individual’s organization of events in a 
specific domain is also a function of his 
general mode of organizing events. 

The finding that self correlates with ob- 
jects and with war is consistent with the idea 
that the degree to which an individual dif- 
ferentiates his behavioral world is dependent 
upon the way in which he differentiates him- 
self (Witkin et al., 1962). In view of the 
apparent importance of self to the subject, 
it is tempting to think of self as a core 
domain around which the others are or- 
ganized. There is no direct evidence in this 
study to support this view, and it will not 
be developed further. 

Organization of domains as a function of 
domain, The findings with regard to changes 
in categorizing behavior as a function of 
meaning domain may be seen, in part, as an 
extension of the work of Sherif and his asso- 
ciates. They have demonstrated that a de- 
crease in the number of categories used and an 
increase in differential concentration of items 
across categories (heavier concentration in 
the extreme categories) are the result of the 
extremeness of the subject's position (as an 
indication of the intensity of significance of 
the issue for the subject). This demonstration 
depends upon a comparison of the subjects 
who hold different positions on a single issue. 
In a similar manner, it has been demon- 
strated here that as a meaning domain in- 
creases in personal relevance within indi- 
viduals, there is a decrease in number of 
categories used and an increase in differential 
concentration of items across categories. 

The particular fashion in which subjects 
respond to unrestricted conditions of cate- 
gorizing suggests that the organization of 
meanings involves a two-stage process. In the 
first stage, the subject controls the amount 
of information with which he has to deal 
by setting the number of categories to be 
used for a meaning domain. In the second 
stage, he makes another adjustment by using 
some categories more frequently than others. 
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This two-stage process may be understood 
by considering the fact that controlling the 
amount of information brings into play the 
risks inherent in overinclusiveness and in 
overexclusiveness, risks of accepting negative 
instances and of rejecting positive ones.” If 
the subject uses a small number of categories 
there are fewer events to deal with, there is 
a higher degree of organization, but the risk 
of misplacing items has increased, Since the 
subject must deal with all of the items pre- 
sented to him, overinclusiveness in some cate- 
gories implies overexclusiveness in others. 
For a highly significant category within an 
important domain, if the subject considers 
an error of oyerinclusiveness as being more 
serious than an error of overexclusiveness, 
then he can protect himself against accepting 
too many negative instances by placing rela- 
tively few items in the category of concern 
and relatively many items in other categories, 
(This is exactly what subjects in the Sherif & 
Hoyland, 1953, study seem to have done.) 
Subjects in the current study behave as 
though there is an optimum degree of or- 
ganization for a particular subject dealing 
with a particular domain. Suppose this 
optimum degree to be high; that is, the 
optimum level of H is low. If the subject 
achieves the optimum level of H only by us- 
ing a small number of categories, he must ac- 
cept the consequences of having too many 
misplaced items in categories of high signifi- 
cance, Since he can control these consequences, 
by manipulating the frequency of items in a 
category, the optimum level of organization 
(optimum level of H) may be achieved by 
choosing a number of categories that re- 
sults in a higher-than-optimum level of H 
and then bringing the level down to optimum 
by means of concentrating items (reducing 
H/Hyax). The first stage of organization 
apparently is used to differentiate between a 
relatively inconsequential domain (objects) 
and domains of moderate to high relevance 
(war and self). The second level of organiza- 
T Generally, investigato: i 
(Gardner, ee San ce oe ee tea eae 
grew, 1958; Wallach & Caron, 1959) have treated 
inclusiveness-exclusiveness as though it were uni- 
form over the entire set of categories. Evidence re- 


Ported in Sherif and Hovland (1961) and in this 
study indicate that this is not so. 
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tion differentiates between a highly relevant 
domain (self) and domains of moderate to, 
low relevance (war and objects). 

If this interpretation is correct, then it 
may be said that individuals control the 
information in a domain by using fewer or 
more categories according to whether the 
domain is more or less significant to them. 
Within significant domains they differentiate 
among more or less significant categories, and 
they further control information by placing 
items unequally into the categories. They 
seem willing to tolerate the risk of misplacing 
items into other categories in order to pre- 
serve the homogeneity of items in highly 
significant ones. 
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STRESS-REDUCING PROPERTIES OF SOCIAL STIMULI’ 


STANLEY KISSEL 
Rochester, New York, Child Guidance Clinic 


Previous research has suggested that stress induces motivation to be with other 
people, The present study was designed to examine experimentally the effect on 
stress of being with other people. Hypotheses tested were: The presence of an- 
other person in a stressful situation reduces stress, the presence of a friend in a 
stressful situation reduces stress more than the presence of a stranger, and 
stressful individuals with strong affiliation motivation show more stress reduc- 
tion than those with weak affiliation motivation in the presence of another 
person. In a stressful situation, stress responses were lower in the presence of 
another person, if he was a friend. The presence of a friend reduced the stress- 
fulness of the situation more than the presence of a stranger. Affiliation motiva- 
tion was found not to be significantly related to social stimuli as stress reducers. 


Much of the literature investigating the 
stress-reducing attributes of social stimuli 
has demonstrated that the presence of an- 
other person reduces the stressfulness of a 
taxing situation (Boston Psychopathic Hos- 
pital, 1955; Burlingham & Freud, 1943; 
Mandelbaum, 1952; Marshall, 1951; Schach- 
ter, 1959). An examination of relevant stud- 
ies involving social behavior of lower animals 
leads one to a similar conclusion: The pres- 
ence of another animal in a stressful situation 
results in a reduction of responses commonly 
associated with animal discomfort (Brady, 
1958; Conger, Sawrey, & Turrell, 1957; 
Davitz & Mason, 1955; Liddell, 1950, Ma- 
son, 1960), Finally, the conclusion that social 
stimuli have stress-reducing properties ap- 
pears to follow logically from Schachter’s 
recent research on the psychology of affilia- 
tion which demonstrated that stressful situ- 
ations arouse affiliative tendencies, It is thus 
reasonable to expect that under conditions of 
Stress, gratification of affiliation needs would 
lead to a reduction in stress, 

The present research was designed to in- 
vestigate the hypothesis that social stimuli 
reduces stress, The hypothesis was examined 
by subjecting individuals to stressful situa- 


1This report is based in Part on a dissertati 
completed in 1961 at the University of Bors The 
author is grateful to Lawrence W. Littig, under 
whose guidance this study was performed. The 
author also wishes to thank the members of the 
dissertation committee for their assistance and 
guidance: Olive P. Lester, Ira S. Cohen, Kenneth 
H. Kurtz, and Adelle H. Land. 
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tions, alone and in the presence of another k 
person. Furthermore, some of the aforemen- 

tioned studies suggest that the effectiveness $ 
of social stimuli as stress reducers is due to 
the presence of an organism with whom some 
previous interaction has taken place (Bur- 
lingham & Freud, 1943; Liddell, 1950; 
Mandelbaum, 1952; Marshall, 1951). It was 
hypothesized that the presence of a person 
with whom one had previously interacted (a 
friend) would diminish a stressful situation 
more so, than the presence of a person with 
whom one had no previous interaction (4 
stranger). Thus, when another person was 
present, he was either a friend, a person with 
whom some previous interaction had taken 
place, or a stranger, a person with whom n0 
previous interaction had taken place. Another 
aim of the study was to present data relevant 
to the validity of the method of content: 
analysis of thematic stories as a measure of 
the affiliation motive. Schachter (1959) 14 
ported that an anxiety-provoking situation 
elicited affiliative tendencies in his subjects. 
One might speculate that in general, the 
stronger the underlying need a person has t0 : 
affiliate, the quicker and more forceful will’ 
stress motivate that person to desire affilia- 
tion. Thus, it would seem reasonable to a 
pect that under conditions of stress, indi- 
viduals with strong affiliative needs will show . 
a greater reduction in stress responses thal 
individuals in whom affiliative needs ate 
weak when affiliation needs are satisfied. 


Ss 


r 


‘arus, Deese, & Osler, 
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METHOD 


«Definition of Terms 


Stress was differentiated into stress stimulus and 
stress response. In considering the stressfulness of a 
stimulus the motivational state of an organism, as 
well as the particular task imposed, must be taken 
jnto consideration. It is from the thwarting or 
threatening of individual motives that stimuli take 
on their stressful properties (Applezweig & 
Moeller, 1958; Broverman & Lazarus, 1958; 
Lazarus, Baker, Broverman, & Mayer, 1957; Laz- 
1952; Vogel, Baker, & Laz- 
Arus, 1958). Therefore, a stress stimulus was opèra- 
tionalized in terms of induced failure on a task, and 
motivation was specified as motive to avoid failure. 
The literature indicates that the Mandler-Sarason 
Test Anxiety Questionnaire (TAQ; Mandler & 
Sarason, 1952) is a measure of anxiety in testing 
situations (Alpert & Haber, 1960; Mandler & Sara- 
son, 1952; Sarason, 1959; Sperber, 1959) and more 
specifically a measure of the strength of the motive 
to avoid failure (Atkinson & Litwin, 1960). 

Stress as a response was conceptualized in terms 
of activation level. The concept of activation level 
has been developed and employed by many writers 
(Duffy, 1957; Malmo, 1959; Schlosberg, 1954; 
Woodworth & Schlosberg, 1954) and is viewed as 
characterizing the intensity dimension of behavior. 
This aspect of behavior is conceived as occurring 
ona continuum from a low point in sleep to a high 
point during extreme excitement. Any measure 
which systematically varies along this continuum 
can be considered an indicant of the level of activa- 
tion, The index of activation level used in the 
current study was palmar skin conductance which 
has been cited by several authors as an appropri- 
ate and effective indicant of activation (Duffy, 
1957; Malmo, 1959; Schlosberg, 1954). Stress is 
one of the many factors which produce variations, 
in the degree of activation. Generally, the more 
severe the stress, the greater will be the activation 
level. By means of palmar skin conductance, a 


. measure of the intensity of stress can be obtained. 


Therefore, stress response was operationalized in 
terms of change in palmar skin conductance; and 
the higher the conductance score the more intense 
the stress. 

Affiliation (n Aff) was assessed by means of the 
content analysis technique for measuring n Aff from 
thematic apperception stories (Heyns, Veroff, & 
Atkinson, 1958). 

A friend, for purposes of the present study was 
defined as a person with whom one had previously 
interacted, and was obtained by having the sub- 
jects submit a list of people they knew who were 
enrolled in any other, as well as their own, psy- 
chology course. Any person enrolled in psychology 
who did not appear on a person’s list was consid- 
ered to be a stranger for him. 


Subjects 


i Subjects were selected on the basis of their per- 
ormance on measures of the affiliation motive, and 
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the motive to avoid failure. Two groups consisting 
of 200 and 84 male introductory psychology stu- 
dents, respectively, were administered both meas- 
ures. They were also requested to list the names of 
friends of the same sex enrolled in the introductory 
psychology course. From these groups, 96 subjects 
were chosen to participate in the study, on the 
basis of their performance on the administered 
measures. 

The affiliation motive was measured by means of 
content analysis of thematic apperception. Two 
hundred subjects wrote four 4-minute stories and 
84 subjects wrote six 4-minute stories under 
neutral conditions. The pictures used were 2, 1, 8, 
4, 28, and 5 in the McClelland catalog (Atkinson, 
1958) and were presented in that order” Each 
picture was presented to both groups of subjects, 
one at a time, by means of a slide projector. Each 
picture was exposed for 20 seconds, after which 
subjects were given 4 minutes to write a story. The 
n Aff scores were determined according to the pro- 
cedure described by Heyns et al. (1958), by an 
experienced scorer® The experimenter scored 25 
protocols chosen at random and achieved an inter- 
scorer reliability coefficient of .98 with the scorer. 
Separate distributions were made for the four- and 
six-picture tests, and were divided at the median. 
The median n Aff score for the six-picture test was 
three and for the four-picture test was zero, Sub- 
jects scoring above the median were classified as 
being high in n Aff, and those scoring below the 
median, low in n Aff. 

The motive to avoid failure was measured by 
the TAQ. The instrument, a self-rating scale con- 
sisting of 39 questions has been fully described by 
Mandler and Sarason (1952). The higher the score 
the greater the strength of the motive to avoid 
failure. Subject’s rating was scored by dividing each 
question into five equal intervals, 1-5, in the direc- 
tion of increasing reports of anxiety and summing 
over the scale scores to obtain a total test-anxiety 
score. Separate distributions were also made for the 
test-anxiety scores in each of the two groups, and 
were divided at the median. The median score for 
the group who took the six-picture TAT was 96, 
and for the other group, 92. Subjects above their 
respective medians were classified as being high in 
test anxiety and those scoring below, were classified 
as being low in test anxiety. 


Experimental Design 


Subjects were classified on the basis of both 
measures into the following groups: high test 
anxiety-high n Aff, high test anxiety-low n Aff, low 
test anxiety-high n Aff, and low test anxiety-low n 
Aff. Twenty-four subjects were assigned to each of 
these groups and eight subjects from each of these 
four groups were then randomly assigned to each of 
Bee ee 


2When four pictures were used, they were 2, 1, 
8, and 4 and were presented in that order. 

3 Catherine Cariero who had obtained an inter- 
scorer reliability coefficient of .90 with practice 
material described by Smith and Feld (1958). 
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the following three conditions: alone, the presence 
of a friend, and the presence of a stranger, yielding 
a total of 32 subjects in each of the three condi- 
tions, 


Apparatus 


Palmar skin conductance was measured with an 
apparatus similar to one suggested by Schlosberg 
(1954). It consisted of a direct current meter, 
scaled from 0 to 100 microamperes, which was 
connected in series with a 3-volt battery and a 
subject. A 200-ohm potentiometer was used to re- 
duce the circuit voltage to 2 volts. A 20,000-ohm 
precision resistor which could be switched on in 
place of the subject’s resistance was used for cali- 
bration of the circuit. A variable resistor (800-950 
ohms), which could place a shunt across the meter 
terminals and reduce the meter sensitivity of 4 or + 
for measuring high conductance levels, was also in- 
cluded in the apparatus. The electrodes were $-inch 
wire grids which were taped to the first and third 
fingers of the subject’s nondominant hand, Meas- 
urements were converted into micromhos by divid- 
ing by 2, The apparatus was calibrated before and 
after each subject was run, 

Conductance levels were obtained while the sub- 
ject worked on a modified form of Feather’s (1960) 
perceptual reasoning task which consists of five 
line drawings which the subject must trace over 
without lifting his pencil from the drawing or 
retracing a line, Within these restrictions, one of 
the line drawings can be solved (the sample item) 
and four cannot be solved. When another person 
was present in the situation, he was asked to per- 
form a task which was quite different from the 
experimental task. The other person was presented 
with a case history which took about 10 minutes to 
read, and required to answer a number of questions 
based on the case history, 


Procedure 


When the electrodes were being placed on the 

subject* it was explained to him that he was 
participating in an experimental investigation of 
physiological reactions of persons performing a task 
and therefore electrical recordings of his reactions 
would be made. When another Person was present 
in the situation, they were also asked not to talk 
with each other and were informed that although 
the experimenter would not be present, he would 
be able to hear what was happening through the 
intercom, 
f When the electrodes had been attached, the sub- 
jects were allowed to rest for 15 minutes to permit 
GSR to attain a constant state. At the end of the 
15-minute rest period five 1-minute readings of the 
subject’s skin conductance were taken. These read- 
ings provided the base level and served as the con- 
trol variable in an analysis of covariance. 


“When two people were Present, electrodes were 
placed on both, although the other person’s elec- 
trodes were placebo electrodes. 


STANLEY KISSEL 


After the five 1-minute recordings were obtained 
the subject was introduced to the experimental task 
with a solvable practice task. When the subject 
indicated that he understood the task and the 
rules under which it was to be performed he was 
introduced to the four uhsolvable tasks with the 
following instructions: 


In front of you are four wells. Each well con- 
tains ten copies of a different design making a 
total of forty designs that you will have to 


| 
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complete. You will work in the following man- © 


ner: Remove a design from one of the wells; you 
may start whenever you like. After you are ready 


to start working say “Ready” and I will tell you | 


to begin work. You will have 30 seconds to com- 
plete work on a design before I tell you to stop. 
You will then place the design face down and 
take another design and say “Ready” after 
which I will tell you to begin. We will continue 


this way until all designs have been done. You ; 


will be able to work in any order that you wish: 


If the subject had no question at this point the 
experimenter continued with the following: 


These designs are more difficult than the practice 
design, Remember, keep your hand [with the 
electrode] as still as possible and use only your 
other hand in performing the task. Get ready to 
start and I will tell you when to begin. 


At this point the experimenter left the subject 
and retired to an observation room for the duration 
of the experiment. All further communications were 
conducted by means of an intercommunication sys- 
tem. Measurement of the subject’s skin conductance 
was made at 1-minute intervals until the experi- 
ment was completed. 

When two subjects were present, the experimental 
subject was given instructions last. The other pet 
son present was told the following: 


You will have to read this case material and then 
answer some questions concerning the case. There 
are instructions on the question sheet which até 
self-explanatory and should answer any ques 
tions which might arise during the course of the 
experiment. 


The other person was asked if he had any questions; 
and if not, the experimenter continued: 


While you are working on your task, Mr. —— 
and I will be communicating with each other 
Try not to be disturbed by this. Remember, keep 
your hand [hand with electrode] as still as p95- 
sible and use only your other hand. 


At the conclusion of the experiment, the expel 
mental subject was requested to fill out a ques 
tionnaire which was designed to obtain the sub- 


ject’s degree of acquaintance with the other perso? 


Present, It consisted of one self-rating item whi 
asked, “To what extent are you acquainted wil 
the person who was in the experimental situation 
with you?” An analysis of the responses to the 
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questionnaire demonstrated the effectiveness of the 


„procedure under which friends and strangers were 


obtained. 
RESULTS 


The GSR data were analyzed using a Co- 
variance design (McNemar, 1955) to take 
account of individual differences in initial 
conductance levels. Both resting-state and 
working-state GSR scores were positively 
‘skewed and a square-root transformation 
was applied to the data to reduce skewedness 
and nonnormality. 

Before considering the results in the light 
of the major hypothesis, it is important to 
determine the effectiveness of the stress- 
inducing task. 


TABLE 1 


Meran NONADJUSTED AND ADJUSTED PALMAR GSR 
LEVELS IN MICROMHOS FOR HIGH AND 
Low TAQ Grours 


Mean GSR level 


TAQ 
° Nonadjusted Adjusted 
High* 5.69 5.84 
Low# 5.57 5.42 
aN =48. 


The nonadjusted and covariance adjusted 
mean GSR scores obtained from high and 
low TAQ subjects during the experimental 
period may be found in Table 1. It is appar- 
ent that high test anxiety is associated with 
higher skin conductance and low test anxiety 
‘under conditions of induced failure. The dif- 
ference between the adjusted mean GSR 
readings of the two TAQ groups is statisti- 
cally significant (F = 7.2, df= 1/83, PS 
01, see Table 2). Thus, the evidence indi- 
cates that the situation was indeed a stress- 
ful one for the subjects. 

The hypothesis that the presence of an- 


aa person in a stressful situation will re- 
, duce the stressfulness of the situation was 


confirmed. Adjusted mean conductance level 
for the three groups were: alone, 5.97; 
stranger, 5.68; and friend, 5.27 (see Table 
3). The overall variation among the means 
was statistically significant (F = 7-16, df= 
2/83, p < .01, see Table 2). 

The second hypothesis was confirmed. The 
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TABLE 2 


ANALYSIS OF COVARIANCE OF CONDUCTANCE ScoRES 
¥OR THE EXPERIMENTAL CONDITION 


Source af MS F 

Test anxiety (A) 1 3.84 7,.20** 
n Aff (B) 1 32 
Presence of other (C) 2 3.83 7.16** 
AXB 1 19 
AXC 2 92 1.72 
BXC 2 -20 
AXBXC 2 2.80 525% 
Error 83 „53 

*p <.05. 

++p J 01. 


three groups ordered themselves in the fol- 
lowing manner: alone, stranger, and friend 
from highest to lowest (see Table 3). The 
adjusted differences between friend and 
stranger, .41, and between friend and alone, 
.70, were found to be significant at the .05 
and .001 levels, respectively. The adjusted 
difference between stranger and alone, .29, 
was not statistically significant (p < .10). 
Thus, in order for another person’s presence 
to have stress-reducing effects, the other per- 
son present in a stressful situation must be a 
friend. 

The final hypothesis which the study in- 
vestigated stated that subjects with high af- 
filiative needs will show less stress responses 
than subjects with low affiliative needs in the 
presence of another person in a stressful sit- 
uation. This hypothesis was not supported. 
The interaction between n Aff (B) and pres- 
ence of other (C) was not statistically sig- 
nificant (F = .20, df= 2/83, p> .05, see 
Table 2). 

An interaction between n Aff, test anxiety, 
and the presence of another person was not 
predicted; however, the second-order inter- 
action (A X B X C in Table 2) proved to be 
significant (F = 2.30, df = 2/83, P< 05, 


TABLE 3 


ORIGINAL AND ADJUSTED PALMAR CONDUCTANCE 
SCORES FOR THE ALONE, STRANGER, 
AND FRIEND CONDITIONS 


Group Original Adjusted 
Alone* 5.80 5.97 
Stranger* 5.31 5.68 
Friend* 5.82 5.27 
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TABLE 4 
SUMMARY OF THE ADJUSTED MEAN PALMAR 
CONDUCTANCE SCORES FOR THE INTER- 
ACTION OF TEST ANXIETY, n AFF, 

AND PRESENCE OF OTHER 

Alone | Stranger | Friend 
n Aff Test anxiety 

High | Low | High | Low | High | Low 

Hight | 5.86 | 6.12 | 6.43 | 5.10 | 5.46 | 5.19 
Lowe | 6.14 | 5.76 | 5.64 | 5.53 | 5.51 | 4.93 
aN =48, 


see Table 2). Table 4 presents the adjusted 
cell means necessary to evaluate the pattern 
of the interaction. Low n Aff subjects’ con- 
ductance scores were fairly uniform for the 
different levels of test anxiety across the 
different conditions of presence of another 
person. Low n Aff subjects showed higher 
conductance scores in association with high 
test anxiety than in association with low test 
anxiety. There was also a stepwise decrease 
in conductance scores from the alone to the 
stranger to the friend condition in the low 
n Aff-low test anxiety subjects, In the alone 
condition the mean conductance score for the 
subjects was 5.76; in the stranger condition, 
5.53; and in the friend condition, 4.93 (Table 
4). This was also true for subjects in whom 
n Aff was high and test anxiety low, except 
for a slight reversal in the friend and 
Stranger condition, However, with high test 
anxiety and high n Aff, the stranger condi- 
tion produced the highest conductance score 
followed by the alone condition and the 
friend condition, This is interpreted as re- 
vealing an intensification of the stress situa- 
tion by the presence of a stranger for sub- 
jects in whom test anxiety and n Aff are 
both high and the reduction of the stress- 
fulness of the situation for comparable sub- 
jects by the presence of a friend, 


Discussion 
Social Stimuli and Stress Reduction 


The hypothesis that social stimuli reduce 
Stress responses was partially confirmed by 
the present investigation. This finding com- 
plements Schachter’s (1959) conclusion that, 
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“anxiety leads to the arousal of affiliative | 
tendencies [p. 12].” If social stimuli serve: 
as a source of stress reduction and this has į 
reward value, then it follows that people in 
stressful situations will prefer to be with 
other people. Empirically verifying the old 
saying, “misery loves company,” Schachter 
further investigated this phenomenon and 
concluded, “misery doesn’t love just any kind | 
of company, it loves only miserable company, 
[p: 24].” He was thus led to hypothesize: 
“Whatever the needs aroused by manipula- 
tion of anxiety it would seem that their satis- 
faction demands the presence of others in a 
similar situation [p. 24].” However, accord- 
ing to the findings of this study, stress re- 
sponses will be reduced in people if they are 
in the presence of others known to them, 
This addition and qualification corroborates 
the anecdotal reports found in the literature 
(Burlingham & Freud, 1943; Mandelbaum, 
1952; Marshall, 1951), and is also consistent 
with experimental evidence found in com- 
parative studies, Mason (1960) demonstrated 
that an important determinant of the effec- 
tiveness of social stimuli as stress reducers 
for monkeys is the familiarity of the social 
stimuli. 

Wrightsman ® using a technique similar to 
Schachter’s reported that being with other 
people had no effect on the reduction of 
stress. From his description of the study 
those who were present in the stress situation 
were not known to each other prior to the 
experiment. This strengthens the view that 
the mere presence of any person in a stressful’ 
situation is not sufficient to lessen the stress: 
The other person present has to be someone 
with whom the person had previously intel 
acted, | 

Schachter (1959) suggested five possible 
explanations for his finding that anxiety 
induces motivation to be with other people: 
escape, information seeking, and self-divet 
sion were ruled out on the basis of his find- 
ings that, “anxiety is positively related t° 
the affiliative tendency under conditions 0 
free communication . restrictive com 
munication . . . where communication © 
impossible [p. 38].” Direct anxiety reductio" 

5 Wrightsman’s (1960) results were not published 
until after the present study had been completed. 
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and self-evaluation (understanding and eval- 
uation of one’s reaction) were the other pos- 


“sibilities proposed. Furthermore, the afore- 


mentioned finding that, “misery does not 
love just any kind of company, it loves only 
miserable company,” was taken as lending 
support to the self-evaluation hypothesis. 
Under stress conditions, affiliation affords one 
the opportunity to compare and evaluate his 
state with others and thus gain a better un- 
derstanding of the state which he is experi- 
encing. a 

Expanding social comparison theory, 
“there exists in the human organism a drive 
to evaluate his opinions and abilities [Fest- 
inger, 1954, p. 117]” to include emotions, 
Gerard and Rabbie (1961) and Zimbardo 
sand Formica (1963) suggest that self-evalu- 
ation is a potent force in motivating people 
to seek to affiliate under anxiety-inducing 
experiences. Briefly, social comparison theory 
states that whenever an individual is uncer- 
tain about an emotional experience, and 
when other means of comparing certainty 
about, the experience are lacking, he will be 
motivated to seek others in order to evaluate 
his experience by comparing it with others. 
It would follow that under stress, the greater 
the certainty regarding one’s feelings, the less 
would be the desire to affiliate. 

Gerard and Rabbie (1961) demonstrated 
that under conditions of stress, individuals 
affiliate for reasons of comparison. Subjects 
Wee provided with information regarding 
the intensity of an emotion experienced by 
three others in a situation similar to theirs. 


‘Under these conditions, the desire for social 


affiliation was reduced, thus confirming the 
theory, 

Zimbardo and Formica (1963) using a 
technique similar to Schachter’s, but refining 
the possible affiliative choices available, in- 
terpreted their findings as further supporting 
the emotional comparison model. In their 


` study fearful subjects were given the choice 
. of waiting alone, waiting with others who 


had just completed the experiment (different 
emotional state), or waiting with others who 
were in the process of completing the experi- 
Ment (same emotional state). The reason 
most frequently offered by subjects to affili- 
ate was to seek comfort “the mere presence 
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of others is comforting [p. 152, italics 
added].” 

Forty-three percent of the subjects in the 
“same state” and 38% of those in the “other 
state” desired to affiliate for comfort. When 
the two groups, that is, same and other state, 
were compared, fearful individuals affiliating 
with people in a different state do so for 
catharsis (29%) and those affiliating with 
others in a similar state did so for reasons of 
emotional comparison (31%). 

Zimbardo and Formica’s finding thus sup- 
ports a model of direct anxiety reduction as 
well as an emotional comparison theory, and 
if anything would seem to favor the notion 
of direct anxiety reduction. The findings of 
the present study would also support the view 
that the desire to affiliate under stress is a 
direct result of the anxiety-reducing proper- 
ties of social stimuli. The mere presence of a 
friend, doing something quite different from 
the experimental subject significantly reduced 
stress. However, the presence of a stranger 
in the situation was not effective in reducing 
stress. Emotional comparison theory would 
seem unable to account for these findings. 

Tt is hypothesized that social stimuli act 
directly in reducing stress. The presence of 
a friend, an emotionally pleasing stimulus, 
elicits positive feelings, which tend to com- 
pete with negative feelings which are stimu- 
lated by the noxious situation, that is, stress. 
The overall effect is thus a reduction or 
minimization of stress. It is not surprising 
that strangers were relatively ineffective in 
reducing stress. Generally friends tend to 
have attributes similar to those who in the 
past have been dispensers of security, affec- 
tion, and protection. However, strangers not 
being able to, “stimulate pleasure,” so to 
speak, will not call forth competing responses 
and thus the stress will not be appreciably 
reduced or minimized. 


Personality Characteristics and Social Stimuli 


The hypothesis that individuals with 
strong affiliative motives will show a greater 
reduction in stress responses than will indi- 
viduals with weak affiliative motives in the 
presence of another person was not con- 
firmed; thus offering no support for the 
validity of the method of content analysis of 
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TAT as an indicant of n Aff. However, the 
results indicated that for people with strong 
affiliative motives, the presence of a stranger 
intensifies the stressfulness of a stress situa- 
tion; for similar people, a friend lessens the 
stressfulness of the situation. This interesting 
finding is based upon the significant second- 
order interaction and would need to be 
replicated in a less complicated design to 
determine its value. 


REFERENCES 


ALPERT, R, & Haser, R. N. Anxiety in academic 
achievement situations. Journal of Abnormal and 
Social Psychology, 1960, 61, 207-216. 

AprLEzZwEIC, N, M., & Morrer, G. Motivation and 
psychological stress. Final Report, September 
1958, Contract No. 999 (02), Group Psychology 
Branch, Office of Naval Research, 

ATKINSON, J. W. (Ed.) Motives in fantasy, action, 
and society. New York: Van Nostrand, 1958. 
Arxtnson, J. W, & Litwin, G. H. Achievement 
motive and test anxiety conceived as motive to 
approach success and motive to avoid failure, 
Journal of Abnormal and Social Psychology, 1960, 

60, 52-64, 

Boston PSYCHOPATHIC HOSPITAL. Experimental psy- 
choses. Scientific American, 1955, 192, 34-39. 
Brapy, J. V. Ulcers in “executive” monkeys. Scien- 

tific American, 1958, 199, 95-100, 

Broverman, D, M., & Lazarus, R. S. Individual 
differences in task performance under conditions 
of cognitive interference. Journal of Personality, 
1958, 26, 95-105, 

Buruincuam, D., & Frevup, A. Young children in 
war-time, London: Allen & Unwin, 1943, 

Concer, J, J., Sawrey, W. L., & Turret, E, S. An 
experimental investigation of the role of social 
experience in the production of gastric ulcers in 
hooded rats. American Psychologist, 1957, 12, 410, 
(Abstract) 

Davrrz, J. R, & Mason, D. J. Socially facilitated 
Pane ofa pene in rats. Journal of 

‘omparative ai siological 
Ree y: gi Psychology, 1955, 

Durry, ErzabeTH, The psychological significance 
of the concept of “arousal” or “activation.” 
Psychological Review, 1957, 64, 265-275, 

FEATHER, N. T. Persistence in relation to achieve- 
ment motivation, anxiety about failure, and task 
difficulty. Paper read at Midwestern Psychologi- 
cal Association, Chicago, May 1960, 

Festincer, L. A theory of social comparison proc- 
esses. Human Relations, 1954, 7, 117-140, 

Grrarp, H. B, & Rasse, J. M. Fear and social 
comparison. Journal of Abnormal and Social Psy- 
chology, 1961, 62, 586-592. 

Heyns, R. W., Verorr, J, & ATKINSON, J, W. A 
scoring manual for the affiliation motive. In J W. 


STANLEY KISSEL 


Atkinson (Ed.), Motives in fantasy, action, and 
society. New York: Van Nostrand, 1958. Pp. 205- 
218. 

Lazarus, R. S., Baker, R. W., Broverman, D, M, 

aia Maver, J. Personality and psychological stress, 
Journal of Personality, 1957, 25, 559-577. 

Lazarus, R. S., DEESE, J., & Oster, Sonta F. The 
effects of psychological stress upon performance. 
Psychological Bulletin, 1952, 49, 293-317. 

Lippett, H, Some specific factors that modify toler- 
ance for environmental stress. In H. G. Wolfe, 
S. G. Wolfe, Jr, & C. C. H: 
and bodily disease. Baltimore: Wil 
ķins, 1950, Pp. 155-171. 

Matrmo, R. B. Activation: A neuropsychological 
dimension. Psychological Review, 1959, 66, 361- 
386. 

Manpersaum, D. G. Soldier groups and Negro 
soldiers. Berkeley: Univer. California Press, 1952. 

Manoter, G., & Sarason, S. B. A study of anxiety 
and learning. Journal of Abnormal and Social 
Psychology, 1952, 47, 166-173. ‘ 

Marsnatt, S. L. A. Men against fire. Washington, 
D. C.: Combat Forces Press, 1951. 

Mason, W. A. Socially mediated reduction on emo- 
tional responses of young rhesus monkeys. Jour- 
nal of Abnormal and Social Psychology, 1960, 
60, 100-105. 

McNemar, Q. Psychological statistics. New York: 
Wiley, 1955, 

Sarason, I. G. Intellectual and personality” corre- 
lates of test anxiety. Journal of Abnormal and 
Social Psychology, 1959, 59, 272-275. 

Semacuter, S. The psychology of affiliation. Stan- 
ford: Stanford Univer. Press, 1959. 

Scutosserc, R. Three dimensions of 
Psychological Review, 1954, 61, 81-88. 

Smits, C. P, & Ferp, Smera, How to learn the 
method of content analysis for n Achievement, ™ 
Affiliation, and n Power. In J. W. Atkinson (Ed.); 
Motives in fantasy, action, and society. New 
York: Van Nostrand, 1958. Pp. 685-735. 

Srerser, Z. The test anxiety questionnaire scoring 
norms for a noncollege population. Journal of 
Abnormal and Social Psychology, 1939, 58, 12% 
131, 

Vocer, W., Baker, R. W, & Lazarus, R. S. The 
role of motivation in psychological stress. Journal 
of Abnormal and Social Psychology, 1958, 56 
105-112, 

WoopwortH, R. S., & ScmLossrro, H. Experimental 
psychology, New York: Holt, 1954. p 
WRriīcHTsman, L. S, Jr. Effects of waiting with 
others on changes in level of felt anxiety. Jom 
nal of Abnormal and Social Psychology, 1960, 
61, 216-221. y 
ZımtearDo, P, & Formica, R. Emotional compari- 
son and self esteem as determinants of affiliation: 

Journal of Personality, 1963, 31, 141-162. 


ams & Wil- 


emotion. 


(Received April 2, 1964) 


re (Eds.), Life stress ` 


EE Sa E a a 


of Personality and Social Psychology 
Vol. 2, No. 3, 385-396 


measurement instruments. 


Considerable recent literature (Barron, 
53; Berkowitz, 1957; Bieri, 1955; Ehart, 
57; Higgins, 1959; Leventhal, 1957; 


jindy & Berkowitz, 1957; Mayo & Crockett, 
1964; Plotnick, 1961; Scott, 1962, 1963; 
Sechrest & Jackson, 1961; Ware, 1958; 
jonc, 1960) has dealt with cognitive com- 
lexity as a variable which influences people’s 
ceptions and evaluations of events. Al- 
ugh various writers have given somewhat 
ifferent meanings to the concept of cognitive 
omplexity, it has generally been postulated 
that some persons are prone to employ few 
mensions when they perceive and evaluate 
Stimuli, or are inclined to make only very 
s discriminations among dimensions of 
leaning. Other persons are believed to em- 
y many dimensions and/or to make fine 
discriminations along the dimensions they 
ploy. 

In many of these investigations cognitive 
Complexity has been conceived as a disposi- 
tion to view the person-objects in one’s social 
vironment in a complex or differentiated 
hion, Other writers have implied that 
gnitive complexity-simplicity is a quite gen- 
eral trait pervading all realms of cognitive 
unctioning, Finally, some writers (Scott, 
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_ GENERALITY OF COGNITIVE COMPLEXITY-SIMPLICITY 
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An investigation of the generality of cognitive complexity was conducted using 
113 male Ss. Ss responded to a battery of instruments several of which were 
designed explicitly to appraise cognitive complexity-simplicity and several others 
which could be construed as measures of such a variable. A factor analysis re- 
vealed that no single, unitary dimension could be held accountable for the test 
intercorrelations. Instead, several factors were extracted, no one of which was 
especially large in terms of the proportion of variance it extracted. The results 
suggest that cognitive complexity may consist of a number of distinct, possibly 
independent, tendencies not all of which are educed by any of the present 


1963; Zajonc, 1960) have viewed complexity 
as a somewhat less enduring state applying 
only to a particular cognitive domain that 
has been made salient. 

In other studies instruments have been 
developed which appear to assess behaviors 
similar to those evoked by tests of complex- 
ity. Steiner (1954) developed a trait discrep- 
ancy scale in order to measure the degree to 
which subjects were willing to acknowledge 
the possibility that unequally “good” traits 
coexist in the same individual. Steiner and 
McDiarmid (1957) demonstrated that the 
Assumed Similarity between opposites (ASo) 
score of Fiedler (1955) was jointly deter- 
mined by two empirically independent com- 
ponents, an elevation difference component 
and a profile dissimilarity component. A high 
score on the elevation difference component, 
because it reflects a tendency to dichotomize 
one’s interpersonal environment into “good” 
and “bad” persons would seem to indicate a 
cognitively simple outlook. A high score on 
the profile dissimilarity component would 
appear to reflect a complex outlook since it 
indicates a willingness to regard others as the 
possessors of a mixture of good and bad 
traits, Harvey, Hunt, and Schroder (1961) 
have formulated a theory of personality 
structure which postulates a continuum of 
conceptual functioning from very simple 
(“concrete”) to very complex (“abstract”). 
The instrument with which such functioning 
is assessed is a sentence completion test 
(Schroder & Streufert, 1962). This test is 
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particularly adapted to the assessment of 
conceptual structure in regard to interper- 
sonal stimuli. Most recently, authoritarian- 
ism has been viewed as a possible index of 
breadth of perspective (Kelman & Barclay, 
1963). These authors propose an interpreta- 
tion of F Scale performance as indicating the 
individual’s capacity to shift contexts and ac- 
cept differences; to be aware of a range of 
customs, values, and approaches to life, and 
to expect differences between people and be 
tolerant of them, Kelman and Barclay’s in- 
terpretation, especially as regards “expecting 
differences” and “being aware of a range 
of customs,” seems to bear considerable re- 
semblance to the definitions of cognitive 
complexity set forth by other investigators. 
Two previous attempts to assess the gen- 
erality of cogntive complexity have been 
made. A study by Bieri and Blacker (1956) 
investigated the generality of cognitive com- 
plexity across personal as well as nonpersonal 
situations, Significant relationships were found 
between complexity as measured by the Role 
Construct Repertory (RCR) Test devised by 
Kelly (1955) and measures of Rorschach- 
determinant complexity and Rorschach- 
content complexity. These findings suggested 
that relatively consistent, enduring modes of 
cognitive schematization characterize the 
individual’s behavior across situations, 
Allard and Carlson (1963) designed three 
tests of cognitive complexity, all using the 
RCR Test format. In one test the objects 
about which constructs were generated were 
personal friends, in the second they were 
famous people, and in the third the objects 
were geometric designs, The intercorrelations 
of the three tests were between .57 and .67, 
These data also lend some support to the 
generality ofa complexity-simplicity variable 
ìn cognitive functioning. Unfortunately, 
neither of these studies encompassed a very 
wide variety of methods for assessing com- 
plexity, The proposition that cognitive com- 
plexity is a unitary, consistent mode of 
cognitive functioning remains in question, 
Gardner and Schoen (1962) assert that the 
term cognitive complexity seems grossly over- 
general, There is ample evidence, according 
to these authors, “that most persons are rela- 
tively ‘complex’ in some areas of cognition; 
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relatively ‘simple’ in others.” Scott (1963) 
contends that as yet there is little evidar 
for a unitary trait of complexity-simplicity 
pervading all areas of the individual’s cogni- 
tive functioning. The degree of complexity- 
simplicity probably varies over different 
cognitive domains depending upon the 
amount and kind of knowledge the individual 
possesses, and upon the kinds of functional | 
demands with which the domain is confronted 
in daily life, 

‘The latter views clearly imply that cogni- 
tive complexity is not a general personality 
trait. Even if people are predisposed to 
respond in cognitively complex or cognitively 
simple ways, their predispositions are severely 
limited ones which have application to only 
very narrow categories of situations. 

The present research investigates the inter- 
relationships of several measures which are 
presumed to reflect people’s tendencies to be 
cognitively complex or simple. The measures 
take a variety of forms and appear to sample 
complexity-simplicity as it relates to several 
different aspects of the cognitive structuring 
of one’s interpersonal environment. The tests 
were limited to this domain because most 
of the research in this area (particularly that 
emanating from the original article by Bieri, 
1955) has dealt with cognitive complexity in 
the perception of persons, even though @ 
more general trait has often been implied. 
Before proceeding to the question of whether 
complexity is a general trait which pervades 
all realms of cognitive functioning it seems 
wise to establish its generality in this oF 
important domain. Although all of the devices 
included in this research are paper-and- 
pencil instruments, they represent the major 
approaches to the measurement of cognitive 
complexity. If a factor analytic appraisal 0 
the scores yielded by these measures reveals 
a single factor which accounts for a large 
portion of the variance of scores, and if th 
content of this factor is appropriate, Wê 
may conclude that evidence in favor of # 
general personality trait, “complexity of pe™ 
Son cognition,” has been obtained. If, how 
ever, the analysis generates several factors 
Which appear to reflect different types of C08 
nitive complexity, we shall be inclined t 
conclude that cognitive complexity is not 4 
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unitary trait. Failure to obtain any factors 
which suggest cognitive complexity may 


` mean either that no such predisposition exists 


or that none of the instruments included in 
the test battery is a valid measure of it. 

In addition to measures which have been 
designed explicitly to appraise cognitive 
complexity-simplicity, the test battery in- 
cludes several other instruments which in the 
view of the present writer may reflect such 


ʻa variable. The aim of the research is to 


sample a wide variety of possible measures 
of complexity-simplicity, rather than a more 
selective sample that may on a priori grounds 
be expected to test a particular conceptuali- 
zation of complexity-simplicity. Thus the 
authors of some of the instruments employed 


` in this study may justifiably feel that the 


battery includes a number of measures with 
which their own complexity scores should 
not be expected to correlate very highly, and 
omits others with which it should correlate. 
The present emphasis on diversity of meas- 
ures reflects our desire to determine whether 
very different approaches are measuring the 
same complexity-simplicity dimension. If 
such is not the case, additional research can 
be conducted to learn which of the many 
available instruments do measure a specific 
complexity-simplicity dimension. 


METHOD 


The instruments used in the study are described 
below, Certain modifications were made in some of 
the instruments to lessen the probability that re- 
sponse sets or artifacts would affect scores appreci- 


. ably. 


Tests 


i Bieri Test of Cognitive Complexity. The test used 
in this study resembled Kelly’s (1955) RCR Test. 
However, following the precedent of Tripodi and 
Bieri (1963), constructs as well as persons were 
Specified on the grid in which the subject marked 
his responses, Tripodi and Bieri found that specify- 
ing the constructs had little effect on subjects 
scores, 

Two additional, minor, modifications were intro- 
duced in an attempt to lessen the effect of response 
Sets, The cells in each row of the grid employed by 
Bieri can be viewed as graphic rating scales, the 
ends of which are identified by bipolar adjectives. 
The bipolar adjectives appear at the right side of 
each row of the grid, separated by a dash (eg, 
responsible—irresponsible). In the original form of 
the Bieri instrument the more desirable of the two 
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adjectives always appeared on the same side of the 
dash. In the version employed in this study the side 
of the dash on which the desirable adjective appears 
is counterbalanced. It was believed that the conven- 
tional use of plus and minus responses also enhances 
the possibility of a response set, since plus marks 
themselves have a slight evaluative connotation, The 
letters “L” and “R” were substituted for plus and 
minus marks,* 

To test the effectiveness of these modifications 
both the original test (using plus and minus marks 
in the grid, and uncounterbalanced attributes) and 
the modified test (using L and R in the grid, and 
counterbalanced attributes) were administered to 
113 male undergraduate subjects 1 week apart. The 
correlation between complexity scores obtained using 
the original and modified instruments was .64. The 
correlation between number of plus marks and com- 
plexity score for the original test was —.55. The 
correlation between number of “favorable” responses 
(number of times the more desirable side of the 
attribute was assigned) and complexity score for 
the modified test was —.46. The decrease in size 
of the correlation was not significant. However, in 
both instances, the relatively high correlations indi- 
cated that the tendency to use a “favorable” re- 
sponse is negatively associated with cognitive 
complexity as measured by this test. 

Interconcept Distance Measure of Cognitive Com- 
plexity. This measure is patterned after an instru- 
ment devised by Ware (1958). Ware’s subjects rated 
diverse concepts (eg, silk, food, Army, mother, 
etc.) on 20 7-point semantic differential scales 
(Osgood, Suci, & Tannenbaum, 1957). The scales 
were correlated across concepts for each subject 
and the matrix of intercorrelations was factor ana- 
lyzed. As an indication of the “dimensionality” of 
the semantic meaning space Ware took the per- 
centage of variance extracted by the first factor. 
If the scales are highly intercorrelated the factor 
analysis yields few factors and this score is rela- 
tively high. It is low when the factor analysis 
yields several factors, indicating what Ware terms 
‘diversity of individual semantic meaning space.” 
Ware obtained other measures of “diversity,” but 
space limitations prevent our dealing with all of 
them here. 

The instrument used in the present study involved 
several modifications of the original Ware measure, 
both in the form of the semantic differential which 
was employed and in the scoring of responses. Only 
personal concepts (eg brother, disliked co-worker, 
teacher, etc.) were used. An attempt was made to 
choose concepts that would comprise a representa- 
tive sample of persons in the individual’s social 
environment, Fourteen 7-point scales (e.g., solitary- 
sociable, logical-intuitive, etc.) were selected to 
represent several semantic dimensions applicable to 
person-objects as determined in previous research 
with the semantic differential. 

The analysis used the mode} presented by Gul- 


2This and other instruments used in the present 
study are included as appendices in Vannoy (1964). 
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liksen and Tucker (1951). It was applied using 
person-objects as points in the hypothetical multi- 
dimensional space and semantic differential scales as 
vectors. The standard deviation of the distances of 
all of the points from each other point was taken 
as an indication of “cognitive complexity.” In a 
two-dimensional case such a configuration would 
be represented by two small clusters of points sepa- 
rated by a large distance. The resulting standard 
deviation of interconcept distances would be large. 
The configuration of points for a cognitively com- 
plex individual should exhibit a more even distribu- 
tion throughout the space. There would be a greater 
degree of independence among the dimensions be- 
cause persons would be viewed as alike on one 
dimension but different on another. There would 
also be points located at intermediate positions 
along the dimensions. The more even distribution 
of points would result in a smaller standard devia- 
tion of interconcept distances. Consequently, the 
standard deviation of these distances is taken as 
an indication of a “complex” cognitive space. 
Semantic Differential Interaction Variance Measure 
of Cognitive Complexity. This measure also was 
obtained from the semantic differential responses. A 
two-way analysis of variance of each subject’s re- 
sponses was performed with concepts and scales 
serving as factors. As a measure of complexity the 
square root of the ratio of the interaction sum 
of squares to the total sum of squares was obtained. 
The square-root transformation was used in order 
to obtain a more symmetrical distribution of this 
complexity measure, and to keep the units compa- 
rable with those of other measures used in the 
study. A high score indicates the degree to which 
ratings assigned to concept-scale pairings are treated 
as separate situations by the subject and are rela- 
tively unaffected by the response the subject usually 
makes to the concept or scale. In general, a high 
score on this measure should indicate a tendency to 
be highly sensitized to differences among different 
persons, 
} Social Distance Questionnaire. The procedure used 
in constructing this scale was an extension of that 
employed by Triandis and Triandis (1960, 1962) 
and by Triandis (1961). An example of a typical 
judgment subjects were required to make is given 
in the item below: 


A Person who is uncritical of others, extremely com- 
petent in his work, conventional and congenial 
Would: sist se 


twould not 
seck the friendship of this person. 


There were 16 items or “complex stimuli” in the 
test, each of which appeared three times, each time 
with a different type of behavior (social distance 
response) specified below it, Each item was made 


up of a different combination of the two levels of 


8A complete description of the scorin: H 
d ig technique 
employed for this analysis, including formulae, aay 
be found in Vannoy (1964), 
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the four factors: conventionality, criticalness, con- 
geniality, and competence. The factors were selected 
to correspond to areas of sensitization of persons 
at different levels of conceptual functioning accord- 
ing to the theoretical framework of Harvey et al. 
(1961). The factorial design of the instrument per- 
mitted an estimation of the proportion of total 
variance in each subject’s responses attributable to 
each of the several different sources listed below: 


Social Distance A: The proportion of variance 
due to “A” main effects (Conventionality) 

Social Distance B: The proportion of variance 
due to “B” main effects (Criticalness) 

Social Distance C: The proportion of variance 
due to “C” main effects (Congeniality) 

Social Distance D: The proportion of variance 
due to “D” main effects (Competence) 

Social Distance E: The proportion of variance 
due to “E” main effects (Type of Behavior) 

Social Distance I: The proportion of variance 
due to interactions. 


Tentatively, it was hypothesized that “complex” 
persons would exhibit sensitization toward com- 
petence and that “simple” persons would exhibit 
sensitization toward conventionality, in line with 
the theory of Harvey et al. It was also believed that 
a high proportion of variance attributable to inter- 
actions between the factors would characterize @ 
cognitively complex person. As conceived here, & 
complex person, because he employs many dimen- 
sions, would be expected to utilize more than one 
factor in making a judgment. As a result, his 
willingness to accept or reject the stimulus person 
should be conditional upon the presence of various 
combinations of traits rather than upon the presence 
or absence of a single trait. The conditional nature 
of his judgments should be reflected in the inter- 
actions of the factors. 

Modified Scott Test of Cognitive Complexity. 
This test was patterned after one developed by 
Scott (1962). Because of several modifications made 
by the author, this test should be considered only 
as one resembling the Scott test and not in any 
way identical to it. In this test the subject was 
presented with the titles of 12 significant persons 
to be placed in groups that were “similar in some 
important respect.” The subject was instructed to 
form as many groups as he thought were needed 
and to continue doing so until he had exhausted 
what seemed to him to be the important bases 
for grouping. The distribution of scores obtained bY 
use of the H statistic (Attneave, 1959) was highly | 
negatively skewed. The H score is at a maximum í 
when the subject groups persons in such a way that 
no two persons ever appear in the same combina- 
tion of groups. This, in fact, did occur with the 
present sample in a large proportion of the cases- 
As a substitute for H the actual number of group 
formed by each individual was taken as the sub- 
ject’s score. This score is roughly comparable to th¢ 
Scores used in studies of equivalence range (Gardnety 
1953) in which the subject’s score is the number © 


i 


| 
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groups he requires to categorize a heterogeneous 
array of objects. 

Elevation Difference Component of the ASo 
score, The method for obtaining this score is de- 
scribed in Steiner and McDiarmid (1957). It is the 
difference between the mean value of the ratings 
assigned to a “good” co-worker and the mean value 
of the ratings assigned to a “bad” co-worker on a 
series of 8-point semantic differential scales. It 
reflects the extent of the individual’s tendency to 
regard one co-worker as endowed with more 
desirable traits than the other. 

Profile Dissimilarity Component of the ASo score. 
Steiner and McDiarmid view this component of 
the ASo score as one which reflects the indi- 
vidual’s tendency to regard his two co-workers as 
possessing different combinations of good and bad 
traits, The square root of the difference obtained 
by subtracting the subject’s squared elevation dif- 
ference score from his ASo/N score is his profile 
dissimilarity score, 

Intolerance of Trait Inconsistency. This test was 
developed by Steiner (1954). A more recent publica- 
tion dealing with a revised form of the instrument 
is Steiner and Johnson (1963). Each item in the 
15-item test consists of two pairs of traits. One 
pair consists of traits which are unequally good 
(as determined by the judgments of a control group 
in an earlier study). The other pair consists of 
traits judged to be equally good. Subjects are asked 
to décide which of the two pairs of traits are more 
likely to occur together in people. Scores are ob- 
tained by counting the number of times the subject 
indicates that the more equally good pair of traits 
has the higher probability of co-occurrence in people. 
_Authoritarianism. Ten statements from the posi- 
tive F Scale (Adorno, Frenkel-Brunswik, Levinson, 
& Sanford, 1950) were included in this question- 
naire along with 10 negative F statements, 5 from 
Couch and Keniston (1960) and 5 from Triandis 
and Triandis (1962). Each item was answered ac- 
cording to a strongly agree (+3), moderately agree 
(+2), etc., to strongly disagree (—3) set of instruc- 


i tions, Scoring was accomplished in a manner sug- 


gested by Triandis and Triandis (1962). P is the 
percentage of times the positive F statements are 
agreed with and Q is the percentage of times the 
negative F statements are agreed with. Subtracting 
` from P gives a measure of authoritarianism that 
IS independent of response set due to acquiescence. 

he P-Q score was obtained for each subject. 1.00 
Was added to each score to eliminate negative 
numbers, 

Intolerance of Ambiguity. This scale was devel- 
Oped by Budner (1962). It contains eight positively 
stated and eight negatively stated Likert-type items 
to which the subject is asked to respond with +3 
(Strongly agree), +2 (moderately agree), etc. Unlike 
i, Scale used in this study this test was scored 

e conventional manner, On the positive items 
ñ score of 7 was assigned for strong agreement, 6 
ta Moderate agreement, and so on. The negative 
fs ms were scored in the reverse direction. According 
the scale’s author a high score indicates a 
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tendency to perceive ambiguous situations as sources 
of threat. 

Category Width. A shortened version of the Cate- 
gory Width Scale constructed by Pettigrew (1958) 
was used, It consists of the 10 items which corre- 
lated most highly with laboratory tests of category 
width in the study by Pettigrew. These are Items 
1, 2, 5, 6, 8, 13, 14, 15, 16, and 19. 

Independence of Judgment. In a study by Barron 
and Asch reported in Barron (1953) the mean dif- 
ference in complexity score (as measured by the 
Barron-Welsh Art scale) between independents 
(those whose yielding scores in the Asch situation 
were in the lowest quartile) and yielders (those 
whose yielding scores were in the highest quartile) 
was significant at the .05 level or better. Independ- 
ents scored higher in complexity, In a separate 
study, Asch and Barron collaborated in the con- 
struction of an 86-item attitude questionnaire to 
test hypotheses concerning differences between inde- 
pendents and yielders. The scale used in this study 
consists of 22 of these items which showed differ- 
ences significant at the .05 level between the highest 
27% of scorers and the lowest 27% of scorers on 
the Barron-Welsh Art scale. Because of the positive 
relationship of independence of judgment to the 
Barron measure of complexity the former measure 
was included in the present study in order to 
evaluate its relationship to other measures of com- 
plexity. Although it would also have been desirable 
to include the Barron-Welsh Art scale measure of 
complexity, the time required for administration of 
the scale precluded its use. 

Sentence Completion Test of Level of Integrative 
Complexity. In this test the subject is presented 
with a selected set of stems (eg., When I am in 
doubt ...; Rules... ; etc.). He is asked to use 
each stem as a basis for completing one sentence 
and to write at least one additional sentence. The 
stems used in the test were selected primarily to 
assess the complexity of conceptual structure re- 
lating to interpersonal stimuli. Each protocol is 
scored according to the level of cognitive structuring 
it reflects.t The specific rules for rating personality 
structure on the basis of completed stems are de- 
scribed in Schroder and Streufert (1962, pp. 10-19). 

Cooperative School and College Ability Test 
(SCAT). This test was administered to all the 
students in the sample at the time they enrolled 
at the University. While scores on verbal ability 
were of special interest in this study, the quantita- 
tive scores were included to investigate their 
empirical relationship to other scores, 


Administration of the Tests 


‘The tests described above were administered in 
two 2-hour testing sessions separated by 1 week. 
The sample consisted of 113 male subjects recruited 
from introductory psychology classes during the fall 


47The author is indebted to H. M. Schroder, 
Princeton University, who arranged for scoring of 
the protocols by trained personnel under his 


direction. 
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TABLE 1 
Means AND STANDARD DEVIATIONS OF THE 
20 VARIABLES 
Variable M SD 
1. Bieri Test of Cognitive Complexity | 38.74 | 9.03 
2. Semantic Differential Interaction 
Variance Measure of Cognitive 
Complexity -80 | .08 
3. Interconcept Distance Measure of 
Cognitive Complexity -18| .05 
4. Social Distance A: Conventionality -03 | .06 
5. Social Distance B : Criticalness 08h 
6. Social Distance C: Congeniality 14| 14 
7. Social Distance D: Competence 23} 19 
8. Social Distance E: Type of 
Behavior 02} .05 
9. Social Distance T: Interaction AL] 41 
10. Number of Grow s—Modified 
Scott Test of ‘ognitive 
Complexity 8.32 | 3.44 
11. Elevation Difference Component of 
the ASo score 2.20 | 1.44 
12. Profile Dissimilarity Component of 
the ASo score 1.98 | .65 
13. Intolerance of Trait Inconsistency | 8.07 2.15 
14. Authoritarianism 88] .31 
15. Intolerance of Ambiguity 47.91 | 9.85 
16. Category Width 36.50 | 7.33 
17. Independence of Judgment 10.72 | 3.14 
18. Schroder Sentence Completion 
Test of Level of Integrative 
Complexity 4.19 | 1.89 
19. Cooperative School and College 
Aptitude Test : Verbal 40.40 | 9.79 
20. Cooperative School and College 
Aptitude Test: Quantitative 41.35 | 5.30 


Semester of 1963 at the University of Illinois. Most 


of the subjects required about 1,5 hours to complete 
each session, 


ANALYSIS AND REsutts 


Product-moment 
puted between 


Table 1 shows the means and standard 
deviations of the various measures, Intercor- 
relations between the measures are reported 
in Table 2, The factor loadings of the eight 
rotated factors appear in Table 3, The un- 

5 Computational facilities of the IBM 7094-140) 
system at the University of Illinois (partially sup- 


Ported by Grant NSFGP 700 from the National 
Science Foundation) were used in this study. 
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rotated factor loadings and percentage of 
common and total variance accounted for by 
the unrotated factors appear in Table 4, 
The six largest factors were considered in- 
terpretable, Interpretation was based on an 
examination of tests having rotated loadings 
above .30. Only very tenuous interpretations 
of Factors VII and VIII can be made. A 
discussion of each of the factors follows, 


Factor I 

; Test Loading 
Independence of Judgment =i 
School and College Aptitude Test: Verbal 64 
Authoritarianism —.57 
Intolerance of Ambiguity —.68 


Factor I, the largest factor extracted, ac- 
Counted for 24.3% of the common variance 
extracted by the eight factors. An examina- 
tion of the items of the tests which loaded 
highly on this factor Suggests a person at 
one extreme who requires a highly ordered 
existence where uncertainty is at a minimum. 
He must interpret events unequivocally be- 
cause, as Budner (1962) has noted, situa- 
tions characterized by novelty, complexity, 
or insolubility are threatening to him. Pos- 
sessing, as he Presumably does, simpler 
verbal Concepts, such a person forces experi- 
ence into preordained Categories, permitting 
few alternatives and a limited perspective. 
At the opposite extreme is found the verbally 
bright person who maintains a highly dif- 
ferentiated Perspective. The factor suggests 
that possession of a more complex verbal 
apparatus is conducive to, or at least corre- 
lated with, a more varied and possibly more 
equivocal interpretation of experience. 


Factor J1 
Test Loading 
Interconcept Distance 70 
Social Distance B : Criticalness 62 
Bieri Test of Cognitive Complexity — Al 
School and College Aptitude Test : 
Quantitative —.33 


This factor accounted for 19.6% of the 
Common variance. The factor loadings have 
been reflected to facilitate interpretation. 
While the Sentence Completion Test of In- 
tegrative Complexity (Schroder & Streufert, 


TABLE 2 
INTERCORRELATIONS OF THE 20 MEASURES 


(53) 


Note.—Communalities appear in parentheses, 
“See Table 1 for description of measures, 


TONALSNOD ALIIVNOSaGg V SY ALIXATANOD TAILIN900 


Toe 
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TABLE 3 \ 
ROTATED FACTOR MATRIX 3 
Factor d 
Measures iA £5 — =e af 

i II ul IV v VI VII VII 
1 10 AT 18 09 —.34 —.37 05 08 
2 03 30 54 —.16 .09 —.15 .07 —.04 
3. —15 —.70 at —.08 —.05 13 —.ll —.14 
4. 13 epee) 11 —12 05 —.14 .06 19 
5 —.10 —.62 01 .03 —.01 —.08 —.09 06 
6. 01 AS —.58 0 —.04 01 Al 01 
i 01 14 06 —.08 AT 04 .03 12 
8. 11 -02 —.08 61 —.14 08 —.06 04 
9. —.12 .00 —.07 4 07 —.03 —.07 05 
10. 06 .02 —.10 -11 —.25 05 —.35 09 
11 07 —.18 02 08 53 01 .08 —.21 
12. 11 16 46 02 00 05 2 | no 
13, —.04 00 —.01 04 02 56 .00 OL 
14, —.57 —.08 —.10 —.14 —.02 27 05 09 
15. —.68 —.01 —.02 .09 04 —.10 —.07 —.06 
16. 20 20 01 —.07 02 —.01 48 —.08 
17. 72 18 13 = 12; 08 06 02 12 
18. Al «00 —.09 12 —.04 02 —.10 45 
19. 64 12 = 08 01 —.15 16 17 
20. 07 33 —.05 —.06 —.06 11 35 28 

"See Table 1 for description of measures. 

s 
TABLE 4 


UNROTATED FACTOR MATRIX AND PERCENTAGE OF COMMON AND TOTAL VARIANCE ACCOUNTED 
FOR BY UNROTATED FACTORS 


Factor 
Measures 
I Il m IV v VI VII VIII 
1 A o | —46 | =29 | +06 |) =.10 04 02 
2 Ea ASA S EN Ei 08 10 —.05 00 
3: 61 13 37 =11 | —.14 00 12 09 
4, 27 |. = 09 00 O etetiog E oa 117 09 
5 43 13 28 | 24 | 0a | 106. | — 04 25 
6 01 yeni) Lae at Sirah oreo. cl!) ‘27 06 | —10 
7 SA SEO 315 24 22 | —01 | —31 T —.03 
8 —01 Ado tl oe ET 33 18 14 06 
9. 14 Ob saad SI eg 42 04 | —.03 08 
10 06 O E1 ESET LETA HoR o | Sua 
1 08 | —09 30 13 42 | —13 | —.03 | —.04 
12 23 E30 04 | —09 13 20 19 11 
18, 13 ‘05 06 32 ‘00 43 09 | —.06 
4 48 | —23 | —19 LE EES Moen 05 09 
15, 49 | —27' | ='36 | —o1 Hae’ —.08 02 
16 —37 | -12 04 25 ‘01 NR 26 14 
17 — <65 Te 35 100 | —06 da 07 || =.08 
18 z8 29 | —09 102 olc 107 AA 13 
R 3 37 17 02 00 | —13 | —01 09 
=335 |) 04] iio 34 | —.12 T E 21 
Percentage of common 
variance accounted for | 34. 5. 5 
ee eet 4 16.8 15.0 10.5 8.3 6.4 5.1 3.5 
variance accounted for | 13.2 64 5.7 4.0 3.2 2.5 2.0 1.3 


* See Table 1 for description of measures, 
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1962) is not represented on this factor, the 
pattern of loadings provides some support for 
a portion of the theory (Harvey et al., 1961) 
upon which it is based. Sensitivity to critical- 
ness or resistance to control from external 
sources is indicative of arrestation at the 
next to lowest level of conceptual develop- 
ment, System II. According to Harvey et al. 
there is a rudimentary differentiation of self 
at this level but the environmental field con- 


‘sisting of others is “bifurcated” and “cate- 


. 


gorical,” that is, highly polarized. The inter- 
concept distance measure indicates such a 
tendency, hence its loading with the B com- 
ponent of the Social Distance instrument on 
one pole of this factor supports this view. 


The negative loading of the Bieri Test of 


Cognitive Complexity on the opposite pole 
attests to the corresponding lack of differenti- 
ation of the social environment. The loading 
of quantative ability suggests that the ca- 
pacity to deal with mathematical concepts 
may be associated to some degree with the 
propensity to utilize multiple constructs in 
diffetentiating the social environment. 


Factor III 

Test Loading 
Semantic Differential Interaction Variance 54 
Profile Dissimilarity 46 
Social Distance C: Congeniality — 58 


Factor III accounted for 12.6% of the 
common variance. An appropriate label for 
this factor would appear to be “awareness of 


' individual differences.” The high loadings of 


Profile dissimilarity and semantic differential 
interaction variance are both manifestations 
of the tendency to regard persons as quite 
different from one another. This factor sug- 
gests a person who is apparently unwilling to 
make judgments about persons on the basis 
of only limited information. The high loading 


` of the Social Distance C factor suggests that 


Persons who are willing to make such judg- 
ments (ie., to judge others primarily on the 
basis of their congeniality) have limited 
awareness of the range of individual differ- 
€nces. Their emphasis on a single trait per- 
haps leads them to ignore or to minimize 
the importance of other dimensions. 
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Factor IV 
Test Loading 
Social Distance E: Type of Behavior 61 
Social Distance I: Interaction 54 


Most of the variance attributable to inter- 
actions in each of the subject’s social dis- 
tance responses appeared to have involved 
interactions of the E factor (Type of Be- 
havior) with others. The correlation between 
the proportion of variance accounted for by 
the E factor and the proportion of variance 
accounted for by Interactions was .40. Nei- 
ther of these measures was highly correlated 
with any other measures in the study. Factor 
IV may be regarded as having been “cre- 
ated” by the relatively high correlation be- 
tween these two measures, 


Factor V 
Test Loading 
Elevation Difference Component 
of the ASo score 53 
Social Distance D: Competence AT 
—34 


Bieri Test of Cognitive Complexity 


This factor accounts for 9.6% of the com- 
mon variance. An appropriate label for this 
factor would appear to be “task-oriented 
interpersonal attitudes.” It resembles quite 
closely the trait syndrome postulated by 
Fiedler (1958) as “leadership effectiveness 
traits.’ The person who scores highly on 
this factor is extremely task oriented, thus 
the inordinate concern with competence in 
others, He also perceives large differences 
between his most and least preferred co- 
workers, The negative loading of the Bieri 
scale indicates again that a somewhat cate- 
gorical overconcern with one particular at- 
tribute is associated with a simple cognitive 
structure. In this instance it is apparently 
the dimension of competence that dominates 
the individual’s judgments. 


Factor VI 
Test Loading 
Intolerance of Trait Inconsistency p 


Bieri Test of Cognitive Complexity 


Factor VI accounted for 8.6% of the com- 
mon variance. The pattern of loadings on 
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this factor suggests “black versus white” 
categorization. A high intolerance of trait 
inconsistency indicates a propensity to view 
persons as being either thoroughly good or 
thoroughly bad. As was suggested earlier, a 
low complexity score on the Bieri instru- 
ment may indicate a tendency to indiscrim- 
inately assign favorable traits to liked per- 
sons and unfavorable traits to disliked per- 
sons, One of the distinguishing character- 
istics of the cognitively simple person, it ap- 
pears, is a predisposition to divide his social 
world into opposing camps. 


Factor VII 
Test Loading 
Category Width 48 
School and College Aptitude Test: 
Quantitative 35 


Number of Groups Formed in the Modified 


Scott Test of Cognitive Complexity = 35 


This factor accounted for 8.1% of the 
common variance, Although it is difficult to 
interpret this factor, it appears that the 
Category Width and Scott tests define a 
continuum of broad versus narrow categori- 
zation. Broad categorizers, it appears, require 
only a few categories to characterize diverse 
kinds of personalities. Conversely, narrow 
categorizers require many groups for such 
characterization. The loading of quantitative 
ability on the SCAT test suggests that broad 
categorizers possess mathematical skills su- 
perior to those of narrow categorizers. 


Factor VIII 


Test Loading 
Schroder Sentence Completion Test of 
Level of Integrative Complexity AS 


This factor accounted for 6.3% of the 
common variance. The relatively low corre- 
lation of the Schroder measure with others in 
the matrix resulted in its being represented 
on a factor by itself. That other measures 
of cognitive complexity do not appear on this 
factor may be due to the fact that they are 
largely, measures of what Schroder and 
Streufert (1962, p. 2) term dimensional com- 
plexity. Dimensional complexity, according 
to these authors, is a necessary but not suf- 
ficient condition of integrative complexity. 
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In other words, it is possible for cogni 
structure to be highly dimensionalized 
differentiated and yet remain unintegrated 
To the extent that there exist highly dim 
sionalized but unintegrated cognitive str 
tures, the relationship between this meas 
and measures of dimensional complexity 
necessarily be attenuated. 


DISCUSSION 


The results of this study suggest that cog 
nitive complexity is not as general a trait 
has sometimes been implied in the literatu 
The factor analysis did not yield a large fi 
factor on which all of the tests or even 
large proportion of them were substanti; 
loaded. The view of Gardner and Sch 
(1962) and Scott (1963) that no sin 
principle can account for complexity-simp! 
ity fits the present data more closely 
the view of those who would postulate a uni 
tary trait. It must be emphasized again, how: 
ever, that these results pertain to only oni 
area of research on cognitive complexity+-the 
way in which the individual construes person: 
objects in his environment. i 

It is entirely possible that a quite different 
factor structure would have emerged had 
sample been heterogeneous with respect 
sex or made up of females alone. The prese 
data do not, of course, provide any informa 
tion regarding possible differences in cogni 
tive structure between the sexes. To the 
author’s knowledge, sex differences have been 
found in responses to only one of the mi 
ures used in the present study, the catego 
width scale, Females were reported to hi 
narrower category width than males (Pe 
grew, 1958). į 

An examination of the factors sugges 
three broad classes of behavioral tendencie 
in which only a part of the total realm 
behaviors was included. First, there appea 
to be a tendency to emphasize one or a Ve 
few judgmental variables (e.g., competence 
“goodness,” congeniality) to the exclusion © 
others. Opposed to it is a tendency to 
sensitive to many variables. Since differen 
Persons emphasize different variables, 
single factor is sufficient to represent 
distinction. 4 
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The second important tendency which is 
< highlighted by our data concerns the manner 
in which people use judgmental variables. 
Some persons assign people to only two or 
three positions on such variables, whereas 
others make much finer distinctions. Factor 
II appears to reflect this difference rather 
clearly. Finally, it appears possible that there 
is a tendency for certain persons to maintain 
* a narrow perspective which permits a highly 
ordered view of the world. Factor I is in- 
sterpretable in this light. 
Although each of these three behavioral 
tendencies may be viewed as an aspect of 


; cognitive complexity-simplicity, it is impor- 


tant to remember that they are manifested 


. „by different factors. A person who is cog- 


nitively simple according to one of these 
three criteria might be cognitively complex 
on one or both of the other two. This study 
has yielded little support for the view that 
there exists a unitary cluster of traits such as 
would fit the current conceptualization of 
cognitive complexity. 

A "separate study involving three of the 
present measures employed the Campbell and 
Fiske (1959) method of convergent-discrimi- 
nant validation.® Inasmuch as the valida- 
tional criteria were not met, the data of that 
analysis also support the conclusion that 
what has been termed cognitive complexity 
may consist of several relatively independent 
conceptual dispositions. The present data do 
not, of course, permit us to reject the possi- 
bility that none of the tests is a relevant 
indication of the construct. One remaining 
explanation for the relatively low degree of 
test intercorrelation is that the tests are 
Unreliable. A thorough examination of the 
reliabilities of the various instruments was 
not feasible in the present study. While pre- 
vious studies indicate that the reliabilities of 
Most of the tests are fairly high (.70 or 

: above) the reliabilities of several of the re- 
, cently developed tests of cognitive complex- 
ity are not known. 

The high loadings of three of the com- 
Ponents of the Social Distance Questionnaire 
on three of the factors suggests an additional 


ue of this analysis are presented in Vannoy 


ee 
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predilection which may permeate, to some 
extent, the tendencies discussed above. Plac- 
ing undue emphasis on any single dimension, 
it appears, is tantamount to not differentiat- 
ing to a very great degree among the remain- 
ing dimensions. Stated another way, the per- 
son who is sensitve to many judgmental vari- 
ables may assign approximately equal degrees 
of importance to each. 

Most of the measures used in this study 
appear to reflect only one or two of the 
classes of behavioral tendencies discussed 
above. Future research should continue work 
on a battery of instruments in order to de- 
termine whether the differentiation of various 
cognitive domains actually consists of several 
distinct, relatively independent judgmental 
tendencies. Since different tendencies ought 
to have different behavioral consequences, 
observation of differences between the be- 
haviors of persons who differ in their re- 
sponses to test instruments would increase 
our understanding of this aspect of cognitive 
processes. Measures which assess differentia- 
tion of the cognitive domain consisting of 
persons in the environment might be related 
to measures of conceptual differentiation in 
general.. The type of behavior exhibited in 
tests of cognitive complexity may be, as 
Gardner and Schoen (1962) attest, simply 
one aspect of a more general characteristic 
of human thought usually referred to as con- 
cept formation. 
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SOME FACTORS INFLUENCING THE ASSIMILATION 
s OF DISPARATE INFORMATION IN 
IMPRESSION FORMATION* 
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The assimilation of disparate information into a unified impression of another 
person was studied by varying 5 factors: cognitive complexity, 


sex, order of 


presentation, presenting the information in univalent blocks or with alternating 


valence, and requiring or not requiring S to write an impression after receiving 


4 of the information. The presentation of information in univalent blocks and 


with an intervening impression 
cency effects. Among male Ss, those 


led to univalent impressions which showed re- 
high in complexity showed less univalence 


than lows in their final impressions; among female Ss this relation did not hold, 
and was sometimes reversed. An explanation of these sex differences is advanced 
in terms of a possible contradiction between the Ss’ values and the stimulus 


material. 


The present investigation is one of a series 
of studies of the effects of cognitive com- 
plexity, and of variations in the presentation 
of stimulus information, upon whether or not 
subjects will reconcile traits of opposite 
valense in the impressions that they form of 
others, 


It has long been recognized that many 


= subjects find it difficult to integrate two sets 


of information, opposite in valence, into a 
coherent and multivalent impression of the 
other person (Asch, 1946; Haire & Grunes, 
1950), That most subjects do form univalent 
impressions from such information has led to 
a group of investigations of the effects of the 
order of presentation of information upon the 
„valence of subjects? impressions. In such 
studies, the information is presented in two 
univalent blocks; the valence of subjects’ 
final impressions, tells whether the impression 
was dominated by the first or the second 
block of information, The evidence indicates 
that either primacy or recency effects may be 
Produced, depending upon how often and at 


1 Hy . : 
This paper is an adaptation and extension of a 


` dissertation by the senior author submitted to the 


ame of Psychology, Clark University, in 
a ie fulfillment of the requirements for the degree 
Nien of Philosophy. The research was sup- 
Thetit in part by Grant M1808 from the National 
cae of Mental Health, United States Public 
Allen oe Grateful appreciation is due to 
mental E assistance in recruiting experi- 


what points the subjects are required to re- 
cord their impressions of the other person. If 
the two blocks of information are presented 
sequentially, with no intervening impression 
obtained from the subjects, then primacy 
effects are found (Anderson & Barrios, 1961; 
Asch, 1946; Luchins, 1957). On the other 
hand, if the subjects hear one block of in- 
formation, record their impressions, then hear 
the disparate information in a separate block, 
marked recency effects are found (Luchins, 
1958). 

In two recent investigations, the complex- 
ity of the subjects’ cognitions about other 
people has been used to account for indi- 
vidual differences in the formation of multi- 
valent impressions from disparate informa- 
tion. Thus, Nidorf (1961) showed that sub- 
jects high in cognitive complexity were bet- 
ter able than those low in complexity to as- 
similate information opposite in valence into 
a unified, multivalent, impression of another 
person. Mayo and Crockett’ (1964) have 
shown that in Luchins’ second condition— 
when an intervening impression is recorded— 
recency effects vary with cognitive complex- 
ity: subjects high in complexity gave multi- 
valent final impressions, those low in com- 
plexity showed marked recency effects. 

In the present study, two experimental 
variations in the presentation of information 
were introduced: the material was either 
presented in two univalent blocks, or by 
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alternating positive and negative items; and 
subjects were either required to record an im- 
pression after receiving half of the informa- 
tion, or they did not do so. Assuming the 
operation of organizing processes such as 
those discussed by Luchins (1958) and by 
Asch (1952), it was expected that subjects 
who received the information in two uni- 
valent blocks would form more univalent final 
impressions than subjects who received alter- 
natingly positive and negative information. 
Whether the univalent final impressions in 
the block condition would reflect recency or 
primacy effects was expected to depend upon 
whether or not subjects recorded an impres- 
sion after receiving half of the information. 

Crockett (1965) has presented the hypoth- 
esis that subjects high in cognitive complex- 
ity should be better able than those low in 
complexity to accommodate both positive and 
negative attributes in their impressions of 
another person. In the present experiment, 
cognitive complexity was expected to inter- 
act with the experimental variations in in- 
fluencing subjects’ final impressions; that is, 
the effects of receiving the information in two 
univalent blocks and of writing an interven- 
ing impression were expected to be shown to 
a large degree among subjects low in cogni- 
tive complexity, and to be relatively weak 
among subjects high in complexity, 


PROCEDURE 
Subjects 


Subjects were students in an introductory psy- 
chology course at Springfield College. All subjects 
were volunteer participants paid at the rate of 
$1.25 per experimental session, From an original 
sample of 200 male and 86 female students, 112 
males and 64 females were selected from the ex- 
tremes of the distribution of cognitive complexity 
Scores which will be described below2 


Measure of Cognitive Complexity 


, In this measure, the subject was first required to 
identify eight different individuals, known to him 
who fitted the eight different categories that ate 
generated by requiring half of those described to be 
older than the subject and half to be his peers, half 


2 One male subject did not appear for the experi- 
ment. In order to meet the assumptions of the 
analysis of variance, his scores on all of the ob- 
jective measures were estimated by a procedure 
Suggested in Snedecor (1956), and 1 df was sub- 
tracted from the error terms. 
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to be people he liked and half to be disliked, and 
half to be male and half to be female. The subject's ‘ 
task was to identify eight different people to fit 


scribe each individual in writing as fully as he 
could within a 3-minute time limit. 

The measure of cognitive complexity was the 
number of different interpersonal constructs that 
the subject used in these eight descriptions. The 
distributions of complexity scores were kept sepa- 
rately for men and women; subjects high and low 
in “complexity were selected relative to the distribu- 
tion of their own sex, not to the total distribution 
of cognitive complexity scores. Male subjects who 
were labeled “low” in cognitive complexity had 
complexity scores ranging from 18 constructs to 
42; those labeled “high” had scores ranging from 
46 to 87. Among female subjects, the scores in the 
low group ranged from 19 to 45; those in the high 
group from 49 to 77. 


Experimental Communication 


All subjects listened to the same tape recording 
used by Mayo and Crockett (1964). Eight speakers 
each illustrated two salient traits of a young man 
named Joe with an anecdote; of these eight speakers, 
four described positive traits and four described 
negative ones. The experimental communication 
was varied systematically in the following ways: 

Block versus alternating presentation. Half of the 
subjects heard the information in two univalent 
blocks; that is, either the four positive speakers 
preceded the four negative speakers, or vice versa. 
For the other half of the subjects a tape was pre- 
pared which inserted a negative speaker after each 
Positive speaker, and vice versa, with the order 0 
speakers in the original tape preserved. 

Order of presentation of information. For half of 
the subjects, the first speaker on the tape presented 
Positive information, for the other half the first 
speaker presented negative information. 

Writing versus not writing an intervening impres- 
sion. Half of the subjects were asked to record 
their impression of Joe after hearing four speakers 
and again after hearing all eight speakers, using the 
same measuring instruments on each occasion. The 
remaining subjects heard all eight speakers in 5 
quence before recording their impressions. 


Summary of Design 


The experiment was conducted as a 2° factorial 
design. Two of these factors—sex of subject an 
cognitive complexity—involyed differences betwee? 
subjects; the other three factors—block versus 
alternating presentation, order, and writing an i 
tervening impression—corresponded to the expe 
mental manipulations. 


Dependent Variables 


Impressions of Joe were recorded on three w 
ferent instruments presented in the following 0rd® 


these categories, to spend a few minutes mentally | 
comparing and contrasting the interpersonal char- 
acteristics of these eight people, and then to de- 
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to all subjects: a 5-minute free description; a pre- 
diction of Joe’s behavior in six hypothetical situa- 
tions, each of which involved a problem of ethical 
conflict; and a check list containing 25 pairs of 
opposing interpersonal traits. This adjective check 
list included the 8 traits presented in the tape record- 
ing and their opposites along with’ 17 additional 
pairs of traits. These two sets of traits were scored 
separately, yielding a set of scores on the stimulus- 
adjective check list and another set on the inferred- 
adjective check list. 

From the subjects’ responses to these instruments, 
two dependent variables were obtained * as follows: 
(a) The recency score, defined as the number of 
«responses to the objective measures which were 

similar in valence to the most recent set of infor- 
mation the subject received. (b) The change score 
which was computed only for those subjects who 


| wrote an impression after receiving half of the 


information as well as at the end. The score was 
obtained for each objective measure by counting the 
à number of items on which the subject’s final im- 
pression differed from his intervening impression. 
It should be noted that this score is not a “difference 
score” in the usual sense of that item; instead, it is 
the number of actual changes in responses to indi- 
vidual items from Measure 1 to Measure 2. 
Scores on both of these measures were subjected 
to standard analysis of variance. 


RESULTS 


The three different objective instruments 
(the conflict-situations questionnaire and the 
two adjective check lists) yielded very simi- 
lar results. For the sake of brevity, only re- 
sults for the 17-item inferred-adjective check 
list will be presented below. 


Recency Effects 


Table 1 summarizes the analysis of vari- 
ince of the recency scores. It is evident that 
a significant main effect obtained for the 
block versus alternating variation: subjects 
in the block condition showed greater recency 
than those in the alternating condition. The 
interaction of block presentation and record- 
ing an intervening impression, presented in 
Table 2, reveals that subjects who received 


_ the information in two univalent blocks and 


A os of two additional variables, the uni- 
an ce of the subjects’ impressions and the inte- 
Gaa $ the written impressions, were also ob- 
differ ; esults of analyses of these scores do not 
aes appreciably from the results obtained for the 
ie cy and change scores; therefore, only the latter 
pees are presented in this paper. Readers 
ee ed in results on the other variables are re- 

ed to Rosenkrantz (1961). 
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TABLE 1 
Summary OF ANALYSIS OF VARIANCE OF 
RECENCY SCORES 
Source af MS F 

Sex (S) _ 1 7.25 — 
Complexity (C) 1 .28 — 
Block versus alternating (B) | 1| 62.24 | 6.01* 
Intervening impression (I) 1 6.96 — 
Order (0) 1| 27.05 | 2.60 
SXC 1 6.64 — 
SXB 1| 59.39 | 5.70* 
SXI 4 6.50 2.10 
sxo EEE — 
CXB 1| 33.69 | 3.23 
CxI 1| 27.05 | 2.60 
cxo 1 19.78 1,90 
BXI 1 | 122.78 | 11.78* 
BXO 1 6.97 — 
Ixo Delton | a0) 
SXCXB 1| 54.24 | 5.20* 
SxOXCXB 1| 80.32 | 7.70* 
Other interactions (pooled)* | 14 | 30.23 2.16 
Error 143 | 10.42 = 


a For none of these sources of variance did the F ratio achieve 
significance. 
*p <.05. 


also wrote an intervening impression showed 
significantly greater recency than those in 
the other groups. However, contrary to ex- 
pectation, subjects in the block condition 
who did not write an intervening impression 
did not show strong primacy effects. 

The expected first-order interaction of cog- 
nitive complexity with the manner of pre- 
senting the information did not appear in 
these data. Instead, there was a significant 
interaction among cognitive complexity, sex 
of the subject, and block versus alternating 
presentation. Male subjects behaved accord- 
ing to the hypothesis: those high in complex- 
ity showed neither primacy nor recency ef- 
fects, while lows showed marked recency 


TABLE 2 


MEAN RECENCY SCORES FOR Groups WHICH DIFFERED 
IN MODE OF PRESENTATION OF INFORMATION 
AND IN WRITING AN INTERVENING 
IMPRESSION 


Mode of presentation 


Intervening 
impression 
Block Alternating 
Written 10.98 811a 
Not written 8.904 9.380 


Note.—In this and subsequent tables, cell meat with the 
same letter subscripts do not differ significantly at the ‚05 level, 
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TABLE 3 


Mean Recency Scores or Groups Wich DIFFERED 
IN COGNITIVE COMPLEXITY, SEX, AND MODE 
OF PRESENTATION OF INFORMATION 


Sex and cognitive complexity 


Mode of 


R aAA Male Female 

High Low High Low 
Block 9.57a 11.50 9.50ab 9.25ab 
Alternating] 9.21, 7.71y 8.13 uy 9.2520 


effects under block-presentation conditions 
(Table 3), For female subjects, on the other 
hand, there was no interaction between com- 
plexity and mode of presentation of the infor- 
mation. The unexpected interaction of sex 
of subject with block presentation of infor- 
mation is also evident in Table 3; the effects 
of block presentation were carried by the 
male subjects. 

The significant third-order interaction in 
the recency scores was too complex to be 
interpretable. 


Change in Impressions 


For subjects who wrote an intervening im- 
pression, the comparison of their final im- 
pression with their earlier one permitted a 
replication and extension of the study of 
Mayo and Crockett (1964). Table 4 sum- 
marizes the analysis of variance of these 
data. 

A significant main effect of mode of pres- 
entation of the information resulted from 


TABLE 4 


SUMMARY OF ANALYSIS OF VARIANCE OF 
CHANGE SCORES 


Source af MS R 
Sex (S) 1 1.94 — 
Complexity (C) 1 4.55 — 
Block versus alternating (B)| 1 | 676.55 | 68.20** 
Order (0) 1 05 = 
SXG 1 85.49 | 8.62** 
5 XB 1 15.42 1.55 
EXO EM 1 | 5585 | 5:63* 
Other interactions (pooled)s | 8 13.14 | 1.33 
Error P EEKAN e 


“For none of these 
achieve significance. 
*p <.05. 
D> <.01, 


sources of variance did the F ratio 
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the fact that subjects who heard two uni- 
valent blocks of information changed their , 
impressions a great deal, while those who 
viewed alternatingly positive and negative 
information hardly changed their impressions 
at all. 

The significant main effect of complexity 
which Mayo and Crockett reported was not 
found in these data. Instead, a Sex X Com- 
plexity interaction appeared once again: com- 
plexity was inversely related to change among 
måle subjects, but was directly related among 
female subjects (Table 5). 

The final significant source of variance in 
change scores was the interaction of the 
subject’s sex with order of presentation. Fe- 
male subjects who heard negative speakers 


TABLE 5 


Mean NUMBER OF CHANGES FROM I'ixst IMPRESSION 
TO SECOND FOR Groups WHicH DIFFERED IN 
SEX AND COMPLEXITY 


Sex 
Complexity 
Male | Female 
High 4.82ab 6.56a 
Low 5.86ab 3.509 


first changed significantly more than did | 
those who heard negative speakers first; @ 
nonsignificant difference in the opposite di- 
rection held for male subjects. 


Discussion 


Two aspects of these results merit some 
what extended discussion: the lack of pri- 
macy effects among subjects who received the 
information in univalent blocks but who did 
not write an intervening impression, and the 
pervasive interaction between cognitive com- 
plexity and the sex of the subjects. | 

The absence of the usual primacy effects 
probably reflects the forcefulness of the stim- 
ulus material that was used in this study.” 
Joe was described by one set of people wh? 
liked him and by another set who disliked 
him, and the disparity between these two sets 
of descriptions was striking. Recognition 0f 
the contradictoriness of the two blocks of i0 
formation was probably considerably mo 


4 
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compelling than would have been true, for 
instance, if the subjects had been read a list 
of personality traits. Given this extreme dis- 
parity, it was not easy for the meaning of 
items in the second block of information to 
be.assimilated to the context established by 
the first block, as Asch (1952) and Luchins 
(1958) presume to have occurred in the other 
studies, Anderson and Hubert (1963) have 
» recently argued that the usual primacy effects 
in these studies reflect not the operation of 
-general organizing processes in perception, 
but the subjects’ failure to attend carefully 
to items in the second half of the material. In 
terms of the latter interpretation, the force- 
ful quality of the descriptions in the second 
block of information may have compelled the 
‘subjects to attend to that material, thereby 
ensuring that this information was presented 
in their impressions. It should be noted, how- 
ever, that the marked recency effects in the 
block-presentation group which did write an 
intervening impression is consistent with 
Luchins’ (1958) interpretation: writing the 
intervening impression led the subjects to 
extend and consolidate their impressions; the 
disparity between the second block of infor- 
mation and the existing impressions was then 
greater among these subjects than among 
those who did not write an intervening im- 
Pression, resulting in rejection of the initial 
information and greater recency in their 
ı final impressions. 

The most striking and unexpected aspect 
of these results, however, was the fact that 
the hypothesized effects of cognitive com- 
plexity occurred only among male subjects; 
the expected relationships did not hold, and 
were sometimes reversed, among females. A 
teexamination of the data in the experiment 
by Mayo and Crockett (1964) shows not the 
slightest trace of a Sex X Complexity inter- 
action. It is important, therefore, to try to 


. Account for the unexpected interaction in the 


Present study. One plausible explanation rests 
* upon inferences about the values of the sub- 


» jects and upon a knowledge of the values that 


Were attributed to Joe in the tape recording. 

As to the values of the subjects, Arsenian 
(1943) presented evidence of the strong 
teligious-humanistic values of students at 
Springfield College a generation ago. In a 
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personal communication, Arsenian described 
recent data that show the same pattern of 
values in the present student body at Spring- 
field College. Although no objective evidence 
is available, our subjective impression is that 
students at Clark University, where the Mayo 
and Crockett experiment was conducted, do 
not hold such extreme religious and hu- 
manistic values. Thus, there is probably a 
considerable difference in the values of the 
two subject populations.. 1 

The description of Joe provides many lis- 
teners with a picture of a ribald, active, 
somewhat hedonistic and bohemian young 
man, one who might readily be rejected on 
moralistic grounds by someone with strong 
religious and social-welfare values. It seems 
plausible that, through contacts in sports and 
in similar masculine activities, men like those 
at Springfield College would be more likely 
than women at that school to have interacted 
favorably with other men like Joe. If this 
differential rate of interaction makes men less 
likely than women to reject such a person on 
moral grounds, the Sex x Complexity inter- 
action begins to make sense: among men, the 
effects of complexity were as expected; 
among women, a moralistic rejection of Joe 
may have washed out, or even reversed, the 
effects of complexity. The presumed absence 
of such rejection of Joe among female sub- 
jects at Clark University would account for 
the lack of a Sex X Complexity interaction 
in the experiment by Mayo and Crockett. In 
its general form, this explanation suggests 
that long-range motivational and valuational 
orientations of an individual may affect and 
be affected by the kinds of interpersonal 


constructs he develops. 
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tion game with 3 successive 


score p’s < .005, .06). 


While many studies have examined the 
rmance of subject pairs in conflict games, 
performance in coordination games has 
n relatively neglected. In the coordination 
subjects are required to come to se- 
ces of identical object selections, and in 
ich games it would appear that cooperation 
between the participating subjects would be 
ghly facilitatory, if not absolutely necessary, 
t coordination to take place. The inter- 
tive aspects of a game can in fact be used 
a definition of cooperation (Azrin & Linds- 
i 1956). 
When communications in the coordination 
e are restricted, the cooperative and inter- 
onal aspects become less clear. Subjects 
typically isolated from each other in 
ths and fed information concerning the 
‘tner’s behavior. Under such circumstances 
nication channels or the information 
communicated may become so restricted 
prohibit the operation of complex social 
ors. Sidowski, Wyckoff, and Tabory 
956) devised and explored a “minimal so- 
situation” which was essentially a two- 
son, two-choice game where each subject 
mtrolled the reinforcements to his partner. 
ne subjects were instructed as to the ex- 
‘stence and influence of the partner while 
thers were not. Awareness or ignorance of 
€ partner’s existence did not influence the 
This investigation was supported by Grant MH 
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PERSONALITY AND ACQUISITION FACTORS IN THE 
DEVELOPMENT OF COORDINATION STRATEGY: 


CHARLES L. FRY 


University of Virginia 


36 college Ss played 3 repetitions of a 2-person, 3-object-choice, tacit coordina- 
partners. Absolute ascendance, authoritarianism, 
and intelligence scores were unrelated to game performance. Variance analysis 
showed that S pairs with discrepant ascendance scores outperformed pairs with 
similar scores (p < .035). Interpretation suggests a struggle for dominance be- 
tween similar ascendance pairs might retard their coordination development. 
Significant chi-squares showed Ss increasingly adopted a specific sequence of 
game-play solutions, and Ss who played together increasingly developed similar 
response biases in object selections. Converging expectations of pattern and ob- 
ject selections help account for Ss’ significant improvement in performance as 
they progress from partner to partner (variance analysis of individual and pair 


subjects’ performances in spite of the fact 
that cooperation would seem advantageous 
and efforts to effect it might be anticipated in 
the informed group. 

Schelling (1960) outlined some games 
which permitted only tacit communications, 
He predicted that in such games a subject 
would be able to imagine himself in his part- 
ner’s position and anticipate his partner’s 
behavior at above chance levels in much the 
same manner as military strategists outwit 
their opponents. Solomon (1960), using a 
two-person, three-choice, tacit coordination 
game, demonstrated that pairs of schizo- 
phrenics perform less well than pairs of nor- 
mals in the game. Along with clinical the- 
orists like Bateson and Cameron, Solomon 
argues that schizophrenia is rooted in com- 
munication and social failure and this leads 
to the debilitating consequences for them in 
the game. The present study attempted to 
further demonstrate the social nature of the 
game by demonstrating relationships between 
game performance and interpersonal response 
characteristics of the participants, Three 
characteristics were explored: ascendance, 
authoritarianism, and intelligence. The experi- 
ment was explicitly designed to examine the 
influence of ascendance using the other char- 
acteristics as controls, but correlations with 
overall game performance of authoritarianism 
and intelligence were incidentally included. 

The rationale for expecting ascendance or 
authoritarianism to be related to game per- 
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formance lies in the expectation that persons 
high in these traits would be more likely to 
initiate patterns or structure in an unstruc- 
tured social situation. For the two-person, 7 
choice, tacit coordination game, coordination 
is impossible until both players adopt identi- 
cal choice patterns. Because the players are 
restricted from communicating overtly, the 
pattern to adopt must be independently ar- 
rived at by both players, or at least one 
player must infer the pattern from regulari- 
ties of past responding. An obvious strategy 
for attaining coordination would involve one 
player’s generating a selection pattern which 
his partner could then recognize and adopt. 
Such a strategy could be expected to facili- 
tate coordination. With respect to ascendance, 
two different hypotheses can be offered, both 
of which were explored in this study. First, 
subjects with high ascendance scores would 
perform better in the game situation than 
subjects with low ascendance scores. This 
would be anticipated because ascendant per- 
sons would be more likely to generate the 
needed patterns, thereby improving perform- 
ance, Second, pairs of subjects with highly 
discrepant ascendance scores would perform 
better than pairs with similar ascendance 
scores, This might be expected because simi- 
lar ascendance pairs regardless of the abso- 
lute level of ascendance would fight over who 
should set and who should copy the pattern. 
Such a struggle would lead to a performance 
decrement. 

In order to examine the development of 
coordination skills it was decided to allow 
each subject to play the game several times 
with each of several partners. It was antici- 
pated that transfer effects would allow sub- 
jects to improve in their ability to coordinate 
in successive games with a given partner, It 
was also expected that with practice subjects 
would experience an overall improvement in 
their ability to coordinate in this situation 
regardless of the partner involved, This would 
lead to performance improvement as subjects 
progressed from partner to partner. 


METHOD 
Subjects 


The subjects were 16 female and 
students who were paid $3 for 3 ho 
ing and game playing. 


20 male college 
urs of test tak- 
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Procedure 


Subjects took a battery of tests prior to the game || . 


play sessions. This battery included: the Allport and 
Allport (1928) A-S Reaction Study for ascendance, 
the Wonderlic (1942) Personnel Test for intelligence, 
and the California F Scale. (Adorno, Frenkel-Bruns- 
wik, Levinson, & Sanford, 1950) for authoritarian- 
ism. Subjects were divided into four levels of as- 
cendance based on the four quartiles of ascendance 
on the ascendance test. From each of these levels a 
group of nine experimental subjects was chosen 
(four females and five males) so as to equate intel- 
ligence and authoritarianism in cach ascendance 
levél group, a 

For game-playing purposes the subjects met only 
once in sets of four subjects each. The subjects in 
these sets were all of the same sex, and one subject 
came from each of the four levels of ascendance 
groups. The game was played in pairs and each 
subject played the game three times with each of 
the other three set members (a total of nine games 
per subject). While any given subject pair played 
together the remaining two set members also played 
together. The specific order of the pairings of the 
subjects from the various ascendance levels was 
counterbalanced using a Three Problems X Three 
Pairings Latin square design which permitted the 
examination of levels and differences in level effects 
free from confounding order effects. 

Each subject was placed in a booth adjoining his 
partner’s. Booth assignments were made consistent 
with the pairings designated by the Latin square 
design. Booths were open both front and back, but 
prevented subjects from seeing each other. From 4 
supply of 27 different sets of four identical objects 
each subject was given an identical set of three 
different objects which he placed on the counter in 
his booth invisible to his partner. For example, each 
subject might be given a door key, a typewriter 
eraser, and an 8-inch black wooden rod. At a signal 
from the experimenter each subject was instructed 
to select one of the three objects in such a way as 
to match his partner’s selected object. The selection 
Procedure involved the subject’s picking up the 
selected object so that the experimenter, seated 
facing the subjects in front of the booths, could see 
the selected object but the partner could not. Fol 
lowing selection, subjects were told to “hold out 
the object so that each could see the partner’s selec- 
tion extended in front of the partition separating 
the subjects. If both objects were the same a “match 
occurred and both subjects were rewarded with an 
additional point of score. If a “mismatch” took 
place both subjects lost a point. The subject with 
the fewest “mismatches” at the end of the experi- 
ment received a prize. Trials, or attempts at match- 
ing, continued using the same three objects until 
either a criterion solution of 10 “matches” in a row 
occurred or until the subjects had completed Trial 
100. This ended Game 1. 

5 With the same subject pairings Game 2 was then 
introduced. The task for the subjects was essentially 
the same as for Game 1 except that the old objects 


J 


prohibited from reusing 
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ones (eg, a toy car, a 
hook) and subjects were 
formally identical game 
solutions. The latter was accomplished by handing 
notes to each subject who reached a solution on 
Game 1 which specified the solution previously used 
and which cautioned him against using that solution 
again (eg, “The solution which you have just 
reached was one in which you alternately selected 
either of two objects. You may not use this solu- 
tion again.”). For Game 3 the same pairs were 


were replaced with new 
clothespin, and a curtain 


, again given new objects and were again cautioned 


Hi 


` effects from problem to problem 


against reusing solutions (from both Game 1 and 
Game 2). 

Games 4-6 were replications of Games 1-3, but 
they were played with a different partner and with 
new objects. Subjects were instructed at the begin- 
ning of Game 4 that the game was the same one 
they had been playing, but that since they were 
playing with new partners any previous restrictions 
as to their solutions were void. Hence subjects were 
‘again free to use any solution. Again restrictions on 
reusing patterns were added in subsequent games as 
solutions were found by that particular subject pair. 
Games 7-9 replicated Games 4-6 with new objects 
and played in the last remaining pair combination. 
Thus each subject attempted nine object selection 
games, three successive games with each of the other 
three set members each of whom came from a differ- 
ent ascendance level. 

New objects were introduced with each new game. 
The use of objects rather than the more common 
buttons, levers, etc., was designed to serve two 
purposes, First, since subjects ordered the objects in 
front of themselves, having received them from the 
experimenter in an unordered pile, the ordering of 
objects in front of two pair members was not likely 
to be identical (as would typically be the case with 
some sort of response box). This might be expected 
to decrease the “matches” due to position biases 
tather than actual object choices. Second, new ob- 
Jects, and hence new object responses, could con- 
veniently be introduced with each new problem. 
This would be expected to reduce the transfer 
due to identical 
problem components. 

Prior to the beginning of play for Game 1, but 
after the subjects had been completely instructed as 
to the operations of the game, each subject filled out 
a questionnaire? Essentially the questionnaire asked 
each subject to indicate whether it seemed to the 
subject that the problem could reasonably be solved, 
and to defend his answer. In defending his answer 
the subject was asked to indicate: how a solution 


` might occur if the subject thought the problem 


solvable, why a solution was impossible if the sub- 


Ject thought the problem was not solvable, or what 
o tions made the subject uncertain as to 
whether a solution was possible. 


T_T 

a edisty following each problem subjects 

Ba responded to a questionnaire inquiring whether 

ag how solutions had been found. This aspect of 
e questionnaire did not prove fruitful. 
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RESULTS 


All combinations of simple correlations 
were run on game performance mismatch 
scores, Allport and Allport (1928) A-S scores, 
Wonderlic Intelligence scores, and California 
F Scale scores. None of these correlations 
proved significant. 

As a test of the first hypothesis concerning 
subjects’ absolute ascendance levels, a three- 
way analysis of variance of Three Pairings X 
Four Levels of Ascendance X Three Prob- 
lems was run using subjects’ mismatch scores. 
The three pairings refer to the games played 
with each successive partner. Games 1, 2, and 
3 form the first pairings, Games 4, 5, and 6, 
the second, etc. The three problems refer to 
the nth game played with all partners. Games 
1, 4, and 7 form the first problems, Games 2, 
5, and 8, the second, etc. The levels of as- 
cendance effect was compared with a between- 
subjects error term. The remaining effects 
were considered within-subject effects. The 
pairings effect F ratio was 5.37 (p < .005) 
and the problems effect F was 7.74 (p< 
001). None of the remaining effects, includ- 
ing the ascendance levels effect, approached 
significance (all Fs < 1.0). A consistent de- 
crease in mismatch was found as one pro- 
gressed from partner to partner. Across prob- 
lems, however, the first problems were as- 
sociated with the fewest mismatches, the 
second problems the most, and the third in 
between. 

A second analysis of variance was per- 
formed on pair rather than individual mis- 
match scores. This second analysis of vari- 
ance can be seen in Table 1. As with the first 


TABLE i 
ANALYSIS OF VARIANCE OF Parr Error SCORES 


Source df MS F 
Pairings (A) 2 | 108.4 | 2.88* 
Pair RENTE discrepancy (B) 11172.2497% 
Problems within pair 2 | 156.1 | 4.14*** 
AXB 2 1.1] 03 
AXC 4| 199| .53 
BXC 2'1: 228:5 _.76 
AXBXC 4 | 68.7 | 1.82 
Error 144 |_ 37.7 
i 
* 
34 Sos. 
+p <.025. 
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analysis this one included pairings and prob- 
lems effects, but in lieu of the absolute level 
of ascendance effect this second analysis com- 
pared the performance of discrepant versus 
similar ascendance score pairs. Pairs were 
considered similar in ascendance when the 
two players came from adjacent quartile lev- 
els of ascendance, and discrepant when their 
ascendance levels were more remote, The sig- 
nificance of the pairings and the problems 
effects (p < .06, p < .02, respectively) are 
in line with but not nearly as convincing as 
those found in the first analysis. In addition, 
a pair difference in ascendance effect was also 
found ($ < .035), with discrepant pairs mak- 
ing fewer mismatches than similar ascendance 
pairs. 

The pregame awareness of probable solu- 
tions did not affect Game 1 performance., 
From the pregame questionnaire: 11 sub- 
jects indicated a need for 1 subject to set up 
a pattern, 6 subjects mentioned patterns but 
not how to establish them, 9 saw the situa- 
tion as pure chance on each trial, and 10 sub- 
jects simply did not know or did not answer 
the question appropriately. All pairs reached 
criterion on the first problem by Trial 30, but 
pregame questionnaire responses did not sig- 
nificantly predict the number of mismatches 
to criterion in Game 1. 

Definite pattern preferences and sequences 
of pattern preferences were evident in the 
data. Since all pairs reached criterion on all 
problems, there were 162 solutions, Three 
patterns accounted for all but 4 of these solu- 
tions. The pattern of repeatedly selecting the 
same object (AAA) was used 54 times, the 
pattern of repeatedly rotating through all 
three objects (ABC) was used 54 times, and 
the pattern of alternately selecting either of 
two objects (ABA) was used 50 times. The 
intrusion of alternate patterns occurred only 
in the second or third problem in the first 
pairing. A definite sequence in the use of pat- 
terns developed during game playing. This 
sequence was AAA-ABA-ABC for Problems 
1, 2, and 3, respectively. Oddly enough this 
Sequence was not even the modal sequence 
used in the first pairing. With successive pair- 
ings this sequence became increasingly com- 
pelling until only one pair failed to use it in 
the last pairing. A chi-square comparing that 
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sequence of solutions with other Solution Se- 
quences X Three Successive Pairings proved 
highly significant (p < .001). 


DISCUSSION 


The results appear to refute the hypothesis 
that subjects’ levels of ascendance influenced 
game playing, but support the hypothesis 
that the discrepancy between the ascendance 
levels of game-playing partners affected game 
performance. The findings showed that larger 
differences between partners’ ascendance lev- 


els were associated with better performance 


than smaller differences. These were the re- 
sults predicted and can easily be understood 
in terms of the depreciating effects of a strug- 
gle for dominance in setting the pattern be- 
tween two fairly equally dominant individu- 
als. Such a struggle would be less likely when 
one of the individuals was clearly more domi- 
nant than the other. These findings seem im- 
portant in three ways. First, they tend to 
make less speculative the statement that real- 
life personality factors are at work in the 
tacit coordination game by demonstrating 
performance’s relationship to a personality 
attribute of its players. Second, the nature 
of that attribute, ascendance, is deeply tied 
to interpersonal communications and there- 
fore argues for the importance of interper- 
sonal communications in this game situation. 
Third, the findings coupled with the failure 
of ascendance, authoritarianism, and intelli- 
gence to correlate significantly, may lend hope 
to game researchers that in this and other 
games, subjects’ self-reports may be shown to 
relate more closely to game performance 
when differences between players’ scores 
rather than absolute scores are examined. 

As to the development of coordination 
skills, three of the four relevant F ratios 
Proved significant and the fourth showed such 
a strong trend (p= .06) that it seems rea- 
sonable to say that both the pairing and prob- 
lem effects were demonstrated. As hypothe- 
sized the number of mismatches decrease 
with each new pairing. Thus subjects appat- 
ently learned to coordinate in the game situa- 
tion. Two reasons for this improvement are 
clearly evident in the results, First, in the 
initial pairing the subjects had to establish 
the patterns to use, the order in which the 


1 


were to be used, and how the ob- 
e to fit into those patterns. In sub- 
pairings, as the particular patterns 
equences of patterns become increas- 
sompelling, the subjects more and more 
sto fit the objects to the obvious pat- 
nd, convergence or response bias 
ng objects developed with game prac- 
should increasingly permit pairs to 
their object selections as they progress 
sroblem to problem. Elevating the num- 
matches should facilitate the fitting of 
o the patterns. 
‘to the hypothesized problem effect, the 
were unexpected. Far from the con- 
improvement in performance with suc- 
problems within pairs which was hy- 
ed the performance in the first prob- 
best and on the second problem was 
One possible explanation for this phe- 
n concerns the fact that to a certain 
there were two sequences of patterns 
for supremacy in the game situation. 
terms of the development of coordina- 
i, perhaps the most impressive finding of 
nvestigation concerns the extent of the 
ation effects. Coordination training in 
‘ituation led not only to certain object 
on patterns becoming prepotent, but 
30 led to certain sequences of patterns and 
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certain object characteristics being favored. 
Some of these effects can be attributed to as- 
pects of the specific game situation used. The 
restriction preventing the reuse of patterns 
with a particular partner forced subjects to 
find a sequence of patterns, some of which 
were more obvious or conceptually simpler 
than others. 
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RATINGS OF FAVORABLENESS OF STATEMENTS 


ABOUT A SOCIAL GROUP AS AN INDICATOR 
OF ATTITUDE TOWARD THE GROUP* 


CLAIRE SELLTIZ,2 HAROLD EDRICH, anv STUART W. COOK è 


New York University 


Earlier studies reporting effects of judges’ attitudes on ratings of favorableness 
of statements have suggested the possible usefulness of such ratings as disguised 
attitude measures, Studies in which Negroes have been the attitudinal object 
have used a set of statements a high proportion of which are unfavorable by 
current standards. In the hope of making the technique more sensitive as an 
indicator of attitude, a new set of statements about the position of Negroes 
was developed, representing a more symmetrical distribution of favorable and 
unfavorable items. These were administered in 3 different geographical regions 
to groups of Ss differing in attitude toward Negroes, 2 criteria of judges’ atti- 
tudes were used: (a) membership in organizations such that it seemed reason- 
able to suppose that many of the members would hold specified attitudes about 
race relations; and (b) scores on a self-report inventory of attitude toward Ne- 
groes. Within each region, separate analyses using these 2 criteria for classifying 
judges yielded similar results: scale values assigned to statements by judges with 
different attitudes differed significantly. Difference in ratings of the unfavorable 


} 


and intermediate statements were systematic, with the prointegration Ss rating 


them as more unfavorable. 


This study is a sequel to one conducted by 
Zavalloni and Cook (1965), exploring the 
possibility of using ratings of favorableness 
of statements about a social group as an 
indirect measure of raters’ attitudes toward 
the group. This possibility was suggested by 
Hovland and Sherif (1952) in discussing 
their finding that, contrary to earlier beliefs, 
judges’ attitudes influence their ratings of 
the favorableness of items in the construc- 
tion of a Thurstone-type scale, These in- 
vestigators hypothesized that subjects with 
extreme attitudes at either end of a scale 
would tend to make fine discriminations 
among items at the end of the scale corres- 


1 This study was part of a program of research 
on the measurement of social attitudes supported 
by grants from the National Science Foundation 
and the Air Force Office of Scientific Research. The 
general framework of the Program is presented in 
Cook and Selltiz (1964). We are indebted to 
Thomas Henchy for assistance in all phases of this 
study. In addition, thanks are due to Jesse Geller, 
Walter Porter, William Schleicher, Fredric Weiz. 
ae and John Woodmansee, for help in collecting 
lata. 
? Now at the City College of the Ci iversi! 
Poet ity University 
8 Now at the University of Colorado, 
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ponding to their own position and to*lump 
together items at the end opposite their own | 
position, piling them in the extreme cate- 
gories. Their findings supported this predic- 
tion strongly in the case of items at the un- | 
favorable end of the scale, less strongly with 
respect to items at the favorable end. Subse- 
quent research (e.g, Hovland, Harvey, & | 
Sherif, 1957; Manis, 1960, 1961; Upshaw, 
1962; Weiss, 1959; Zavalloni & Cook, 1965) 
has supported the hypothesis that ratings ate 
affected by raters’ attitudes—and thus, by 
implication, the possibility that such ratings 
may be used as a basis for inferring the 
raters’ attitudes—but have varied in theit 
findings about the direction of differences it | 
ratings by favorable and unfavorable sub- 
jects. | 

The primary purpose of the present study 
was to develop a new pool of items about the 
social position of Negroes, corresponding 
more closely to current views on this issue 
than those used in earlier studies, and to test 
whether ratings of this set of items would 
also be influenced by raters’ attitudes, and 
thus whether the ratings might be used as af 
indicator of raters’ attitudes. 


: 
: 


RATINGS OF STATEMENTS AND ATTITUDE 


METHOD 
ı Materials 
Farlier studies of the influence of raters atti- 
tudes on ratings of favorableness of statements 
about Negroes (Hovland & Sherif, 1952; Upshaw, 
1962; Zavalloni & Cook, 1965) have all used the 
pool of items assembled by Hinckley (1932) in 
developing his scale of attitude toward the social 
position of the Negro. In developing a new set of 
items to be rated, we were guided by data from the 
two most recent of those studies—those by Upshaw 
and by Zavalloni and Cook. In both of these studies 
the item ratings showed a skewed distribution. 
Whereas Hinckley had reported that the items. were 
evenly distributed across the entire scale, Upshaw 
found that, 30 years later, southern college students 
rated only 31 of the 114 items as favorable (scale 
values 7.50 or higher on a scale from 1 to 11) and 
53 as unfavorable (scale values 4.49 or lower). 
Zavalloni and Cook, with a sample in which north- 
ern- students predominated, found the distribution 
even more skewed; 29 of the items were rated as 
favorable, 72 as unfavorable. For the present study, 
therefore, a new set of items about the social posi- 
tion of Negroes was developed, with more equal 
proportions of favorable and unfavorable items as 
judged by present-day standards. 
qh developing the new set of items, we were 
guided, by data from the Upshaw and Zavalloni- 
Cook studies. In general, we kept those items of the 
Hinckley pool on which there were sizable differ- 
ences between the median scale values assigned by 
pro-Negro and anti-Negro subjects in both of those 
studies and on which the differences between the 
scale values assigned by these groups of subjects 
Were in the same direction in the two studies. We 
also used such items as models in constructing new 
items. In general, we eliminated items that were 
tated essentially the same by pro- and anti-Negro 
subjects, or that showed inconsistent patterns 
ae the ratings by subjects with different atti- 
On the whole, the extremely unfavorable items in 
the Hinckley pool were not rated differently by 
ecis with different attitudes in the Upshaw and 
a Cook studies; almost all subjects rated 
tt se items very low. Of the 20 most unfavorable 
ems, we kept only 4; these were kept in order to 
ee some statements that almost all subjects 
ee see as belonging in the most unfavorable 
em gory and that could thus help to provide a 
mmon reference point for the unfavorable end of 
„the scale, 
gn both the Upshaw and Zavalloni-Cook studies, 
ai of Hinckley’s moderately unfavorable items 
ae rated differently by subjects differing in atti- 
sat ponent Negroes, with pro-Negro subjects 
DES em as more unfavorable than anti-Negro 
A s. Fourteen of these items were kept ver- 
GTS a another two were reworded for greater 
BEN oe new items intended to be moderately 
ae ‘able, but with more contemporary content, 
e written; for example, “Id rather not have 
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Negroes attending the same classes with my chil- 
dren.” 

Zavalloni and Cook found that midscale or inter- 
mediate items also tended to be sensitive to raters’ 
attitudes, except for a few clear statements of lack 
of interest in or knowledge about Negroes or race 
relations, which all groups of subjects placed close 
to the midpoint. These latter statements were 
dropped. In the Zavalloni-Cook study, the direc- 
tion of differences in ratings of the remaining inter- 
mediate items was the same as for the moderately 
unfavorable items; that is, they were rated lower 
(more unfavorable) by the pro-Negro than by the 
anti-Negro subjects. These items were of three 
general types: statements advocating a separate- 
but-equal policy, statements attributing both posi- 
tive and negative characteristics to Negroes, and 
statements making exceptions of a few outstanding 
Negroes. We kept four of these items, added others 
expressing these three themes in terms of more 
current issues, and also added two types of items 
that seemed to us to represent intermediate positions 
not included in the Hinckley pool: statements rec- 
ommending a gradualist approach to integration, 
and expressions of willingness to accept Negroes in 
situations involving only impersonal superficial 
contact. 

The earlier studies gave little guidance for con- 
struction of favorable items. There was no favorable 
item in the Hinckley pool on which the median 
ratings by all groups of subjects in the Upshaw 
and Zavalloni-Cook studies showed a completely 
consistent relation to subjects’ attitudes. Examina- 
tion of the favorable items in the Hinckley pool 
suggested one possible explanation, namely, that 
many of the supposedly favorable items had a 
strong derogatory or condescending undertone; for 
example, “A wide awake Negro is physically su- 
perior and in other respects equal to the white man.” 
Such items were discarded. There were nine items 
whose ratings showed relatively consistent relations 
to subjects’ attitudes in the Upshaw and Zavalloni- 
Cook studies; most of these were unambiguously 
positive (e.g. “J believe that the Negro is entitled 
to the same social privileges as the white man”), In 
both of those studies, these items tended to be rated 
as more favorable by the pro-Negro subjects than 
by the anti-Negro subjects, though this pattern was 
not completely consistent on any of them. Of these 
nine items, eight were kept verbatim, and one was 
reworded for greater clarity. We added several 
items similar to these, asserting the equality of 
whites and Negroes in vi ious contexts. In addi- 
we added items expressing points of view 
Hinckley series: assertions 
that the activities of Negroes and/or the existence 
of a multiracial society have positive values for 


on their right to equal treatment have benefited 
not only their own race but the whole country”) ; 
items explicitly advocating integration (“The Negro 
should be given equal rights through integration”) ; 
expressions of willingness to accept Negroes in close 
personal-social relationships (“Ii a Negro family 


410 


moved into my neighborhood, I would welcome 
them the same as I would welcome any other new 
family”). 

The final set consisted of 106 items, of which 33 
were from the Hinckley pool. Each item was 
printed on a separate card. The cards were arranged 
in random order, except that 4 strongly favorable 
and 4 strongly unfavorable items were placed 
early in order to give subjects a sense of the range 
of items. The 4 unfavorable items used for this 
purpose were taken from the most unfavorable 
items in the Hinckley pool; all of them had been 
given median ratings of less than 2.00 (on a scale 
ranging from 1 to 11) by all groups of subjects in 
both the Upshaw and Zavalloni-Cook studies. These 
items were placed as 2, 4, 16, and 28 in the present 
set. Of the 4 favorable items used for this pur- 
pose, 1 was new; the other 3 were from the 
Hinckley pool and had received median ratings 
higher than 9.40 by all groups of subjects in the 
two earlier studies. These strongly favorable items 
were placed as 1, 3, 8, and 17 in the present set. 
The cards were presented to all subjects in the 
same order. 


Procedure 


The instructions for sorting the cards (rating the 
items) were based on those given by Thurstone and 
Chave (1929).* The marker card for Category 1 
was labeled “most unfavorable,” that for Category 
11, “most favorable.” The marker cards for the 
other categories, including the middle one, were 
numbered but not labeled, in order to facilitate 
displacement of items in the middle range. 

After subjects had sorted the item cards in terms 
of favorableness toward the social position of Ne- 
groes, they were given other cards with which to 
indicate their own position. One of these cards 
read “Closest to my own views,” another “Most 
objectionable to me,” and others “Acceptable” or 
“Objectionable”; subjects were instructed to place 
these cards with the appropriate piles into which 
they had sorted the items. 

A questionnaire not relevant to the present study 
was administered next; it was included primarily 
to introduce some items about current issues other 
than race relations, in keeping with the explanation 
given in recruiting subjects (see below). Finally, 
subjects were given a self-report inventory of atti- 
tude toward Negroes. 

SEG 


*A complete list of the items, the instructions, 
and the self-report attitude inventory have been 
deposited with the American Documentation Insti- 
tute. Order Document No. 8481 from ADI Auxil- 
iary Publications Project, Photoduplication Service, 
Library of Congress, Washington, D. C. 20540, 
Remit in advance $2.00 for microfilm or $3.75 for 
photocopies and make checks payable to: Chief, 
Photoduplication Service, Library of Congress, ? 

5 This is a revised version of a questionnaire 
originally developed by Mary Evans Collins. In the 
present study, 20 new items were added to the 
inventory, 
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Subjects were tested in groups of 3-35. Total 
testing time averaged about hours. The tests | 
were administered in a single session, except in the 
case of 1-hour classes, where two class sessions 
were used (see below, Subjects). 


Subjects 


Subjects were American college students. After 
elimination of those whose data were not usable 
(see below), the total N was 607. In order to avoid 
relying solely on responses to a self-report inven- 
tory as the indicator of attitude, subjects were 
selected on a behavioral criterion of membership in 
organizations or participation in activities from" 
which attitudes about race relations might be in- 
ferred. The kinds of criterion subjects we were 
seeking, and our assumptions about the kinds of 
organizations and activities in which they might 
be found, are described below: 


Group I. Subjects with strongly equalitarian | 
views on race relations and extreme involvement 
in the issue. We assumed that Negro students 
active in organizations working for integration 
were likely to meet this description. 7 

Group II. Subjects with strongly equalitarian 
attitudes and’ active concern with race relations. 
We assumed that white students active in the 
same prointegration organizations as Group I 
were likely to fit this criterion. bh: 

Group III, Subjects with equalitarian attitudes 
but not active concern. We assumed that white 
students taking elective courses in intergroup rela- | 
tions or minority group problems were likely to 
fit this description, since they voluntarily expose 
themselves to information about minority groups | 
which is typically presented with an equalitarian 
view, but are not participating actively in efforts 
to change race relations. (All subjects were asked 
to list the organizations they belonged to. White 
subjects in these classes who reported member- 
ship in prointegration organizations were trans- 
ferred to Group II; Negroes in these classes wer? 
placed in Group I.) X 

Group IV. Subjects with anti-Negro attitudes. | 
It is difficult to find student organizations ° 
Which most or all members can be assumed to fit 
this description. However, other studies in the | 
Present program had found relatively high Pro" 
portions of such subjects in right-wing politica 
organizations and in some fraternities and son 
ties. In the present study, both these kinds 
groups were used as sources of subjects in this 
category. In selecting the fraternities and soror | 
ties, we used as informants students in strategit 
Positions in the campus social structure, and rep- ~ 
resenting different points of view—at least 0P? 
student active in fraternity or sorority affairs, al 
least one active in a liberal organization, and 4 — 
least one active in student affairs but not stronsiy — 
affiliated with either the fraternity system or 
liberal organizations. These informants were aske 
to name the fraternities or sororities most 8°% 
erally believed to have strong, though not neces- 
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sarily official or explicit, policies against admitting 
Negroes. Only those fraternities or sororities men- 


* tioned by a majority of the informants at a given 
institution were used as sources of subjects. 


Examination of the ratings of ‘tems by different 
groups of subjects in the Zavalloni-Cook and 
Upshaw studies suggested that some kinds of items 
are perceived differently in different parts of the 
country, over and above individual differences in 
attitudes toward race relations. Thus we tried to 


. secure subjects from each of the above criterion 


groups in each of three geographical areas: north- 
east, midwest, and border south, We were not, 
however, able to secure representation of all groups 
in all geographical areas. The distribution of usable 
subjects in each of the three areas is shown in the 
first column of Table 2. 

In addition, in the border southern city and in 
one of the midwest universities, “heterogeneous” 
subjects, not selected on the basis of organizational 
‘membership or activity with respect to race rela- 
tions, were secured through introductory psychology 
courses, ads in college papers, or solicitation in 
college dormitories. There were 136 usable subjects 
secured in this way in the border city, 12 in the 
midwest, 

Subjects in organizations were approached, through 
the local chapters of their organizations, as “in- 
formed students with points of view on public 
issues.” Each such subject who took part in the 
study was paid $1.50, the payment being made either 
to the organization’s treasury or to the individual 
subjects, as the organization chose. Heterogeneous 
subjects recruited through ads or direct solicitation 
were also paid $1.50. Subjects tested during class 
periods (intergroup relations classes) or in fulfill- 
ment of a course requirement to serve as experi- 
mental subjects were not paid. 


Checks on Adequacy of Data, Distribution of 
Items, and Subjects’ Own Positions 


t Preliminary examination of the data showed some 
ra that seemed to be errors—manifestly favor- 
e items being placed at the unfavorable end of 
i e scale (ie, being rated 1, 2, or 3), and mani- 
C unfavorable items being rated 9, 10, or 11. 
aes it appeared that a subject had consistently 

i ersed the two poles of the scale, his ratings of 
all items were changed to conform to the intended 


TABLE 1 
DISTRIBUTION OF Trem SCALE VALUES 


M 
Saas Description of category Siet S 
1.00-2.99 | Extremel: 
ly unfavorable 28 

ee Moderately unfavorable 18 
Ot 7.00 | Intermediate 11 
ae Moderately favorable 19 

-01-11.00 | Extremely favorable 30 
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TABLE 2 


SELF-REPORTS OF ATTITUDES BY SUBJECTS IN 
DIFFERENT CRITERION GROUPS 


Score on: 
Self-report Category of 
N attitude items closest to 
inventory* own position» 
M SD M SD 
Northeast 
Group I AE eee — 10.71 | 1.07 
Group IL 102 | 55.13 | 1.79 | 10.60 99 
Group HI 94 | 53.75 | 4.21 | 10.33 | 1.06 
Group IV |114 | 41.93 12.26 | 8.47 | 3.06 
Fi TSAA, Ea be 
A 57 
Midwest 
Group IT 26 | 55.46 | 2.10 | 10.65 | 1.00 
Group IV 55 | 43.94 | 8.81 | 9.29 | 1.88 
F 32.61*** 11.66*** 
` 4 
Border South 
Group IL 31 | 54.55 | 2.65 | 10.78 15 
Group IV 23 | 20.87 | 11.92 | 6.64 | 3.38 
F 49.87*** 44.21*** 
à h 1 


a Possible range: 0 (most prejudiced) to 57 (least prejudiced). 
b Possible range: 1 (most unfavorable) to 11 (most favorable). 
e Negroes were given special instructions on the attitude 
inventory; the data are not relevant here. 
a All statistical tests are two-tailed. 
++p < 001. 


direction of the scale. However, only a small pro- 
portion of the apparent errors could be accounted 
for in this way. Subjects who made 12 or more such 
errors, but did not consistently reverse the scale, 
were eliminated from the analysis, There were 32 
such subjects, most of them anti-Negro (members 
of organizations in Group IV and/or having rela- 
tively prejudiced scores on the self-report inven- 
tory). Elimination of these data had the effect of 
reducing, rather than increasing, the differences 
reported in the Results section (see below), since 
their retention would have made the mean ratings 
of unfavorable items by the anti-Negro subjects 
even more favorable and the mean ratings of the 
favorable items even less favorable. 

Table 1 shows the distribution of items in terms 
of the mean scale values assigned by the total 
sample in this study. It can be seen that the 
distribution is almost completely symmetrical, 

We examined the scores of subjects in the various 
criterion groups on the self-report attitude inventory 
as well as their identification of the category of 
jtems being rated that was closest to their own 
position, Mean scores of the various criterion groups 
on these measures are shown in Table 2. 

Scores on both measures support our assumptions 
about the ordinal positions of the groups; they are 
in keeping also with our expectation that there 
would be much greater heterogeneity in Group Iv 
than in Groups I, I, and III. However, in terms 
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TABLE 3 


Rances OF SCORES IN DIFFERENT QUINTILES ON 
ATTITUDE [INVENTORY 


Range of scores in 


Region mk } TA 
ji Middl seast 
PES quintile prendes 
Northeast | 5-42 53-54 37 
Midwest 21-39 48-52 56-57 
Border South 7-20 35-46 ae 


of absolute positions, as indicated by these self- 
report measures, the subjects in Group IV are not, 
on the whole, strongly anti-Negro, On the basis of 
their own reports on both measures, subjects in 
Group IV, on the average, would seem to be best 
described as moderately favorable in both the 
northeast and the midwest, and as intermediate or 
moderately unfavorable in the border south. 
However, the great range of attitudes represented 
in Group IV (and also among the heterogeneous 
subjects in the border south) made it possible to 
identify subjects who, by their own report, were 
more unfavorable than the average subject in 
Group IV. To this end, all subjects within each 
region (including the heterogeneous subjects, if any) 
were grouped on the basis of their scores on the 
self-report attitude inventory, regardless of their 
criterion group status, and the most prejudiced, 
middle, and least prejudiced quintiles selected for 
comparison, Since the distributions of scores in the 
three regions differed, the ranges of scores repre- 
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sented in corresponding quintiles also differed, 
Table 3 shows the ranges of scores on the self- 
report inventory represented in the lowest, middle 
and upper quintiles in each region. 


Analysis 


Two parallel analyses of the rating data were 
carried out within each region: one comparing the 
ratings of the items by subjects in the different 
criterion groups; the other comparing the ratings 
by subjects in the most prejudiced, middle, and 
least prejudiced quintiles, as identified by the atti- 
tuge inventory. The two analyses within each region 
are not, of course, independent, since all the sub- 
jects in the first analysis appear also in the second. 
However, the data from the three geographical 
regions are independent, thus providing three partial 
replications of the study. Throughout the analysis, 
the items were grouped into five subsets (extremely 
unfavorable, moderately unfavorable, intermediate, 
moderately favorable, and extremely favorable) on 
the basis of the mean ratings by the total sample 
in the present study. (See Table 1 for the scale 
values of items included in each subset.) Compari- 
sons were based on mean ratings by subject groups 
of all the items within each subset. 


RESULTS 


Table 4 compares the mean ratings by the 
criterion groups, within each of the three 
geographical regions, of the five subsets of 
items, 

Within each region, the pattern of ratings 


TABLE 4 
MEAN RATINGS or DIFFERENT Tyres or Items BY SUBJECTS IN DIFFERENT CRITERION GROUPS 


Type of item 
Region and Exi 
criterion groups | unfavoratte | Moderately Intermediate “ay Favorable 
M sD M SD M sD M SD Mu) | Sa 
nomi 
roup 174 | 132 | 268 | 95 | 503 | 150 a 
Group I 189 | .70 | 303 | 1:16 | 449 | 1149 PE 138 10 54 1.11 
Gioup Tit | 206 | 62 | 354 | 196 | Sar | 415 | ssi | tee | 979 | 97 
toup IV 2.47, ayes? | 409, | 107 | 609 | 134 | soo | 1:20 | 917 | 1.06 
Site 7 24.44 16.178 5.14 9.900" 
Group It | 2.01 | 69 | 314 | 117 | 4.96 
i i . Laroy 83 
Group IV 230k ‘62 405, hens? | 618 | 103 520 Ea et 90 
Bordar South j i 18.65%% 1.73 1.24 
roup 2.04 | 70 | 3.47 | 118 | 5.73 1 
Grop TV | 2.89 | 53 | 4a | 100 | 677 | ve | B02 | 1 | 392 | 6 
F 22.78 18.67%% 712 7 ean 18.0" 
i pag of groups and numbers of subjects in each are given in section on subjects, 
+p < 001. 
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TABLE 5 


t Mean RATINGS OF DIFFERENT TYPES OF ITEMS BY WTE SUBJECTS IN DIFFERENT QUINTILES 
ON ATTITUDE INVENTORY 


Type of item 
Region and Extrem E 
attitude quintile SRSA: MoS ebla Intermediate NTA ARN 
M SD M SD M SD M SD M SD 
Northeast 
Least prejudiced | 1.87 69 3.02 1.10 | 4.87 1.39 8.03 1.17 9.93 84 
Intermediate 2.10 64 3.49 » 98 5.69 1.30 8.34 1.08 9.60 1.06 
Most prejudiced 2.73 68 4.57 1.02 6.54 1.1 8.10 1.27 9.14 | 1,09 
ali 21.157** 28.46*** 19.83" 91 P25 
Midwest 
Least prejudiced | 2.08 17 3.10 1.17 4.68 1.25 7.15 1.15 9.38 | 1.10 
Intermediate 2.12 50 3.73 82 5.73 87 8.05 94 9.28 84 
Most prejudiced 2.37 74 4.32 97 6.68 1.07 8.46 98 9,48 1.10 
F 89 6.57* 15.12*** 2.06 16 
_ , Border South 
Least prejudiced 2.03 70 Biol 1.13 5.81 1.51 8.72 1.18 10.02 74 
Intermediate 2.31 59 4.64 1.34 6.94 1.57 8.41 1.27 9.57 | 1.08 
Most prejudiced | 2.86 T 5.44 1.65 7.25 1.31 8.19 1.09 9.00 .90 
lig 13.26*** 17.32*** 9.47** 1.84 11.27%5% 
*p <.05. 
om OL. 
+) $001. 


of the unfavorable items—both the extremely 
and the moderately unfavorable—is as pre- 
dicted by Hovland and Sherif (1952), and 
as found, on the whole, both by those investi- 
gators and by Upshaw (1962) and by Zaval- 
loni and Cook (1965): these items are rated 
lower (ie, more unfavorable) by subjects in 
the right-wing political organizations and 
exclusionist fraternities or sororities. More- 
Over, in the northeast, where it had been 
Possible to secure subjects from all four types 
of criterion groups, the mean ratings by the 

four groups follow the predicted order. 
_ On the whole, ratings of the intermediate 
items also follow the pattern predicted by 
Hovland and Sherif and found by those in- 
Vestigators and by Zavalloni and Cook: 
Within each of the three geographical regions, 
the equalitarian subjects rate these items 
significantly lower (less favorable) than the 
Subjects from right-wing groups and exclu- 

* Slonist fraternities, 

pona on the moderately favorable items 
E no consistent pattern. Scores on the 
5 remely favorable items show greater con- 
| sistency than those on the moderately favor- 
t anle, items, Within each of the three regions, 
ese items are rated higher (more favor- 


able) by the subjects in prointegration 
organizations and intergroup relations classes 
than by those who are members of right- 
wing groups or exclusionist fraternities, This 
pattern is the reverse of that predicted by 
Hovland and Sherif (1952) and of that 
found for the unfavorable and intermediate 
items, which were rated lower by the equali- 
tarian subjects than by those in right-wing 
political groups and exclusionist fraternities. 
Moreover, a corresponding reversal on the 
favorable items was noted both by Upshaw 
(1962) and by Zavalloni and Cook (1965). 

Table 5 compares the mean ratings of the 
five subsets of items by subjects in the least 
prejudiced, middle, and most prejudiced 
quintiles on the attitude inventory. These 
results show the same pattern as the ratings 
by criterion groups. 

Since ratings of the extremely unfavorable, 
moderately unfavorable, and intermediate 
items differed consistently among different 
attitudinal groups in all three regions, each 
subject’s mean ratings of these three subsets 
were summed to yield a “summated judg- 
ment score.” Table 6 compares members of 
the criterion groups on this measure; in 
Table 7 the same measure is used to compare 
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TABLE 6 


SumMMATED JUDGMENT SCORES OF SUBJECTS IN 
DIFFERENT CRITERION GROUPS 


Summated judgment score 
Region and 
attitude group 
M SD 
Northeast 
Group I 9,45 2.61 
Group II 9.86 3.00 
Group UL 11.47 2.26 
Group IV 12.64 2.67 
F 24.27** 
` 42 
Midwest 
Group IT 10.12 2.98 
Group IV 12.53 2.15 
F 16.70*** 
A : 42 
Border South 
Group IL 11.24 3.03 
Group IV 14.46 1.72 
F 20.25**. 
» 53 
H p <.001, 


subjects in the least prejudiced, middle, and 
most prejudiced quintiles on the attitude 
inventory. As would be expected from the 
preceding data, within each region, groups 
of subjects with different attitudes differ sys- 
tematically in these “summated judgment 


TABLE 7 
SUMMATED JUDGMENT SCORES of Warre SUBJECTS IN 


DIFFERENT ON ATTITUDE INVENTORY 
Summated judgment score 
Region and attitude group 
M SD 
Northeast 
Least prejudiced 9.76 2.87 
vic a 
‘ermediate quintile 11.27 
Most prejudiced ast 
quintile 13.84 2.48 
F 29. k 
Midwest 
Least prejudiced 9.86 3.07 
aeeai until 
ermi e quintile 11.57 
Most prejudiced 13.37 2 
quintile f 
* 
Border S Fia 
east prejudiced 11.3 
Sp MR Se nA 
ntermediate quintile 13.88 2, 
Most prejudiced 15.55 308 
quintile 
F 18.63*** 
Hk p < 001. 
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scores.” All F ratios are significant al 
than the .001 level; eta coefficients bel 
favorableness scores and group membi 
are .42, .42, and .53 in the northeast, 
west, and border south regions, respect 
Moreover, ¢ tests show that, within ea 
the regions, each of the subgroups 
significantly from all of the others. The 
exception is for Criterion Groups I and 
(Negro and white members of prointeg ) 
organizations) in the northeast. 
In addition to the comparisons of m 
ratings by different groups of subjects, | 
relations were computed within each re 
between scores on the self-report atti 
inventory and the ratings of favorabli 
As would be expected from the differe 
among the attitudinal groups in their ratin 
of the items, ratings of the extremely 1 
favorable, moderately unfavorable, and in 
mediate items show moderate correlation 
with scores on the attitude inventory. T 
correlations between summated judgn 
score and the attitude inventory score 4 
almost identical in the three geogtaph 
regions: 43, .42, and .45. 


SUMMARY AND DISCUSSION 


Earlier studies reporting that judgment 
the favorableness of statements about 
attitudinal object are influenced by judi 
attitudes toward the object have suggest 
the possible usefulness of such ratings 
disguised measure of attitude. Studies 
which Negroes have been the attitudinal 
ject have used a set of statements a hig 
Proportion of which are, by current st 
ards, unfavorable. In the hope of making 
technique more sensitive as an indicator 
attitude, a new set of statements was 4 
veloped, representing a more symmetrical 
tribution over the continuum from unfavo 
able to favorable. Two different (thou! 
not entirely independent) criteria of ju 
attitudes were used: membership in organ 
tions such that it seemed reasonable to SU 
Pose that many, if not all, of the membe 
would hold specified attitudes about 
relations; and scores on a self-report inv 
tory of attitude toward Negroes. Sep: 
analyses using these two criteria for clas 
Ing judges yielded similar results: 
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values assigned to statements by judges with 

. different attitudes differed significantly. The 
study was carried out in three different geo- 
graphic regions, and the data analyzed sepa- 
rately for each region, thus providing three 
replications. 

Subjects with differing attitudes differed 
systematically in their ratings of unfavorable 
and intermediate items, with pro-Negro sub- 
jects rating these items as more unfavorable 
than anti-Negro subjects. This was true, in 
all three geographic regions, and for both 
criteria of raters’ attitudes. This finding is 
consistent with predictions made by Hovland 

and Sherif (1952) and with findings of a 
number of other studies (e.g, Hovland & 
Sherif, 1952; Upshaw, 1962; Zavalloni & 
“Cook, 1965). 

Scores based on ratings of the unfavorable 
and intermediate items proved to be sensitive 
to even slight differences in attitudes. Hov- 
land and Sherif had suggested that ratings 
are influenced by attitude only in the case 
of subjects with extreme attitudes and 
stron’ involvement in the issue; thus, by 
implication, only the scores of subjects at 
opposite extremes of the attitude continuum 
should differ significantly. However, we found 
Significant differences even between mean 
ratings by two groups that may both be 
presumed to be in the favorable region of 
the attitude continuum—white subjects who 
are active members of prointegration organi- 
zations and white subjects who are taking 
elective courses in intergroup relations but 
‘who do not belong to prointegration organi- 
zations, Scores of both these groups differed 
significantly from those of subjects belonging 
to extreme conservative political organiza- 
tions or to fraternities generally believed by 
other students at the given institution to be 
strongly opposed to admission of Negroes as 
Members. Moreover, within each of three 


' . geographical regions, when subjects were 


divided into quintiles on the basis of their 
Scores on the self-report attitude inventory, 
both the most favorable and the least favor- 
able quintiles differed significantly from the 
middle quintile, as well as from each other, 
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in their ratings of the statements. This abil- 
ity to distinguish among favorable subjects, 
and to distinguish a middle group from both 
favorable and unfavorable subjects, is con- 
sistent with findings of both Upshaw (1962) 
and Zavalloni and Cook (1965). 

Differences in ratings of the favorable 
items were less consistent, but tended to be 
in the direction opposite to those for the 
unfavorable items. On the whole, the more 
pro-Negro subjects rated these items as more 
favorable than did the anti-Negro subjects. 
This finding is the reverse of that predicted 
by Hovland and Sherif (1952), but is con- 
sistent with findings by Upshaw (1962) and 
Zavalloni and Cook (1965). The latter two 
papers contain discussions of possible reasons 
for this reversal. 
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PARENTAL DISCIPLINE, STIMULUS CHARACTERISTICS OF 


OUTGROUPS, AND SOCIAL DISTANCE IN CHILDREN 


RALPH EPSTEIN anv S. S. KOMORITA* 
Wayne State University 


Based on the assumption that both parental discipline and situational variables 
are related to the development of prejudicial attitudes in children, the effects 
of 3 independent variables (perceived parental punitiveness, racial, and socio- 
economic characteristics of the outgroup) upon social distance were investi- 
gated. The experimental situation consisted of presenting slides of a fictitious 
group, and depending on the appropriate condition, this group was depicted as 
white or Oriental and middle or working class. The working-class condition 
elicited significantly greater social distance than the middle-class condition; 
however, no significant differences between levels of race were obtained. A 
curvilinear relationship between perceived parental punitiveness and social 
distance was obtained, and this result was interpreted in terms of Bronfen- 
brenner’s concept of “optimal levels of control” and in terms of identification 


theory. 


The “scapegoat” hypothesis has been the basis 
of both psychoanalytic (Adorno, Frenkel-Bruns- 
wik, Levinson, & Sanford, 1950) and social psy- 
chological (Allport, 1954; Berkowitz, 1962) con- 
ceptualizations of prejudice. Two basic assump- 
tions underlie this hypothesis: severe and punitive 
parental discipline, primarily directed towards 
aggression, may increase rather than inhibit the 
instigation to aggress; and, anticipation of punish- 
ment for aggression directed toward the ingroup 
results in displacement from the original sources 
of frustration to outgroups. Research regarding 
this hypothesis, however, has yielded equivocal 
and contradictory results. Whereas positive re- 
sults have been reported by some investigators 
(Miller & Bugelski, 1948; Pettigrew & Cramer, 
1959), the appearance of negative findings 
(Masling, 1954; Mosher & Scodel, 1960; 
Stagner & Congdon, 1955), suggests that further 
research is required in order to specify the con- 
ditions under which the scapegoat hypothesis 
can. account for an appreciable part of the 
variance, 

Tt is plausible that these inconsistent findings 
may be attributed to the lack of comparability 
of subjects, measuring instruments, and frustra- 
tion conditions, Equally important, however, has 
been the failure to determine the subject’s 
perception of the frustrating conditions. In terms 
of parent-child relations, it is reasonable to as- 


1The authors are indebted to E. Roeper and the 
staff of the City and Country School, Bloomfield 
Hills, Michigan, for their cooperation during the 
execution of this study. 


predictive power of a measure based solely on 
parental reports of discipline. For this reason, 
the current study focuses on children’s percep 
tions of parental punitiveness towards aggression 
as an independent variable and its relation to 
the development of social distance attitudes. 

As Allport (1954) has suggested, the scapegoat 
hypothesis emphasizes the motives of the preju- 
diced individual but is ambiguous regarding those 
characteristics of the outgroup which facilitate | 
its selection as a target for displaced hostility- 
Thus, few outgroup characteristics have been 
studied so far, for example, prior dislike (Berko- 
witz, 1962), strangeness (Allport, 1954), cand 
visibility (Williams, 1947). A basic hypothesis m 
the current study is that the development 0 
social distance towards an outgroup reflects 30 
interaction between perceived parental punitive | 
ness and two main characteristics of the out- 
group, namely, social status and race. 

It is reasonable to assume that one result of 
severe parental discipline may be to sensitize the : 
child to power relations of strong-weak, superior 
inferior as these dimensions are culturally defined 
by social status. Assuming that the child views 
himself as having lower status relative to ia 
parents, it is likely that punitive discipline wil 
signify to the child that low social status 15 a 
crucial variable in determining the acceptability” 
of other persons. On this basis, it was predicte 
that children who view parents as punitive W 
develop greater social distance towards an 0% 


sume that a child’s misperceptions or distortions 
of parental behavior might serve to lower the 
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group as compared to children of nonpunitive 
_ parents. Furthermore, children of punitive par- 
‘ents are more likely to develop greater social 
distance toward a low-status rather than a 
middle-class group and toward Oriental rather 
than white children. Previous research (Harris, 
Gough, & Martin, 1950; Weatherley, 1963) which 
demonstrates a positive relationship between 
parental punitiveness and prejudice have failed 
to control for level of prejudice in the parents. 
' This control is crucial insofar as these relation- 
ships could be attributed to parental attitudes 
rather than discipline. For this reason, it Was 
decided to study the development of prejudice 
towards a fictitious group since it is unlikely that 
parents would have social-distance attitudes 
towards such a group. Thus, an obtained rela- 
tionship between parents’ discipline and children’s 
attitudes could not be attributed to parental 
attitudes. 


METHOD 
Subjects 


The sample consisted of 120 boys and girls who 
comprised the third through eighth grades at the 
City and Country School, a private school in Bloom- 
field Hills, Michigan, This school serves children 
whose Stanford-Binet IQs are generally above 
average, and the socioeconomic background of these 
children, as determined by father’s occupation and 
oe is predominantly middle to upper middle- 
class, 


Measure of Parental Punitiveness 


The Parental Punitiveness Scale (PPS) was de- 
veloped by the authors to measure children’s percep- 
tions of parental punitiveness towards aggression. 
A detailed description of the development of this 
scale is reported elsewhere (Epstein & Komorita, in 
press). Briefly, the scale consists of 45 items which 
Measure parental punitiveness towards physical, 
verbal, and indirect aggression in each of five major 
Situations: aggression towards parents, teachers, 
siblings, peers, and inanimate objects. A sample 
item follows: 


Tf I threw a rock at someone's car, 


My a. send me to bed without a. MY 
` AUER supper MOTHER 
ULD b. take away my television b. WOULD 
; privileges 
c. whip me c 


d. have a long talk with me d. 


can abjecte task is to respond to each item by 
of ee one of the four alternatives, first in terms 
of Bat the father would do, and second, in terms 
te at the mother would do. Responses to each 
of ain differentially weighted from 1 to 4 in terms 
i neir relative severity and the score on the scale 

msists of the sum of the weighted scores. The 
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scale is scorable separately for father and mother. 
However, since the correlation coefficient between 
father and mother versions was found to be .60, the 
two scores were pooled to yield a single, average 
punitiveness score. The split-half reliability of this 
average punitive score, with the Spearman-Brown 
correction, was .81. 


Experimental Conditions 


Three independent variables were used: race of 
target group: Oriental versus white; socioeconomic 
class of target group: lower versus middle class; and 
high, medium, and low groups on the basis of scores 
on the PPS. Thus, a 2X 2X3 factorial design was 
employed with 10 subjects in each of the 12 experi- 
mental conditions. 

The basic purpose of the experimental conditions 
was to create specific cognitions regarding a fictitious 
group, the “Piraneans,” Accordingly, subjects were 
presented slides which depicted Piraneans as either 
middle or lower class and Oriental or white. Race 
of Piraneans was varied by presenting slides of four 
Oriental or white children, two boys and two girls 
each, who were representative of the subjects’ age 
range. Socioeconomic class was varied by presenting 
slides depicting residence and working place of 
Piraneans. For example, the working-class slides 
depicted scenes of a ramshackled house, deteriorated 
slum streets, and street construction, whereas the 
middle-class slides depicted a new split-level home, 
suburban streets, and a modern office building. 

Prior to the group administration of the slides, 
the following instructions were given: 


There is a group of people whom most of you 
have never seen. As a matter of fact, you have 
probably never heard of this group. They are 
called Piraneans. Would you like to see some 
slides of the Piraneans? 


After viewing the slides, subjects completed a 
7-item social distance scale with regard to Piraneans. 
These items ranged from, “Would you want to 
marry these people when you grow up?” (minimal) 
to, “Would you want these people to visit your 
country?” (maximal). Each item could be answered 
by checking one of four alternatives ranging from, 
“very much yes,” to “yery much no.” 

In order to minimize the potentially confounding 
factors of differential clarity and brightness, the 
slides were matched as closely as possible in terms 
of these variables. The “low socioeconomic” slides 
were based on scenes within Detroit slums whereas 
the “middle socioeconomic” slides were based on 
photos of suburban areas. Postexperimental inter- 
views with a sample of subjects indicated that very 
few were able to state the specific locale of the slides 
although several subjects believed that the photos 
were taken within the United States. 

Finally, in order to determine the relationship be- 
tween the PPS and generalized social distance, ratings 
of the following groups were obtained 2 weeks after 
the experimental sessions: German, French, Catholic, 
Italian, Mexican, Negro, Japanese, Jewish, and 
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TABLE 1 


MEANS AND STANDARD DEVIATIONS OF PIRANEAN 
SOCIAL DISTANCE SCORES FOR 
EXPERIMENTAL GROUPS 


Vorking class Middle class 
Seat) URI M social 
punitive- distance 
ness | Oriental | White | Oriental] White 
High 13.0 13.40 | 11.7 11.60 | 12.43 
ki aons | dss) | an | (49 
Medium | 16.40 6.10 | 11,90 | 1290 | 14.33 
2.25) | (2.24) | (1.74) | (2.03) 
Low 13.20 | 13.80 | 10.20 | 12,90 | 12.53 
(2.25) | (1.88) | (4.50) | (1.36) 
M social distance 
M working class 14,32 
M oriental 12.73 
M white 13.45 
M middle class 11.87 


a Standard deviations are enclosed in parentheses. 


Russian, Social distances scores for each of these 
groups were then pooled to obtain a measure of 
generalized social distance. 


RESULTS 


The social distance scores for the 12 experi- 
mental groups are summarized in Table 1. 

An analysis of variance of these scores showed 
no significant interactions between the three vari- 
ables. With regard to main effects, the effects 
of race were not significant at the 5% level 
(F = 1.07, df = 1/108); however, the main ef- 
fect for social class was significant at the 1% 
level (F = 12.51, df = 1/108), and the effect of 
parental punitiveness was significant at the 5% 
level (F = 3.18, df = 2/108). 

The extreme right-hand column in Table 1 
indicates, contrary to predictions, a nonmonotonic 
relationship between parental punitiveness and 
social distance attitudes. Children who perceived 
parents as employing a moderate degree of puni- 
tiveness developed greater social distance towards 
the fictitious group as compared with children 
whose parents were viewed as employing either 
permissiveness or highly punitive discipline. 
Moreover, note that this curvilinear relationship 
between parental punitiveness and social distance 
is found for both social-class conditions. 

In order to determine the consistency of this 
relationship, the correlation coefficient between 
PPS and the generalized social distance measure 
was computed. The correlation coefficient was 
—.21 but the scatterplot of the two variables 
again suggested the same type of functional rela- 
tionship, that is, a concave downward function. 
An F test for departures from linearity of regres- 
sion resulted in an F value of 4.13, and with 
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df =3/115, this is significant at the 1% level. 
The correlation ratio, eta, was 33. These results , 
together with the results of the main analysis, 
although based on the same sample of subjects, 
indicates that the nonmonotonic relationship be- 
tween punitiveness and social distance towards 
outgroups is reliable and consistent. 


DISCUSSION 


A major finding in this study is that upper- 
middle-class children, attending a school where 
tolerant attitudes towards minority groups are 
encouraged, show minimal prejudice (social dis- 
tance) toward Orientals but appreciable social 
distance toward the working class, These results 
are consistent with recent research which sug- 
gests that the outgroup’s occupational status is 
relatively more important than race, among 
middle-class adults, in the United States 
(Triandis & Triandis, 1962). The significance of 
both the subjects’ and the outgroup’s social 
status is further supported by Westie’s (1952) 
report that relative to lower-class subjects, 
upper-class persons express less social distance 
toward Negroes, and that the outgroup’s social 
status appears to be a more important determi- 
nant of social distance among upper-class pet 
sons. In this connection, the question may 
raised as to whether “class prejudice” may be 
an instance of a more fundamental “belief preju- 
dice” (Rokeach, 1960). If further research shows 
that middle-class persons attribute beliefs and 
values to lower-class persons which are discrepant 
from their own, it is plausible that high social 
distance may be an attempt to minimize cognitive 
dissonance which might be elicited by contact 
with persons who entertain dissimilar attitudes ` 
and beliefs. However, an equally plausible and 
perhaps more parsimonious interpretation is that 
middle-class children learn from primary a 
secondary reference groups, to avoid working: 
class individuals who tend to be associated with 
undesirable attributes, for example, “lack 0 
ambition,” “ignorance,” and “inferiority.” Fur- 
ther research is currently underway to deter- 
mine the relative validity of these alternative 
interpretations. 1 

The curvilinear relationship between parenti 
punitiveness and social distance is contrary to 
previous investigations (Harris et al., 1930 
which have shown a positive, monotonic rela 
tionship between the two variables. It is plausible 
that this discrepancy is due primarily to © 4 
ferences in measuring instruments and at 
populations. Previous studies, for example, hiv" 
not systematically explored parental discipline * 
perceived by the child. It is reasonable to assum? 
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that a child’s perceptions may differ from par- 
ental accounts and caution is required in assuming 

* that discipline, reported by parents as mild or 
severe, will be experienced by children at that 
level. Since recent evidence (Bronfenbrenner, 
*1961) suggests that parental reports of discipline 
may be less accurate than children’s, perhaps 
greater emphasis should be placed on the latter. 
Furthermore, whereas previous studies have been 
conducted with lower and lower-middle-class 
populations, the use of an upper-middle-class 
population in the current study suggests that 
significant qualitative differences in discipline 
among social-class groups may produce divergent 
results regarding the relation between parental 
discipline and social distance. 

Caution is also required in generalizing our 
findings to populations other than middle-class 

. children, Employing the concept of “optimal 
levels of control,” Bronfenbrenner (1961) has 
demonstrated that middle-class children, as com- 
pared with working-class children, require milder 
degrees of parental control to achieve a specific 
level of leadership and responsibility. Middle- 
class parents who employ severe controls, beyond 
a given optimal level, seem less influential. In 
terms of our findings, these results raise the 
question as to whether our middle-class subjects 
may have experienced greater frustration and 
anxiety under moderate discipline as compared 
with working-class children, A further question 
arises as to whether the moderately disciplined 
child, as compared with the low or highly pun- 
ished child, may experience maximal uncertainty 
regarding punishment. He may not expect severe 
punishment, but might receive it occasionally. 
This uncertainty would preclude adequate adapta- 
tion to the discipline and result in maximum 
frustration. 

The conceptualization that moderate punitive- 
hess may be highly frustrating, and thereby give 
rise to hostile affect which eventually may be 
displaced, is congruent with the power theory of 
identification (McCandless, 1961). According to 
this theory, moderate discipline, which combines 
elements of reward and punishment, appears to 
be most conducive to identification. An impor- 

_ tant result of the moderately disciplined group’s 
greater parental identification would be increased 

, effectiveness of a given degree of punitiveness. 
A moderate degree of punitiveness among highly 
identified children may be experienced as more 
frustrating than severe discipline applied to chil- 
dren with weak parental identifications. Support 
for this formulation is suggested by recent 
evidence (Epstein & Liverant, 1963) that highly 
identified children are more responsive to adults’ 
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influence techniques as compared with weakly 
identified children. 

Whereas previous research (Harris et al., 1950; 
Miller & Bugelski, 1948) has emphasized the 
personality characteristics of children who are 
prone to develop prejudice, the current study 
suggests the meaningfulness of exploring the 
joint influence of parent-child relations and the 
stimulus characteristics of outgroups upon the 
development of these attitudes. It has been 
clearly demonstrated that discrimination and 
prejudice towards a particular minority group 
may contribute to social immobility and inferior 
socioeconomic status of that group. However, 
although it has been generally assumed, there 
has been little empirical research showing that 
menial or inferior social status, in turn, may 
contribute to the development of these negative 
attitudes. 

In conclusion, the results of this study indicate 
that a child’s direct or vicarious participation 
in a social structure which assigns menial or 
low socioeconomic status to a group will tend 
to facilitate the development of social distance 
towards that group. We therefore have a vicious 
cycle in which prejudice and low social status 
are both cause and effects of each other. Thus, 
in accordance with the principle that in order 
to reduce prejudice the environmental support 
for the prejudice must be eliminated (Krech & 
Crutchfield, 1948), a major implication of these 
results for attitude-change programs is that the 
objective or perceived social status of the out- 
group must be changed as well as the psycho- 
logical functioning of the prejudiced personality. 
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The hypothesis was that persuasive communications are more effective when 
advertised as containing new arguments than when advertised as containing 
familiar arguments, holding the actual novelty of arguments constant. The 
effects of expected novelty of arguments upon selectivity of exposure were 
also investigated. 148 undergraduates were tested in a simulated jury situation. 
Expected novelty of arguments was varied by instructions. Opinion change 
following a persuasive communication was measured. 3 measures of exposure 
were used; ratings of article titles, selection of 1 title as most interesting, and 
time spent reading the communication. Opinion change was greater wien Ss 
expected new arguments than when they expected familiar arguments, although 
the communications used were identical. Anticipated novelty of arguments did 
not itself affect selectivity of exposure, although acquitters were more selective 
than convictors, particularly when they expected new arguments. 


Numerous students of propaganda have pointed 
to the audience’s prior familiarity with issue and 
argument as an important determinant of per- 
suasive effects. For example, Klapper ( 1960) has 


1 The authors wish to express particular apprecia- 
tion to Marianne Osborne for testing the subjects 
and analyzing much of the data. Thanks are also 
due to Leon Festinger, Harold H. Kelley, and T. M 
Tomlinson for their helpful suggestions, and to 
Edward F. O’Connor, Jr. for some of the data 
analysis, The research was supported by the Group 
Psychology Branch of the Office of Naval Research 
Contract Nonr-233(54), Bertram H. Raven, prind- 
pal investigator, and by National Science Founda- 
tion grants to the authors. 


contended that mass communications are C0 
erably more effective on new issues than t 
are on familiar issues. In the latter case, 
propagandist presumably runs afoul of uns 
thetic audience predispositions. He is 
effective on new issues because such pre 
tions generally are weak or nonexistent whel 
audience has little familiarity with the 

Complementing this point, McGuire (1961; 
Guire & Papageorgis, 1962) has found i 
research on cultural truisms that a 
counter to the subject’s position are g 
more persuasive when they are new to hi 
when he has previously been exposed to 
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However, a propagandist is not always in a 
position to raise a new issue or present a new 
argument. Many important issues endure long 
past the time when the audiences of mass com- 


, munications were first exposed to them. Similarly, 


as issues persist it becomes more and more diffi- 
cult to generate entirely new arguments for any 
given position. Very often, the best a propa- 
gandist can do under such circumstances is to try 
to convince his audience that an old and hack- 
neyed argument is in fact a new and unfamiliar 
one. For example, a presidential candidate may 
announce that he is issuing a “white paper” 
containing his position on an important issue. The 
white paper itself may actually contain only the 
same tired slogans he has enunciated throughout 
the campaign, but it is possible that they have 
more force because the audience expects them to 
be novel solutions to old and difficult problems. 

The hypothesis proposed in the present study 
is that propaganda advertised as “new” is more 
effective than propaganda advertised as contain- 
ing familiar arguments, even when the actual 
content of the propaganda is the same in both 
cases. This prediction rests upon the assumption 
that expecting new arguments loosens the sub- 
ject’s commitment to his original opinions, and 
thereby facilitates acceptance of the message. A 
person who is told to expect new and unfamiliar 
arguments is being told, in effect, that his present 
opinions on the issue are based upon partial or 
inadequate information, He should therefore be 
less committed to his initial opinions than a 
person who expects information similar or ident- 
ical to that upon which his present opinions are 
based, According to this reasoning, the mere 
expectation of novelty should be sufficient to 
give propaganda added effectiveness; the argu- 
ments do not themselves actually have to be new 
and unfamiliar. 

A second question about expected novelty in 
propaganda concerns its effects upon selectivity 
of exposure. A familiar finding is that persons 
tend to expose themselves primarily to propa- 
ganda which is in accord with their existing opin- 
ions. However, this finding has held most clearly 
on long-standing issues (such as those chronically 


_ contested by the two major political parties) and 


among those respondents who are most involved 
and interested in the issue (cf. Klapper, 1960; 
Lazarsfeld, Berelson, & Gaudet, 1948; Schramm 
: Carter, 1959). Selective exposure effects seem 
© be most apparent, therefore, when the audi- 
ence is most familiar with the arguments on both 
sides of the issue. Perhaps the individual will 
poy more interest in nonsupportive propaganda 
if he expects it to contain new arguments. That 
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is, knowing that unfamiliar dissonant arguments 
exist, but not knowing what they are, may in 
itself create dissonance. Perhaps this dissonance 
can be reduced most effectively by exposing one- 
self to the dissonant arguments and attempting 
to refute them. According to this analysis, volun- 
tary exposure to information will favor dissonant 
information when the subject expects novel argu- 
ments. 

On the other hand, dissonant arguments may be 
more threatening when they are new than when 
they are familiar. If it is assumed that selective 
exposure occurs in part because subjects avoid 
threatening dissonant information (Festinger, 
1957), then subjects expecting new arguments 
should avoid dissonant information more (and 
seek consonant information more) than subjects 
expecting old, familiar arguments. Given these 
alternative possibilities, no prediction was made 
about the effects of expected novelty of argu- 
ments upon selectivity of exposure. 


METHOD 
Synopsis ' 

The subjects were recruited in groups of four and 
five for a jury experiment, and were tested in indi- 
vidual rooms. Each subject read an abbreviated re- 
port of a trial and voted on a verdict (acquittal or 
conviction). Each subject then indicated how much 
he would like to read each of five articles and desig- 
nated the one he would most like to read. The sub- 
ject was next given a persuasive communication and 
the time spent reading it was recorded, This yielded 
three measures of exposure (article ratings, first 
choice of articles, and time spent reading). The 
opinion change produced by the communication was 
measured, as was perceived communicator credibil- 
ity. The major experimental manipulation was ex- 
pected novelty of arguments; that is, the articles 
that were rated, as well as the persuasive communi- 
cation that the subject actually read, were described 
in advance as containing new arguments or as con- 
taining familiar arguments. In addition to this 
variable, the communication was either consonant or 
dissonant with the subject’s initial vote. This yielded 
a 2X2X2 design with the independent variables 
being expected information novelty, communication 
consonance, and the verdict initially favored by the 
subject. 


Procedure 


The subjects were 148 students enrolled in a regu- 
Jar session introductory psychology class at the Uni- 
versity of California, Los Angeles, They volunteered 
for the experiment to fulfill a course requirement. 
Three of the subjects were not considered in the 
analysis because they failed to complete the entire 
experiment, and two were not considered because 
they misperceived the verdict favored in the per- 
suasive communication. 

As each subject arrived, he was seated in an indi- 
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vidual cubicle, and given a mimeographed report on 
“The Case of Johnny Burdick.” This report included 
somewhat condensed versions of all the testimony 
and attorneys’ summations from the (fictitious) trial 
of a juvenile, Johnny Burdick, charged with second- 
degree murder in the death of his father. His defense 
was that the death occurred as a result of man- 
slaughter. The subjects were told that the report con- 
tained all the essential evidence in the case. 

When the subject finished the report, he cast his 
vote by filling out and signing a ballot form. He was 
then given a large sign, on each side of which the 
verdict he favored was written in large letters. This 
sign remained on the desk in front of him for the 
duration of the experiment, and would allegedly be 
placed in front of him at a roundtable discussion 
with the full jury later in the experiment. This pro- 
cedure was designed to produce a relatively high 
degree of public commitment to the initial vote (see 
Deutsch & Gerard, 1955; Hovland, Campbell, & 
Brock, 1957; Sears & Freedman, 1963, for similar 
procedures). He was then given an item on how 
certain he felt about his vote, and three Likert-type 
opinion items concerning the defendant’s guilt or 
innocence. 

The subject was next given a form for rating the 
interest value of a series of article titles. The instruc- 
tions on the form stated that the papers had been 
given at a symposium on the Burdick case at Co- 
lumbia University, organized by Professor Charles B. 
Black, and had been authored by various professors 
in outstanding law schools, Black had selected the 
authors as representing several Points of view, and 
had suggested the titles of the papers. 

The remainder of the instructions were designed to 
vary the subject’s expectations regarding the novelty 
of the arguments contained in these articles. For the 


new arguments condition, the instructions continued 
as follows: 


[The authors] were asked to discuss only legal 
approaches and points of law that had not been 
touched upon in the trial, and to avoid the spe- 
cific arguments raised by the trial attorneys, Thus 
the authors have attempted to present optimum 
Statements of points which were relevant to the 
case, but which were not explicitly covered in the 
trial. In this way Professor Black hoped to obtain 
new perspectives on the Burdick case, 


The parallel instructions for the old arguments con- 
dition read as follows: 


[The authors] were asked to discuss only the spe- 
cific arguments raised by the trial attorneys and to 
avoid introducing any new points of law or legal 
approaches, however relevant they might be. Thus 
the authors have attempted to present optimum 
statements of those arguments explicitly covered 
in the course of the trial itself. In this way Pro- 
fessor Black hoped to obtain more thorough 
Coverage of the kinds of legal approaches used by 
the attorneys in the Burdick case, 


Verbal instructions were give 


X n concurrently to 
highlight the novelty dimension 


contained in the 
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written instructions. The only difference between the 
two conditions was as follows: 


As it states here, the authors are all prominent 
legal authorities, and they were asked to discuss 
only the legal approaches that did not appear in 
the court record [new arguments]. 


As it states here, the authors are all prominent 
legal authorities, and they were asked to discuss 
only the testimony and points of law that you 
read in the court record [old arguments]. 


The five article titles were: “Weaknesses in the 
Defense Case,” “Weaknesses in the Prosecution 
Case,” “Considerations Favoring Acquittal,” “Con- 
siderations Favoring Conviction,” and “Juvenile 
Crime: Personal and Social Responsibility.” Of these, 
the first and fourth were consonant with a procon- 
viction vote and dissonant with a proacquittal vote, 
while the second and third were consonant with a 


proacquittal vote and dissonant with a proconviction, 


vote, The last article was irrelevant to the vote deci- 
sion. The first two are described below as “negative,” 
and the second two as “positive”; that is, arguments 
against or in favor of a position, respectively. The 
subjects were instructed to rate the titles on the 
basis of how much they would like to read each 
paper. A 9-point scale was provided, the end points 
of which were “greatly interested” and “not inter- 
ested at all.” Each subject was also asked to, check 
the title of the paper he would most like to read. 

When the subject had finished rating the article 
titles, the experimenter returned and explained that 
the ratings were being collected in order to guide 
efforts to obtain publishers’ releases permitting re- 
production of the articles, and that none of the 
articles listed could as yet be reproduced. She then 
gave the subject an article that allegedly could be 
reproduced since it had not yet been published. Very 
brief instructions were also given to reinstate the 
novelty variation, as follows: 


This article is one we got directly from the 


author discussing legal approaches that were not 


raised during the trial [new arguments], 
as opposed to 


one we got directly from the author discussing the 
legal points raised at the trial [old arguments]. 


The instructions given in the two conditions wert 
otherwise identical. Two persuasive communications 
were used, one advocating acquittal and the other 
advocating conviction. Both communications we! 
ostensibly authored by Professor William Wallace of 
the Cornell Law School. The same two communica- 
tions were used in both the new condition and the 
old condition. 

The subject was told to signal the experimenter 
when he was finished reading the communication: 
The time the subject spent reading the communica- 
tion was measured by having the experimenter notè 
the time to the nearest 5 seconds when she left Es 
subject’s cubicle, and again when the subject’s signal 
turned on a signal light in the central cubicle. 
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TABLE 1 
Net OPINION CHANGE 


New Old Difference 
» Consonant communication® AEAN 
Acquitters +70.8% (24) +35.0% (20) +35.8% 
Convictors 433.3” (9) 416.7” (12) 167° 
Dissonant communication 
Acquitters +40.6 (32) +28.6 (28) +12.0 
Convictors +66.7 (9) 0 (9) +66.7 
Total +52.7 (74) +24.6 (69) +28.1 


* The consonant communication advocated the verdict subjects initially favored, and the dissonant communication advocated 


the opposite verdict. 


b The entry is the proportion of subjects changing in the advocated direction less the proportion changing in the opposite direction. 


The subject was given two questionnaires when he 
finished reading the communication. The first con- 
tained 10 Likert-type items on the case, including 
the 3 given earlier. Opinion change was measured on 


, these 3 items. The second included an item on which 


verdict Wallace favored, a manipulation check item 
on the novelty of arguments in the communication, 
and 5 items for evaluating the communication and 
communicator (how unbiased, factual, reasonable, 
and justified by the facts the communication was, as 
well as how much the subject liked the communica- 
tion), The manipulation check item read as follows: 


In your opinion, approximately what proportion 

of Wallace’s arguments had already been covered 

in the case report you read earlier? 

1. All of his arguments had already been covered. 

2. Almost all of his arguments had already been 
covered. 

3. Somewhat more than half of his arguments 
had already been covered. 

4. About half of his arguments had already been 
covered. 

5. Somewhat less than half of his arguments had 
already been covered. 

6. Very few of his arguments had already been 
covered. 

7. None of his arguments had already been cov- 
ered. 


When these questionnaires were completed, the 
experiment was terminated. Each subject was cau- 
tioned not to tell others about the experiment until 
the end of the semester. 


RESULTS 
Check on Novelty Manipulation 


The persuasive communications used in the new 
condition were identical in content to those used 
in the old condition. Only the advance descrip- 
tions of the novelty of their arguments differed. 
Responses to the manipulation-check item indi- 
cated that the different advance descriptions were 
successful in producing differences in perceived 
novelty of arguments. Those subjects who were 
told that the communication would contain new 
arguments rated it as actually containing more 
new material than did those who were told it 
would merely cover the same points as they had 
already read in the case report (F = 5.73, df= 
1/135, p <.025). 


Opinion Change 


It was hypothesized that more opinion change 
would occur in the new condition than in the old 
condition, As shown in Table 1, this hypothesis 
was supported. The net opinion change (see 
Hovland, Lumsdaine, & Sheffield, 1949) was 
+ 52.7% under “new” instructions, and only 
+ 24.6% under “old” instructions (= 2.12, p 
<.05). The greater effectiveness of the new 
instructions held regardless of whether the com- 
munication was consonant or dissonant with the 


TABLE 2 
NET OPINION CHANGE AND CERTAINTY IN THE INITIAL DECISION 
New Old Difference 
High certainty» 
onsonant communicates +64.7% (17) +50.0% (16) Hado 
Lasonant communication +21.4 (14) +20.0 (15) i 
certainty 
Consonant icati 4562 (16) +62 (16) +50.0 
issonant communication $502 (27 F227 0) Tos 


two most certain responses (“absolutely certain” and “quite 


“Th 
median split on certainty in the initial decision contrasted [hf Orr feeely uncertain’). 


in") with the four least certain responses (“ 


424 
TABLE 3 
SELECTIVITY OF Exposure 
Acquitters Convictors 
New | Old | New | Old 
Ratings of articles* 
Positive articles +.25 | -+.23 |—1.11 | +.48 
Negative articles +.25 |—.08 |— .50 | —.71 
n 56 48 18 21 
First choices of articles? | 52.6% | 48.6%| 13.3%] 35.7% 
n 38 35 15 14 


*Entry is mean difference between rating of consonant 
article and rating of dissonant article, with a positive number 
indicating greater interest in the consonant article. 

» The entry is the proportion of subjects indicating a con- 
sonant article was their first choice, among those subjects 
choosing either a consonant article or a dissonant article (i.e., 
excluding those choosing the irrelevant article), 


subject’s initial vote, and regardless of whether 
it advocated acquittal or conviction, 

The superiority of new arguments was due 
largely to those who were initially less certain of 
their vote decision. As shown in Table 2, the 
difference between the new and old conditions 
was 9.6% for subjects with relatively high initial 
certainty (2 = .47, ns), and 42.3% for subjects 
with relatively low initial certainty (z = 2.44, p 
<.02). The interaction of novelty and certainty 
was not significant (z= 1.22, Pp <.25), 


Selective Exposure 


Selective exposure is defined in the present 
situation as a preference for consonant, as op- 
posed to dissonant, information, No hypothesis 
was proposed regarding the effects of expected 
novelty on selective exposure. Selectivity on the 
article ratings and the first choices of articles 
was slightly greater in the old condition, but the 
difference did not approach significance in either 
case (F<1.00 and x2< 1,00, respectively). 
These: data are shown in Table 3, There was 
slightly greater selectivity in the new condition 
in time spent reading the communication, but 
again the difference did not approach significance 
(F=1.74, p < .20). Hence the novelty variation 
does not appear to have had any simple and uni- 
form effect upon selectivity of exposure. 

The major difference in selectivity was between 
those who initially voted for acquittal and those 
who initially voted for conviction. As shown in 
Table 3, the acquitters showed more selectivity 
on both the article ratings (F = 5.87, df= 1/139, 
~<.05) and the first choices of articles (x2 = 
4.93, p<.05). On both measures the convictors 
tended to prefer dissonant information to con- 
sonant information. 
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The acquitters’ greater selectivity was 
pronounced in the new-information condition, Th 
interaction of Initial Vote X Novelty on Sele 
tivity was significant for ratings of positive att 
cles (F = 4.86, df = 1/139, p <.05) and in ff 
same direction but not significant for ratings 
negative articles. On the first choices of articles 
the acquitters were considerably more selectiv 
than the convictors in the new condition (y= 
4.54, p < .05), and only slightly more selective if 
the old condition (x? = .25, p > .50). i 

In short, the acquitters demonstrated greal 
selettivity than did the convictors, and the 
ference between the two was greatest when ne 
information was expected. These differences m 
be described equally well, of course, by sayin 
that acquitters and convyictors alike tended 
prefer proacquittal information to proconvici 
information, particularly when the informatioi 
was thought to contain new arguments. In faci 
considering the new-information condition alo 
the convictors actually displayed a greater pref 
erence for proacquittal information (although i 
was dissonant with their initial votes) than dil 
the acquitters (for whom it was consonant will 
their initial votes). The difference is significanl 
beyond the .05 level on first choices of article 
(x? = 3.95), 


Discussion 


The major hypothesis of this study was sup: 
ported. When a persuasive communication W 
described in advance as containing new al 
ments it was more influential than when it wa 
described as containing familiar arguments. Thos 
subjects who were told to expect new argument 
actually perceived more new material in the co 
munication, despite the fact that identical co 


One possible explanation for this result is i 
the instructions by which novelty was manipulate 
somehow made the new communications mor 
impressive, or their source more credible. Novell 
did not, however, significantly affect evaluation 
of the communication and communicator (F 
1.00). Another possibility is that the new inst 1G 
tions motivated a more careful reading of th 
communication than did the old instructions 


2The absence of significant differences on # 
communication evaluation and time measures 
be thought to reflect only their insensitivity. 
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the verdict (and thus closest to the communica- 
tions actually given) were rated as more interest- 


' ‘ing in the old condition than in the new condition 


(F = 4.26, df = 1/139, p<.05), It seems rea- 
sonable to infer from this that the old communi- 
tation was greeted with somewhat more interest 
than the new communication, and that it was 
read more attentively, if anything. The greater 
persuasiveness of the new communication does 
not appear, therefore, to have been due to dif- 
ferences in communication evaluation, source 
credibility, or attention. 

Perhaps a more plausible explanation is that 
the subjects in the new condition felt less bound 
by their previous commitment than did the sub- 
jects in the old condition. An initial commitment 
may become invalidated, in a sense, when it be- 
comes clear that it was based upon only partial 
„information. For one thing, others are less likely 
to be critical of one’s inconsistencies if it is clear 
that new considerations have been introduced in 
the meantime. The introduction of new informa- 
tion provides ample justification for eschewing a 
Previous commitment and accepting a persuasive 
message, Persons expecting familiar arguments, 
on the other hand, have less justification for 
changing their opinions, since they presumably 
had taken all the old arguments into considera- 
tion when they formed their opinions in the first 
place. One would not expect them to be partic- 
ularly receptive to further information under 
such circumstances. 

The invalidation of a prior commitment should 
have its greatest effect among those whose public 
Commitment was not bolstered by personal cer- 
tainty. A person who is unsure of which position 
to favor is also unlikely to be persuaded by a 
simple restatement of the very information which 
originally placed him in conflict. New arguments 
may, on the other hand, enable him to adopt a 
Position with increased certainty, and therefore 
should be more persuasive. This interpretation 
awaits further test, however, since certainty in 
the initial position was not manipulated experi- 
Mentally in the present experiment. 

The finding that the acquitters showed more 
Selective exposure than did the convictors is of 
interest mainly with regard to the asymmetry of 
the present issue (barring individual differences 
Correlated with verdict preferences). Many issues 
are symmetrical in that the consequences of a 
ever, in the present experiment the former registered 
à strong difference between consonant and dissonant 
communications (F = 52,35, df=1/135, p< .001), 
and the time measure differentiated between high- 
and low-commitment conditions in a previous experi- 
ment (Sears & Freedman, 1963). Thus, there is 
Teason to believe the measures are sensitive. 
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victory by either side are approximately equal in 
importance. Most elections of public officials are 
symmetrical in this sense. Other issues are asym- 
metrical in that the consequences are more sig- 
nificant if one side wins than if the other does. 
In the present case, the consequences of falsely 
convicting a boy of murder are probably seen by 
most people (and the law) as more serious than 
are those of falsely acquitting him. On such 
asymmetrical issues, subjects disposed to the 
position with greater consequences may be in- 
clined to seek and evaluate more exhaustively any 
available information favorable to the more con- 
servative alternative. Those subjects already dis- 
posed to the more conservative position may feel 
less obligated to dispose of all possible counter- 
arguments, since less damaging consequences en- 
sue if they prove to be wrong. This interpreta- 
tion is suggested by the tendency for convictors 
to be even more interested in dissonant informa- 
tion when they think new arguments are avail- 
able. The present data clearly do not prove the 
point, but they do suggest a broader sampling of 
issue differences in future research on the de- 
terminants of selective exposure.* 

In sum, it appears that advertising a propa- 
gandistic presentation as containing new argu- 
ments enhances its persuasive effects. Expecta- 
tion of new arguments does not appear to affect 
selectivity of exposure, except possibly insofar as 
partisans of the position bearing more serious 
implications may be more motivated to dispose 
of novel counterarguments. On the basis of earlier 
research (Freedman & Sears, 1965) it was con- 
cluded that warning subjects of the position to be 
taken by a propagandist increases their resistance 
to change. The implication of these two studies 
is that a propagandist maximizes his chances for 
persuasion if he advertises his presentation as 
containing new arguments, but without specifying 
the precise views he intends to advocate. Con- 
versely, resistance to change is maximized if 
members of the audience arrive under the im- 
pression that they will hear only familiar argu- 
ments which support a position highly discrepant 
from their own. It remains for further research 
to determine the limitations of these generaliza- 
tions. 

8In this connection, it should be mentioned that 
the jury setting was selected for this experiment for 
two main reasons: all prior information on the issue 
could be controlled and held constant, and it is a 
sufficiently involving issue to produce relatively high 
degrees of commitment to an initial position. Many 
other issues and settings do not meet these two 
criteria, and different levels of involvement and 
prior familiarity with the issue might well affect the 
relationships reported here. 
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DIRECTION OF AGGRESSION AND ADAPTATION TO FREE 
OPERANT AVOIDANCE CONDITIONING * 


IRVING B. WEINER anp ROBERT ADER 
University of Rochester School of Medicine and Dentistry 


26 male medical and graduate students completed the Rosenzweig Picture- 
Frustration (P-F) Study and the Rod and Frame Test (RFT) and underwent 
an operant avoidance conditioning situation in which elimination of shock could 
be achieved by learning a button-pressing schedule, Learners were significantly 
higher in extrapunitiveness and lower in impunitiveness and conformity than 
nonlearners ; they also reported more aggressive feeling on a mood adjective 
check list. No differences on the RFT were found. The results are compared 
with other literature suggesting that persons who can express anger achieve 
more than those who deny anger or minimize frustrating situations. The P-F 
may have untapped predictive potential in situations of adaptation. i 


Previous papers (Ader & Tatum, 1961, 1963) 
describe the performance of human subjects in 
a free operant avoidance conditioning situation 
similar to that developed for use with animals 
(Brady, Porter, Conrad, & Mason, 1958; Sidman, 
1953). In this situation the subject, with elec- 
trodes strapped to his leg, is seated at a table 
with a button conspicuously within his reach. 
Virtually no instructions are given. Shock is then 
administered at regular intervals (the shock- 
shock interval—S-S), but the shock can be de- 
layed for a fixed interval (response-shock inter- 


1 Portions of this paper were presented at the 1964 
meeting of the Eastern Psychological Association in 
Philadelphia. 


val—R-S) by a button press. The previous 
has found considerable variability in response 
this situation. Some subjects fail to learn, oF 
least to execute, a button-pressing schedule wh 
eliminates shock; of these “nonlearners,” 
terminate the experiment by removing the 
trodes and walking out of the testing rol 
Among those subjects who learn, time to acqu 
tion is quite varied and seems inversely rela 
to the ratio of the R-S to the S-S interval (A 
& Scibetta, 1964). 

The frequency of aggressive comments in p 
conditioning interviews with subjects in previ 
studies suggested that some measure of the t 
cal direction of aggression in persons might 
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late to their mode of adaptation in the condition- 
__ ing situation. Also suggestive in this regard were 
subjects’ changes in aggression scores on a Mood 
Adjective Check List devised by Green and 
Nowlis (1957) to measure affect change as a 
“function of interpolated activity. The specific na- 
ture of the interview comments, which often in- 
volved direct expression of anger toward the ex- 
perimenters or toward the self for not having 
more quickly grasped the solution, led to the 
use of the Rosenzweig Picture-Frustration (P-F) 
Study in this experiment. Additionally, diverse 
reports by subjects of the degree to which their 
behavior was influenced either by the actual 
physical situation, particularly with reference to 
the presence of the button, or by their own cog- 
nitive structuring of the experience, suggested in- 
clusion of the Rod and Frame Test (RFT; Wit- 
. kin, Dyk, Faterson, Goodenough, & Karp, 1962) 
as a measure of field dependence-independence. 


PROCEDURE 


Subjects for this study were 26 male medical and 
graduate students, age 24-30, who responded to a 
general request for paid volunteers. They were tested 
in two experimental sessions a week apart. In Ses- 
sion ¥ they were interviewed briefly to acquaint them 
generally with the experimenters and the procedures 
to follow. They were then given the Mood Adjective 
Check List (MACL), personal history data were col- 
lected, and the Rosenzweig P-F and the RFT were 
administered. At the end of the session a second re- 
sponse to the MACL was elicited. 

In Session II the subject first completed a MACL 
and then was presented with the conditioning situa- 
tion, He was told he was to be taken to another 
room and seated in a chair with electrodes attached 
to his leg. He was instructed that he was not to get 
up from or move the chair, nor was he to touch the 
electrodes. He was asked not to smoke or talk, and 
he was told the experimenter would be in the next 
room, At no time was he told anything of the na- 
ture of the experiment. The subject was then taken 
to a small, virtually bare room and seated in front 
of a table. A red button on a wooden board was 
fastened to the edge of the table, directly in front 
of the subject. Electric current, set at that intensity 
which evoked an involuntary leg flexion, was ad- 
ministered through EKG-type electrodes affixed to 
the calf of the subject’s left leg. The shock was typi- 
cally described as “unpleasant,” “bothersome,” or 
Uncomfortable.” Apparatus controlling the condi- 
tioning regimen was located in an adjoining room 
from which observation through a one-way mirror 
was possible. The experimenter also made the fol- 
lowing statement: 


This part of the experiment will last from 10 min- 
utes to three hours. This is not an endurance task, 
and your performance in this situation will not 
determine the duration of the Session. 
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After the conditioning subjects were again given 
the MACL and participated in a postconditioning in- 
terview, the main purposes of which were to assess 
their reaction to the situation, deal with anxieties 
the experience might have aroused, and solicit their 
cooperation in not revealing the nature of the ex- 
periment to other potential subjects. The form of 
the interview was unstructured, and the only ques- 
tion asked concerning affect was the general inquiry, 
“What was your reaction to the situation?” 

On the basis of their performance in the condi- 
tioning situation, 13 subjects were considered learn- 
ers, 11 nonlearners. The other 2 subjects walked out 
of the experimental room without completing the 
session. Since a sample of 2 does not appear to 
justify assessment of “walkout” behavior, further 
data will be presented only on the 24 learners and 
nonlearners. A subject was considered to have 
learned when he achieved a 5-minute period during 
which he avoided 80% or more of the shocks he 
would otherwise have received as a function of his 
S-S interval. The beginning of this 5-minute period 
was taken as the time to acquisition. Among the 
learners, the time to acquisition of the learning cri- 
terion ranged from 85 seconds to 44 minutes, with a 
median of 5.75 minutes. Of the 11 nonlearners, 10 
had sat an entire hour, receiving repetitive shocks 
from 5- to 30-second intervals, without pressing the 
button; the other nonlearner pressed, but did not 
learn the pressing response to the acquisition cri- 
terion. Failure to acquire the button-pressing re- 
sponse was not associated with any of the temporal 
parameters of the conditioning situation. Available 
scores on the Medical College Admissions Test for 
10 of the learners and 6 of the nonlearners did not 
indicate any significant differences in intelligence be- 
tween the groups. 


RESULTS 


Table 1 presents the mean percentage scores 
on the Rosenzweig P-F categories for the learner 
and nonlearner groups. There are highly signifi- 
cant differences between the groups on two of 
the three categories which pertain to direction of 


TABLE 1 


Mean RosenzweIc P-F SCORES OF LEARNERS 
AND NONLEARNERS 


Non- 

Rosenzweig category ese) Ey SEan| t 
Extrapunitiveness 51.7 36.1 | 3.16 | 4.94%* 
Tatropunitiveness 24.7 28.6 | 2.77 1.41 
Impunitiveness 24.3 35.0 | 2.93 | 3.65% 

ybstacl 
3 Dominance 21.2 18.8 | 2.01 1.19 

Ego-defensiveness 50.8 SA5 ISSN AT 
Need-persistence 27.9 30.1 | 3.94 | <1 

nformi 

SA SR 67.2 76.5 | 4.29 2.22* 

*p <.05. 
wep <01 
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TABLE 2 
PRE- TO POSTSESSION CHANGES IN 
AGGRESSION on MACL 
Change in aggression 
Group eeu S x? 
Less Same More 
Session I 
Learners 3 5 5 8.69% 
Nonlearners 7 3 0 
Session II 
Learners 2 5 6 <1 
Nonlearners 2 2 6 
ee SA A 
*p <.02, 


aggression: extrapunitiveness, in which frustra- 
tion-induced aggression is turned onto the envi- 
Tonment, and impunitiveness, in which aggression 
is evaded in an attempt to gloss over the frustra- 
tion. As measured by the P-F, the learner is gen- 
erally more likely to externalize blame in a frus- 
trating situation, the nonlearner more likely to 
deny the existence either of blame or of the frus- 
tration itself, 

No significant differences occurred between the 
two experimental groups in the three categories 
of reaction type, obstacle-dominance, ego-defen- 
siveness, and need-persistence, However, an addi- 
tional significant finding appears in the group 
conformity ratio, with the nonlearners more fre- 
quently than the learners endorsing conventional 
„modes of response on the P-F. 

Table 2 lists the changes in aggression on the 
MACL for the two 
the four factors which have been most consist- 
ently identified in studies of the MACL and is 
scored from the adjectives angry, bold, defiant, 
and rebellious on the basis of their high factor 
loadings (Nowlis, 
learner and nonlearner groups did not differ in 
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quisition criterion and coming close to being 
classed as nonlearners, 

Scores on the other three MACL factors most 
consistently identified—concentration, deactiya. 
tion, and social affection—were not found to be 
associated with learning in any systematic way, 
On the RFT learners had a mean score of 5.57, 
the nonlearners a mean score of 4.44, and the 
SEaite, Was .8. The resulting ¢ of 1.41 is not sta- 
tistically significant. 

The manifest content of the postconditioning 
interviews was grossly analyzed for the presence 
of‘aggressive statements, “Aggression” was scored 
as present when a subject voluntarily stated that 
he was angry, irritated, annoyed, mad, or ag- 
gravated. “Nonaggression” was scored if the sub- 
ject specifically denied one of these feelings, and 
no score was given if neither aggression nor its 
denial was manifest. The aggression, nonaggres- 
sion, and zero scores were randomly distributed 
between the learner and nonlearner groups. 


Discussion 


The significant findings in the experiment ap- 
pear relevant to at least two areas of infer- 
ence. The first involves the relationship between 
achievement and the handling of aggression, The 
behavior of the nonlearner in the conditioning 
situation represents a less successful level of 
adaptation than that achieved by the learner, 
That this statement is more than merely a value 
judgment of the experimenters is attested to 
by the manner in which the nonlearners were 
prone in the Postconditioning interview to justify 
their behavior. Their comments ranged from “I 
thought it was an endurance test [which the in- 
structions clearly indicated it was not] and the 
button a panic button for ending the experiment 
to “I thought the button was just left over from 
another experiment and had nothing to do with 


this one.” Inasmuch as the learners were identi- | 


fied as primarily extrapunitive and the nonleatt 
ers as impunitive and learners tended to report 
and nonlearners to deny increased aggressive feel- 
ings on the MACL, the data generally support 
the hypothesis that those individuals who express 
their anger are more likely to achieve than thos? 
who deny either their anger or experiences which 
Provoke it, i 
Turning to a use of the word achievement dif- 
ferent from that employed above, it is interest- 
ing to note that such a relationship has repeat 
edly been found in Studies of academic achieve- 
ment. Kirk (1952), summarizing observations 00 
large numbers of college underachievers, states 
that such people have difficulty expressing hos- 


a 
A 


tility. Shaw and Brown (1957) and Shaw and 
Grubb (1958) report that both high-school and 
college underachievers have hostile impulses 
which are not directed into overt behavior. Shaw 
and Black (1960), using the Rosenzweig P-F, 
found greater extrapunitiveness among achievers 
than underachievers. The present study suggests 
‘that the capacity to express anger may promote 
achievement not only in the specific academic 
sense, but also as more broadly viewed in rela- 
tion to adaptation in situations of stress or chal- 
lenge. 

The relationship between conformity and fail- 
ure to learn was somewhat puzzling to the ex- 
perimenters. A possible rationale for this rela- 


| 
: tionship may be that those subjects low in con- 
j 
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formity were those with sufficient imaginative 
capacity and behavioral flexibility to conceive 
and execute the correct solution. However, it can- 
not be ignored that persons may under different 
circumstances achieve by conformity as well as 
by independence, a possibility recognized, for ex- 
ample, in the separate scales for such modes of 
achievement developed for the California Person- 
ality Inventory. The lack of significant findings 
with the RFT suggests that the subjects’ per- 
formance in the conditioning situation was inde- 
pendent of their general trends toward field de- 
pendence or independence. 

The second area of inference to which the 
present findings are pertinent is the prediction of 
overt behavior from knowledge of defensive style 
as reflected in P-F scores. The literature on pre- 
dicting from the Rosenzweig to overt behavior 
has not been particularly promising. Albee and 
Goldman (1950), for example, found no signifi- 

_ cant correlations between rated aggressive behav- 
~ ior and P-F scores in a patient population, and 
Mehlman and Whitman (1955) achieved only 
chance results in comparing P-F scores of col- 
lege students to their observed performance in 
contrived situations analogous to several de- 
Picted in the test. Actually, however, only a small 
Portion of the Rosenzweig literature has been 
concerned with such efforts to predict overt be- 
havior. Studies of concurrent, rather than pre- 
dictive, validity have been popular, the former 
_. involving comparisons between known groups and 
Of score changes following presumably induced 
3 frustration. No use of the test to predict behav- 
_ ior in a situation similar to that of the current 
Study is known to the authors. In view of the 
findings here, it may be that the P-F has predic- 
tive potential in situations of adaptation which 
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has not yet been sufficiently explored to demon- 
strate the value of the test. 
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EPPS PERFORMANCE UNDER SOCIAL DESIRABILITY 
INSTRUCTIONS * 


GARY E. STOLLAK* 
Rutgers University 
Individual groups of 206 and 178 male Ss were administered the EPPS under 


standard instructions (SI) and social desirability instructions (SDI), respec- 
tively. Under SDI, Achievement, Deference, Order, Affiliation, Nurturance, 


Endurance, and Dominance means were 


Autonomy, Succorance, Change, Heterosexuality, and Aggression means were 


significantly lower. A social desirability 


formity and affiliative behaviors, and inhibited individualistic, aggressive, and 


sexual behaviors. Since it was easily apj 


that the EPPS does not fully control for SD. The proportion of Ss choosing 
Statement A in a selected sample of 30 items under SDI correlated .19 with Ss 


under SI. Previous investigators found 
explained in terms of a sex variable. 


The Personal Preference Schedule was con- 
structed by Edwards (1954) to eliminate the in- 
fluence of the social desirability (SD) factor by 
equating each item in the 210 pairs for SD value 
and matching items of equal value. Corah, Feld- 
man, Cohen, Gruen, Meadow, and Ringwall 
(1958) have pointed out that 


Edwards made no attempt to check his items for 
judged social desirability after they had been paired. 
It seems possible that responses to pairs differ from 
responses to single items. A judge rating a pair of 
items for social desirability might make a clear-cut 
choice even though, in rating the items individually, 
-he might assign both the same value [p. 701. 


Corah et al. found judges significantly preferring, 
in 20 of a sample of 30 item pairs, one member 
of the pair as more socially desirable. When pair- 
ing Achievement and Succorance, Achievement 
and Heterosexuality, Order and Aggression, and 
Abasement and Heterosexuality, the item repre- 
senting the first of each of these pairs was always 
judged more socially desirable. Furthermore, the 
preference of judges in terms of SD correlated 
highly (.88) with actual choices of another group 
of subjects who completed the 30 item pairs un- 
der standard instructions (SI). The authors con- 
cluded that additional judgments of the item 
pairs themselves, along with revisions in pairing, 
would be needed before SD could be eliminated 
from the EPPS. 


1 This investigation was supported by a Predoc- 
toral Fellowship, MH-16,835, from the National In- 
stitute of Mental Health, United States Public Health 
Service. 

The author would like to thank Bertram D. Cohen 
for his helpful suggestions and Susan Osborne for 
her statistical aid. 

2 Now at Indiana University, 
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significantly higher; and Exhibition, 
(SD) response set emphasized con- 


plied to the EPPS, it was concluded 


higher values. The discrepancy was 


In a rebuttal study, Edwards, Wright, 
Lunneborg (1959) gave the complete EPPS 
SI to one group, social desirability instructi 
(SDI) to another group, and formed a 
group from a sample of records drawn fro! 
normative group used in standardizing the EP 
In this study, the proportion choosing State 
A in the normative group correlated an 
.93 with the SI group. The group given SDI œ 
related .69 with the SI group and .61 with i 
norm group. Using the items used by Corah 
Edwards found a correlation of .64 between 
SI and SDI groups, and .51 between the SDI 
the normative group. He concluded that the 
correlation of .88 in their study and .64 i 
was a result of their administering items 0 
context, which influences the responses to 
Although the correlation between SI and 
drops from .88 to .64, such correlations are h 
enough to conclude, though Edwards is rel 
to admit it, that performance under SDI is 
to that under SI, with the implication that 
still seems to be an important influence on E! 
performance. 

A description of the personality profil 
result under SDI has been ignored by the 
investigators, who have focused rather on ite 
correlations. It was predicted that mean sco 
on the individual personality needs wo 
Significantly different under SDI. 


expected 


1 


METHOD 


All subjects were male students in two ini 
tory psychology courses at Rutgers Universit 

The EPPS was administered to 206 subjects | 
SI. Another group of 178 was administered the 
with the following SDI, identical to those W 
Corah et al. (1958): 
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Select the statement in each pair which you think 
is more socially desirable—that is, the statement 
that would make another person look better to 
the other person if it were said of him [p. 701. 


RESULTS 


There were 14 protocols of the SI group ex- 
cluded from analysis because their consistency 
score was 8 or below, and 25 protocols from the 
SDI group for the same reason, reducing the 
total N to 192 and 153 in the SI and SDI groups, 
respectively. The proportion of protocols elimi- 
nated under SDI was significantly greater than 
under SI (¢ = 2.31, p <.05). 

Repeating Edwards’ procedure in analyzing the 
30 item pairs selected by Corah et al. (1958), in 
the context of the total 210 pairs, the propor- 
tion of subjects choosing A under SDI was cor- 
related with the proportion choosing A under SI. 
A Pearson product-moment correlation of .19 re- 
sulted, 

Table 1 gives the means and standard devia- 
Sid for each personality need under SI and 
DI. 

As Table 1 shows, under SDI, Achievement, 
Deference, Order, Affiliation, Nurturance, En- 
durance, and Dominance means are significantly 
higher than under SI; and Exhibition, Autonomy, 
Succorance, Change, Heterosexuality, and Ag- 
gression means are significantly lower. Means of 
11 of the 15 personality needs are significantly 
different under SDI beyond the .001 level; and 
13 of the 15 beyond the .05 level. 

Spearman rank-difference correlations between 
the personality need means in the SI group and 
Edwards’ college male norms (Edwards, 1954, p. 
10) yielded an expected significant value of .94, 
whereas correlation between the SI and SDI 
groups yielded an insignificant value of .03. Fur- 
thermore, the SDI group correlated only .19 with 
the college male norms, but a significant .49 ($ 
<.05) with the college female norms. 

Also seen from Table 1 is that the standard 
deviations under SDI are comparable to those 
under SI. 


Discussion 


The results of this study indicate that per- 
formance under SDI, whether measured by dif- 
ferences between personality need means or item 
correlations, differs very significantly from that 
under SI. The results further suggest that the 
low correlation was not a result of random SDI 
performance, in which case each need under SDI 
would have centered around a mean of 14, since 
there are 28 items for each need, but rather a 
consistent, significantly different performance. It 
would seem, then, that SD is a response set easily 
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TABLE 1 


MEANS AND STANDARD DEVIATIONS OF THE 
EPPS VARIABLES UNDER SI anD SDI 


M SD 
Variable 

SI SDI SI SDI 
Achievement 15.98 17.27 | 4.38 | 4.45 
Deference 10.00 15.33% | 3.75 | 3.20 
Order 9.03 13.118 | 4.74 | 4.72 
Exhibition 15,06*** | 13.70 3.18 | 3.76 
Autonomy 15.06% | 7.75 4.36 | 3.66 
Affiliation 14.07 18.50 | 4.38 | 4.17 
Intraception 15.65 15.80 5.17 | 4.24 
Succorance 11.99% | 9.60 5.15 | 4.77 
Dominance 15.62 16.65* 5.09 | 4.43 
Abasement 13.06 13.63 5.11 | 5.34 
Nurturance 14.10 20.46" | 5.14 | 4.06 
Change 16.41 | 11.75 4.97 | 3.41 
Endurance 12.03 17.55" | 5.32 | 4.76 
Heterosexuality | 18.70*** | 12.39 5.07 | 6.04 
Aggression 13.53 | 6.97 5.03 | 3.82 

*p<.05. 
ahs oi 


+++ p 2.001, from corresponding mean under opposite in- 
structions. 


applied to the EPPS, that emphasizes conformity 
and affiliative behavior and deemphasizes and 
inhibits individualistic, sexual, and aggressive 
behaviors. The greater proportion of protocols 
eliminated under SDI because of low consistency 
score suggests that for some subjects this re- 
sponse set is difficult to assume. The subjects’ 
ability to apply an SD response set suggests that 
the EPPS does not fully control for SD. 

The high correlations between groups under 
SDI and SI in Corah et al. (1958) and Edwards’ 
studies, and the very low correlation found in the 
present one, could be a result of a sex variable. 
Whereas in the present study only males were 
used, almost half of the subjects in Corah et al. 
and Edwards’ studies were females. Conformity 
and affiliative behavior would most likely be 
emitted under either SDI or SI in college fe- 
males, leading to high SDI-SI correlations; note 
the significant SDI-college female norms corre- 
lation found in this study. Although an SD re- 
sponse set does seem to exist and be applicable 
to the EPPS, it is unlikely that male subjects 
under SI respond with an SD set (correlation is 
03), Therefore, only if we obtain in a male sub- 
ject a personality profile similar to that obtained 
jn subjects under SDI would it be difficult to 
separate out an SD set from his performance; 
but it might be more difficult to separate out SD 
from female subjects’ SI performance. It might 
be that only in men is SDI so effective in effect- 
ing differential performance. Possibly, in men, an 
“jdeal”-self response set influences their responses 
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under SI, and if given, would result in person- 
ality profiles characteristic of college males, that 
is, high Achievement, Autonomy, Dominance, 
Heterosexuality, and Aggression, and low Defer- 
ence, Affiliation, Nurturance, Abasement, and 
Change. The above possibilities are being in- 
vestigated, 

It would seem to be useful to acknowledge SD 
not only as a confounding factor, but also as a 
specific personality dimension—as Crowne and 
Marlowe (1960) and Scott (1963) have—and 
make further efforts to isolate and delineate the 
variables that comprise and control it, as well as 
to explore how it affects other performances. 


REFERENCES 


Coran, N, L., FELDMAN, M. J. Conen, I. S., Gruen, 
W. Meavow, A., & Rwcwarr, E. A. Social de- 


Journal of Personality and Social Psychol 
1965, ve. 2, No. 3) 432-435 phat 


PROJECTION, ATTRACTION, AND STRATEGY CHOICES IN 
INTERGROUP COMPETITION 
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40 undergraduates played a modified Prisoner’s Dilemma game. 2 dyads played 
on each occasion. Ss chose a competitive or cooperative strategy toward both 
their opponents and their partners on each of 20 trials, Partners received more 
cooperative choices than did opponents p<.001. Ratings on personality, 
sociometric, and ability traits improved, though not always significantly, for 
both Partners and opponents. Ratings on motives likely to be aroused by the 
game improved for the partners p < .05 but decreased in favorability for the 
opponents p< .05. The study apparently illustrates a projective mechanism 
which influences the perceptions of conflicting groups, 


This study is concerned with inter- and intra- 
group relations in the setting of a non-zero-sum 
game. Such games have been used and/or dis- 
cussed in several recent articles (Bixenstine, 
Potash, & Wilson, 1963; Bixenstine & Wilson, 
1963; Deutsch, 1958, 1960; Lutzker, 1960, 1961; 
McClintock, Harrison, Strand, & Gallo, 1963; 
Minas, Scodel, Marlowe, & Rawson, 1960; Rapo- 
port, 1963; Rapoport, Chammah, Dwyer, & Gyr, 
1962; Rapoport & Orevant, 1962; Scodel & 
Minas, 1960; Scodel, Minas, Ratoosh. & Lipetz. 
1959; Wilson & Bixenstine, 1962), The present 
study is apparently different from the others in 
that it employs an intergroup setting and in that 
it reports attitude changes as well as strategy 
choices, 
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METHOD 
Subjects 


number of males and females took part. Due t0 
difficulties of recruiting subjects, the sexual compor 
sition of each group was allowed to vary hap: 
hazardly. Some extraneous variance was undoubt 
introduced as a result; however, significant trends 
have shown that females are less cooperative in this: 
type of setting; however, significant trends emerged 
nonetheless, 


Rules of the Game 


The subjects could make either a cooperative (ed) 
or a competitive or exploitative (blue) choice. © 


, 
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both teams chose blue, neither received any reward. 
If one team chose red and the other blue, the team 


' «choosing blue received the total reward which con- 


sisted of 4 pennies. If both teams chose red, both 
received 2 cents. Each individual also made an in- 
«group choice, blue or red, to determine the distribu- 
tion of the team’s winnings, if any. The rules of 
the ingroup game completely paralleled those of the 
outgroup game. The subjects made all their choices 
in secret. They, of course, made their ingroup choice 
before learning their opponents’ choices. 


Response Measures 


` 

The strategy choices, that is, the number of red 
versus blue choices made toward the outgroup and 
toward the ingroup, constituted one dependent 
variable. 

A second response measure consisted of the dif- 
ference between before and after ratings made on 
22 traits, The traits were divided a priori into four 

* groups: (a) personality traits—general personality 
traits with at least mild evaluate connotation, for 
example, anxious, dependent, gullible, and obsequi- 
ous; (b) sociometric traits—traits indicative of social 
attractiveness, for example, likable, attractive, de- 
sirable as a friend, roommate, or date, friendly, 
charming, and pleasant; (c) ability traits—capable, 
orderly, efficient, effective, and intelligent; and (d) 
motiye traits—traits indicative of motives likely to 
be aroused by the situation, for example, hostile 
generous, cooperative, fair, competitive, greedy, kind, 
mean, and stubborn. For each type of trait a com- 
posite score was obtained for each group of four 
subjects. Each session, therefore, provided one in- 
group score and one outgroup score. 

In accordance with instructions, each subject used 
a 9-point scale to rate the other three persons. 
Each subject rated only the other three persons, 
never himself. 


Apparatus and Procedure 


During the experiment the four subjects took seats 
around a small table. Each subject had before him 
a small glass container to receive pennies, a plastic 
container holding a large number of blue and red 
poker chips, and a card with a number on it 
which identified him to the other players. The first 
thing the subjects did was to fill out the rating 
sheets, The experimenter then indicated for the first 
time that the two persons on either side of the table 
would work together as “teammates,” that they 
would “play a game with the other two persons,” 
and that they “would make choices in relation to the 
ingroup and the outgroup.” The experimenter 
avoided any direct reference to competition. The 
subjects indicated their choices by selecting a red or 
a blue chip, The experimenter drew a shade down 
from the ceiling to separate the two teammates while 
they made their outgroup choice. The two teammates 
chose a chip to represent their choice toward the 
other teams, and laid it on the table before them. 
Then, while still hidden from their opponents and in 
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ignorance of their opponent’s choices, they made 
their ingroup choice. The subjects kept their ingroup 
choices secret by holding the chip hidden in their 
hand until both had committed themselves; then the 
experimenter raised the shade and both outgroup 
and ingroup choices were revealed. The experimenter 
would then drop the pennies into the glass con- 
tainers in accordance with the choices. The experi- 
menter indicated to the subjects in the beginning 
that they would play the game 20 times. After 
the last game, the experimenter asked the subjects 
to fill out the rating sheets again, indicating that 
after having had some experience with each other 
they might be able to be more accurate. 

As the session proceeded, each chip was stacked 
on top of the previous choices. In all, six stacks 
were formed, indicating the ingroup choice of each 
person and the outgroup choice of each pair, As a 
result, every player could tell at a glance the nature 
of all the previous choices made by all participants. 


RESULTS 
Strategy Choices 


The subjects made competitive strategy 
choices toward the outgroup more than twice as 
often as toward the ingroup, an average of 11.0 
versus 4.5 out of a possible 20. This result 
indicates that most groups followed a competitive 
intergroup pattern; indeed, the highest number 
of cooperative outgroup choices made by any 
dyad was 13 out of the possible 20. This result 
agrees with that of several other studies which 
also showed a predominance of exploitation 
(Minas et al, 1960; Scodel & Minas, 1960; 
Scodel et al., 1959). A recent study by Rapo- 
port (1963) indicates, however, that cooperation 
eventually occurs about 65% of the time if the 
game is continued long enough, 50 trials or more. 
In terms of total payoff, the competitive strate- 
gies employed naturally proved mutually self- 
defeating. A mutual cooperative strategy would 
result in winnings of $.80, $.40 for each dyad, 
and $.20 for each person. In fact, the highest 
score attained by any pair of dyads was $.72, 
only one dyad received as much as $.40, and 
only three subjects received as much as $.21. The 
average gain per player was only $.14. It is 
apparent, then, that essentially no one achieved 
successful exploitation in the sense of gaining 
more than he would have gained had everyone 
been strictly cooperative. In terms of relative 
payoff, however, the competitive strategy proved 
more favorable. The dyads that won more than 
their opponents also made more competitive 
outgroup choices, t=2.77, p<.05. Likewise, 
those who won more than their partner made 
more competitive ingroup choices than their 
partner, £= 2.72, p <.05. 
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TABLE 1 
MEAN SHIFTS IN FAVORABILITY ON THE DIFFERENT Traits (N or Sessions = 10) 
Opponents Partners Difference 
Trait 

M Ua M ta M b 
P lit 43 1.2 5.5 2.4* 1.2 0.63 
Eodometric 9.2 1.6 7.8 2.0 1.4 0.47 
Ability 2.9 0.7 3.2 1.2 0.3 38 e 
Motive —15.9 —3.1** 9.6 2:3* 25.5 5.62 


t test indicating significance of before-after shift, - 
È Direct diference t test indicating significance of difference between ingroup shift and outgroup shift. 


*p <.05, 
*+ b < 02. 
Pp < 001. 


Shifts on Rating Scales 


The mean shifts on the personality, socio- 
metric, ability, and motive traits appear in 
Table 1, In the case of personality, sociometric, 
and ability traits, the changes always go in the 
positive direction for both the outgroup and the 
ingroup; and although the individual shifts usu- 
all do not reach significance, the pattern seems 
extremely consistent. In no case is the ingroup 
shift significantly different from the outgroup 
shift. This result replicates the study by Wilson 
and Miller (1961) in which a similar result 
appeared in conjunction with a similar group 
of traits. 

The motive traits show a markedly different 
pattern, going up significantly for the ingroup 
and down significantly for the outgroup. In this 
case the difference between ingroup and outgroup 
is, of course, highly significant. This differential 
response on the motive traits seems all the more 
striking because of the marked contrast to the 
other traits that showed no differential effect. 


Correlation between Ingroup and Outgroup 
Strategies and Ratings 


Several correlations served to test for a rela- 
tion between responses to the ingroup and the 
outgroup. Of these several but not all proved 
significant. The correlation between the ingroup 
and outgroup strategy choices, based on an N 
of 20 dyads, was a nonsignificant .28. Correla- 
tions between each person’s ingroup and outgroup 
shift on each of the four types of traits indi- 
cated correlations of .69, .66, .55, and .19 for 
the personality, ability, sociometric, and motive 
traits, respectively; an r of 30 being needed 
for significance at the .05 level. 


Discussion 


The differential rating of the ingroup and out- 
group on the motive traits constitutes probably 
the most important finding. Clearly this differ- 


ential rating cannot reflect real differences in 
behavior since the ingroup and outgroup objects 
are, in toto, the same people. Furthermore, the 
experimental setting allowed each subject to see 
very readily what the other three subjects were 
doing. The biased ratings take on even more 
interest when it is noted that they are limited 
to motive traits particularly relevant to the 
situation. It was the lack of intergroup coopera- 
tiveness, manifested just as much by each subject 
and his partner as by their opponents, that re- 
sulted in mutual frustration. After the session, 
however, the subjects rate their partners as 
showing less of such traits and rate their op- 
ponents as showing more. The ratings are seem- 
ingly biased so as to explain the status quo 
without discredit to the ingroup. One might 
argue that the instructions to proceed as two 
groups might have encouraged ingroup coopera- 
tion and outgroup competition and that the 
generally amiable atmosphere of the sessions 
might have encouraged the overall favorable 
ratings on some traits. It is difficult to see; 
however, in what manner the definition of the 
game situation could have, at the same time, 
encouraged the biased ratings on the motive 
traits. The occurrence of the bias, therefore, 
Suggests that a projective mechanism was acti- 
vated by the moral conflict inherent in the rules 
of the game. Clearly, such a mechanism poses 4 
basic threat to intergroup harmony, and out 
lack of an understanding of how to control or 
reverse it continues to pose a challenge to soci 
science research, 
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BLOCKING AND INHIBIT! 


Based on earlier resear 


_ Within the last 20 years a number of studies 
have appeared in the literature which have re- 
Ported on personality correlates of perceptual 
blocking. The measure of blocking that usually 
has been used is the discrepancy between the 
Tecognition threshold for a relatively neutral 
Stimulus and the recognition threshold for a 
stimulus with drive-related content. The most 
| Consistent finding from these studies has been 

i that the greater the discrepancy, the less apt is 
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A NOTE ON THE RELATIONSHIP BETWEEN PERCEPTUAL 
ON OF DRIVE EXPRESSION 


LLOYD H. SILVERMAN 
Northport Veterans Administration Hospital and New York University 
anp LESTER LUBORSKY 


University of Pennsylvania 


60 male college students were divided into 2 groups based on the difference be- 
tween their perceptual thresholds for a neutral stimulus (a triangle) and a drive- 
related stimulus (a female torso). One group consisted of 19 Ss who took a 
long time to recognize the torso relative to their reco} 


the other of 41 Ss who took a relatively short time 
ch findings it was hypothesized that on the Rorschach, 


gnition of the triangle and 
to recognize the torso. 


the 2nd group would give significantly more responses than the first reflecting: 
(a) heterosexual drive expression; (b) 0 
hypothesis was supported while the 2nd was not. 


ther kinds of drive expression. The Ist 


the subject to express the particular drive rep- 
resented by the experimental stimulus. The de- 
gree of drive expression usually has been as- 
sessed from psychological tests (Eriksen, 1950, 
1951; Kohn, 1960; Lazarus, Eriksen, & Fonda, 
1950), though in one instance it was also judged 
from overt behavior (Kohn, 1960). Additionally, 
in three of these studies it has been found that 
large discrepancy scores have also correlated sig- 
nificantly with relatively little drive expression 
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of other kinds, that is, of drives mot represented 
by the experimental stimulus (Eriksen, 1951; 
Kohn, 1960; Lazarus et al., 1950). 

Recently a study was carried out for a different 
purpose which provided another source of data 
bearing on the relationship between perceptual 
blocking and drive expression (Silverman & 
Silverman, 1964). In this experiment, a drive- 
related stimulus—a drawing of a female torso— 
was presented subliminally to 60 male college 
students and the effects of its registration were 
sought in subsequent Rorschach responses. A way 
in which this experiment differed from others in 
this area was that differential predictions were 
made and borne out as to how the various sub- 
jects would respond to the subliminal exposure. 
The subjects were divided into groups based on 
two threshold variables, one of which is relevant 
here. This was the ratio of recognition threshold 
for the experimental stimulus to recognition 
threshold for a relatively neutral stimulus, a 
triangle, On the basis of a cutoff score chosen 
a priori, the 60 subjects were divided into two 
groups: 19 “deniers,” that is, subjects who took 
a long time to recognize the torso relative to 
their recognition of the triangle, 41 “nondeniers,” 
those who took a relatively short time to recog- 
nize the torso. These two groups differed from 
each other in their Rorschach responses following 
the subliminal exposure of the torso, the deniers 
reacting with an increase in defensive operations 
while the nondeniers manifested increased hetero- 
sexual drive expression. The following comment 
was made about this finding. 


The Ss’ reactions during threshold determination for 
the female torso can be said to be consistent with 
their reactions in the Rorschach after the subliminal 
exposure of this stimulus, Thus, if the S’s psycho- 
logical makeup was such that a strong defensive need 
was called into play when the torso was shown for 
progressively longer exposures, defensive needs were 
apt to be prominent when associates to the stimulus 
were stirred in the experimental session, 


It would be in keeping with both the above 
formulation and the research findings discussed 
earlier to anticipate that the base rate of hetero- 
sexual drive expression would be greater for the 
nondeniers than for the deniers, Data bearing on 
drive expression uninfluenced by the experimental 
condition was available from the first of two 
“base-line” Rorschachs administered to all 60 
subjects under the same conditions (ie., before 
subliminal exposure of the torso), These proto- 
cols were scored blindly for drive content ac- 
cording to criteria outlined in Holt’s (1960) Pri- 
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mary Process Manual. Three scores were de- 


rived: (a) libidinal content that was judged a | 


priori to be related to the experimental stimulus. 
This included Holt’s subcategories of “sexual,” 
“exhibitionistic-voyeuristic,” and “miscellaneous,” 
the last of these because it involved references 
to the female reproductive tract (“womb,” “fal- 
lopian tubes,” etc.); (b) other libidinal content 
“oral,” “anal,” and “homosexual”); (c) aggres- 
sive content. 

For the first of these categories, the difference 
between groups was significant in the anticipated 
direction (¢= 1.75, p = .05). However, for the 
other two categories, the differences did not even 
approach significance (¢ << 1). We therefore con- 
clude that our data: support the formulation that 
perceptual blocking is related to inhibited expres- 
sion of the drive that is represented by the stimu- 


lus on which the subject blocks; do not support 


the contention that perceptual blocking is related 
to generalized drive inhibition. While the latter 
negative finding is at variance with some earlier 
research reports (Eriksen, 1951; Kohn, 1960; 
Lazarus et al., 1950), it is not inconsistent with 
clinical reports based on psychotherapeutic and 
psychodiagnostic contact with patients. Clinicians 
have often noted that certain drives are accept- 
able and others unacceptable to a particular in- 
dividual and have even alluded to one type of 
drive being called forth as a defense against the 
expression of a different drive. 


1 We are deeply indebted to Reeva Safrin who did 
the scoring for us, 
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AND EVALUATIVE ASPECT OF BELIEFS ABOUT 
THE ATTITUDE OBJECT: A 


A COMPARISON OF SUMMATION AND CONGRUITY THEORIES + 


LYNN R. ANDERSON 
Wayne State University 


AND 


MARTIN FISHBEIN 
University of Illinois 


5 groups of 20 Ss read stories containing differential amounts of positively 
evaluated information about an incidental character (stimulus object). Ss were 
then asked to evaluate the incidental character as well as all the adjectives used 
to describe her. Finally, they indicated how strongly they believed each of the 
adjectives was, indeed, a characteristic of the stimulus object. 3 different 
quantitative predictions of each $’s attitude toward the stimulus object were 
computed; 2 predictions were based on different extensions of congruity theory 
and the 3rd was based on summation theory. The results indicated that sum- 
mation theory yielded significantly better predictions of attitude than either 


extension of congruity theory. 


In a recent series of articles, Fishbein (1961, 
1963, 1965) has presented a theory of attitude 
organization and change based on the relation- 
ships between beliefs about an object and the 
attitude toward that object. Consistent with 
Osgood, Suci, and Tannenbaum (1957), attitude 
is defined as the evaluative dimension of a 
concept, where the term “concept” refers to any 
discriminable aspect of an individual’s world, 
verbalizable or not. In Osgood’s behavior theory 
terminology, an attitude can be described as 
a mediating evaluative response associated with 
any stimulus (Osgood et al., 1957). Operation- 
ally, an individual’s attitude toward any concept 
can be assessed by having the individual rate 
the concept on a series of bipolar evaluative 
scales (e.g., good-bad, clean-dirty) of the 
Semantic differential form. 

Similar to the definition of attitude, belief is 
defined as the probability dimension of a con- 
cept. Belief i a concept is seen as the proba- 
bility that the concept per se (e.g, ESP) does 
_ exist, while belief about a concept is defined as 
_ the probability that a specific relation exists 
between the concept and some other object, 
Concept, value, or goal (e.g., ESP is mystical, 
ESP is useful). Beliefs can be measured by 
\ having the subject rate the assertion (belief 
‘about) or the concept per se (belief in) on 


fi 1This study was supported in part by a grant 
tom the University of Illinois Research Board. We 
are indebted to S. Feldman, J. W. Julian, J. E 
McGrath, C. E. Osgood, and H. C. Triandis for 
eir critical readings of an earlier version of this 
ie In addition, we would like to express our 
‘ a appreciation to C. E; Osgood for suggesting 
e test of his new extension of congruity theory- 


a series of bipolar probabilistic scales (eg. 
probable-improbable, likely-unlikely) of the 
semantic differential form. This latter instrument 
has been called the B scale by Fishbein and 
Raven (1962), and considerable support for its 
reliability and validity has been presented. 

Essentially, Fishbein’s theory leads to the pre- 
diction that an individual’s attitude toward any 
object is a function of his beliefs about the 
object and the evaluative aspects of those be- 
liefs. The prediction can be expressed by the 
following formula: 


N 
Ao = > Bias 


iml 


where: 


Ao= the attitude toward object “o” 

B,=the strength of belief i about o (ie, the 
probability that o is related to some other object 
0) 

a, =the evaluative aspect of B, (ie, the evalua- 
tion of x) 

N = the number of beliefs. 


As can be seen from the formula, the attitude 
toward the object is predicted to be a part 
function of the total amount of affect associated 
with each of the beliefs about the object. 
Fishbein (1961) and his associates (Fishbein & 
Hunter, 1964; Triandis & Fishbein, 1963) have 
previously pointed out that this prediction is 
opposed to the predictions which would follow 
from most of the theories of attitude change 
based on a notion of “consistency.” In general, 
the consistency theories state that an individual’s 
attitude toward an object is essentially based 
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on a mean, or weighted mean, of the affect as- 
sociated with each of the beliefs about the 
object. For example, both the principle of con- 
gruity (Osgood et al., 1957; Osgood & Tannen- 
baum, 1955) and Heider’s (1958) balance theory 
would predict that if an individual held two 
beliefs about one object, one belief containing 
a high postive evaluative aspect and the other a 
low positive evaluative aspect, the individual’s 
attitude toward the object would essentially 
represent a convergence or mean of these two 
evaluative aspects.2 In contrast, Fishbein’s 
theory would predict that the attitude is a 
function of the algebraic sum of the evaluative 
aspects of the beliefs. To summarize then, a 
basic difference between Fishbein’s theory and 
the various consistency theories is that Fishbein 
views attitude organization and change as a 
process of cognitive summation, while con- 
sistency theories view attitude organization and 
change as a process of cognitive balance. 

Two recent studies provide support for the 
summation approach. Triandis and Fishbein 
(1963) compared predicted and obtained values 
of subjects’ attitudes toward complex stimuli. 
One set of predictions was based on an exten- 
sion of congruity theory, while another set of 
predictions was based on summation theory. The 
results clearly indicated the superiority of the 
summation theory predictions. Fishbein and 
Hunter (1964) provided four groups of 40 sub- 
jects with different amounts of positively evalu- 
ated information about a fictitious person. The 
information was presented in such a way that 
the total amount of affect associated with the 
information increased as a function of the 
amount of information, while the mean amount 
of affect associated with the information de- 
creased as a function of the amount of informa- 
tion presented. Consistent with summation 
theory, the amount of attitude change was a 
function of the total amount of affect associated 
with the information presented, In addition, in- 
direct evidence suggesting cognitive summation 
has also been obtained by Feldman (1962), Ker- 
rick (1959), Podell and Podell (1963), and 
Tanaka (1964). 

The present investigation is a further attempt 
to test Fishbein’s theory. While all of the above 
studies provide support for a summation theory, 
none are strict tests of Fishbein’s position, That 
is, in none of the above studies was the strength 
of belief (B,) utilized in the prediction of atti- 


5 2For a more complete discussion of the distinc- 
“tion between summation and balance theory see 
Fishbein (1961), Triandis and Fishbein (1963), and 
Fishbein and Hunter (1964). 
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tudes. In addition, with the exception o 
Triandis and Fishbein study, none of the 
studies has attempted to compare the 
quantitative predictions of two different 
With respect to this latter study, hoy 
Osgood (1963) has recently presented an 
sion of congruity theory that differs from 
earlier extension utilized by ‘Triandis 
Fishbein. This extension was in part pres 
in response to studies “which suggest an 
tive rather than a polarity-weighted functio 
As Osgood points out, however, “Whether 
ferences would be obtained in predictions 
these two conditions ... [the new as op 
to the “old” extension] ... remains to 
seen.” 

Thus, the purpose of the present pap 
threefold: to provide a further test of the 
pothesis that attitude change is a function w 
the total affect associated with each of a 
son’s beliefs about the attitude object; to 
vide an explicit comparison between Fishi 
summation theory and Osgood’s con 
theory; and to compare Osgood’s new ext 
of congruity theory with the extension previi 
utilized by Triandis and Fishbein. 


METHOD 
Subjects 


One hundred undergraduate subjects (50 male 
and 50 females) were randomly assigned to 
groups. The only restriction placed on the ra 
assignments was that there would be an 
number of males and females in each group. 


Procedure 


The experiment was administered under the 
of a Literary Preference test in which the subj 
were to evaluate each of four mimeographed pi 
graphs taken from a short story. The al 
object was an incidental character, Mrs. Wi 
who was mentioned in each of the four paragi 
In order to provide a pretest measure of 
the subjects, before reading the paragraphs, 
the name Mrs. Williams and other charact 
the story on the A Scale of Fishbein and R 
(1962). The A Scale consists of five empi 
determined, evaluative scales of the semantic 
ferential form. Since each polar pair of adj 
had a 7-point scale, a rated concept could 
a score ranging from —15 (negative evaluation) 
+15 (positive evaluation), The five scales 1 


y 3A copy of the paragraphs used has been 
ited with the American Documentation Instit 
Order Document No. 8463 from ADI Auil 
Publications Project, Photoduplication Service; 
brary of Congress, Washington, D. C. 20540. 
in advance $1.25 for microfilm or $1.25 for phi 
copies and make checks payable to: Chief, Ph 
duplication Service, Library of Congress. 
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were: harmful-beneficial, foolish-wise, dirty-clean, 
bad-good, and sick-healthy. 

* After the pretest all subjects were then instructed 
to read and to evaluate each of the four para- 
graphs by circling a letter grade beneath each 
paragraph. The paragraphs were identical in each of 
the five experimental groups except for the inclusion 
of the evaluative adjectives used to describe Mrs. 
Williams. 

Group I. This group represents a control in that 
no adjectives or evaluative characteristics were used 
to describe Mrs. Williams in any of the four 
paragraphs. 

Group II. For the subjects in this group, the 
adjective “honest” was inserted before the noun 
“Mrs, Williams” in the first paragraph. The three 
remaining paragraphs were identical to those of 


i Group I, 


Group II, The adjective “honest” again described 
Mrs, Williams in the first paragraph, and the adjec- 
tive “friendly” was attributed to her in the second 


“paragraph, The remaining two paragraphs were 


identical to the control paragraphs. 

Group IV. The adjective “honest” was inserted in 
the first paragraph, “friendly” was inserted in the 
second paragraph, and “helpful” was used to de- 
scribe Mrs, Williams in the third paragraph. The 
fourth paragraph was again identical to the control 
group. 

Groyp V. This group was similar to Group IV in 
having Mrs. Williams described as “honest,” 
friendly,’ and “helpful” in the first three para- 
gtaphs, respectively. In the fourth paragraph, how- 
ever, the adjective “helpful” was repeated. Thus, 
for the subjects in this group, Mrs, Williams was 
described as being helpful in both the third and 
fourth paragraphs of the short story. Group V was 
included in the experiment in order to assess the 
effects of a single repetition of a belief statement 
on attitude change. 

At the subjects had read and evaluated each 
g 1¢ four paragraphs, they were told that the 
menter was also interested in determining the 
as of impression which the paragraphs had cre- 
Th regarding each of the characters in the story. 
as mueiects were asked again to describe the fic- 
a x characters, including Mrs. Williams, on the 
fa cale. This measure provided the posttest atti- 
le score. 
fa at to determine if the subjects had ac- 
EN learned the experimental characteristics at- 
CM ed to Mrs. Williams, three assertive statements 
frie a Williams is honest,” “Mrs. Williams is 
Aa ly,” and “Mrs. Williams is helpful”) were 
a er by all subjects on Fishbein and Raven’s 

1962) B Scale. This scale is also of the semantic 
Tee form and contains five, empirically de- 
feet Probabilistic scales (impossible-possible, 
ble a nonexistent-existent, improbable-proba- 
a NS ely-likely). The scale provides a measure 
thus subject’s beliefs about Mrs. Williams, and 

eee the belief score (B:) utilized in Fish- 
aie Prediction formula, The subjects were finally 

to evaluate each of the descriptive adjectives 


439 


TABLE 1 


OBTAINED MEAN EVALUATION SCORES FOR THE 
EXPERIMENTAL GROUPS 


Experimental Number of adjec- 


group tives presented Obtained M 
I 0 .95 
0 1 315 
I 2 2.85 
IV 3 5.90 
Vv 4 


“Although four adjectives were used in describing Mrs. 
Williams in Group V, only three diferent adjectives were 
presented with one adjective being presented twice. 


(honest, friendly, and helpful) on the A Scale. 
These ratings provide a measure of the evaluative 
aspect of each belief (as) used in Fishbein’s pre- 
diction formula and a measure of the evaluation 
and polarity of each of the adjectives used in the 
congruity prediction formulas. 


RESULTS 
Obtained Evaluations of the Attitude Object 


The pretest measure indicated that the five 
experimental groups were comparable in initial 
evaluation of the attitude object (F = 414, dj 
= 4/90). Thus, analyses were conducted only 
on the data obtained on the posttest A Scale. 
The main hypothesis predicted that the subjects 
evaluation of the attitude object, Mrs. Williams, 
would be a function of the total amount of 
affect associated with each of the characteristics 
attributed to the object. Since only positively 
evaluated adjectives were presented to the ex- 
perimental groups, the mean evaluative score 
should increase with the number of adjectives 
presented if the hypothesis is to be supported. 
The mean posttest score for each of the five 
experimental groups is presented in Table 1. 
When tested by an analysis of variance, these 
means proved to be significantly different (F= 
6.31, df = 4/90, p<.01)* In addition, the 
slope of the regression of attitude scores on 
number of adjectives presented was tested and 
found to be significantly different from zero 
(F =11.98, df= 1/90, p < 001). Thus, in sup- 
port of a summation theory, attitude scores do 
increase significantly with the number of adjec- 


4 The significance test was made on the data ob- 
tained from all five experimental groups. Although 
four adjectives were presented in Group V, only 
three different adjectives were utilized. Hence, the 
data from this group may not represent a direct 
test of the hypothesis. However, when a similar 
analysis of variance test was made using only the 
data from the first four groups, the F ratio was 
significant beyond the .05 level. 
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tives presented, and therefore, with the total 
amount of affect associated with each belief. 


Differential Predictions of Attitude towards the 
Attitude Object 


Three predicted attitude scores were computed 
for each subject: based on Fishbein’s summation 
theory, based on Triandis and Fishbein’s exten- 
sion of congruity theory, and based on Osgood’s 
more recent extension of congruity theory. 

Summation theory predictions. According to 
the prediction formula presented by Fishbein, 
an individual’s attitude toward any object should 
be a function of the sum of the products of 
beliefs about the object (B,) and the evaluative 
aspect of these beliefs (a;). Each subject had 
indicated the strength of his beliefs that “Mrs. 
Williams is ‘honest,’ ‘friendly,’ and ‘helpful,’” 
and had evaluated each of the three descriptive 
adjectives. Thus, by direct application of the 

N 
formula (ie., the predicted attitude = >> B;a;) 
i=l 
a predicted attitude score was computed for 
each subject. 

Congruity theory predictions: Triandis and 
Fishbein’s extension, In a recent paper, Triandis 
and Fishbein (1963) presented a simple mathe- 
matical extension of congruity theory that per- 
mitted the consideration of more than a single 
adjective-noun combination. The formula is 
identical to the one presented by Osgood et al. 
(1957), but it allows for the case where more 
than one adjective is associated with the noun. 
The formula is given as follows: 


N 
CX lail (ai) + lanl (an) 
Predicted attitude = = 


N 
CE lail] +lanl 


i=l 


where: 


_ |ai|= the absolute evaluation of polarity of ad- 
jective “i” 
(a;) = the algebraic evaluation of adjective i 
|an|= the absolute evaluation or polarity of the noun 
(an) = the algebraic evaluation of the noun 
N = number of adjectives associated with the noun. 


By direct application of the formula, predic- 
tions were made for each subject based on his 
pretest evaluation of the noun “Mrs. Williams” 
and his posttest evaluations of each of the 
descriptive adjectives. It should be noted, how- 
pene, that only those adjectives which were 
“presented to the subject in the communication 
were utilized in computing his predicted atti- 


BRIEF ARTICLES 


tude score.» Although only three different ad- 
jectives were presented in Group V, the second f 
presentation of the adjective “helpful” was 
treated in the formula as a separate adjective, 

Congruity theory predictions: Osgood’s new 
extension. Osgood (1963) has recently stated 
that the formula proposed by Triandis and 
Fishbein may be inadequate in representing the | 
actual cognitive “balancing” that takes place a 
the individual is given more information about 
an attitude object. The objection is based 
mainly on the notion that a phrase such as “he 
is ‘a wise, unscrupulous man” should be viewed | 
as two separate assertations: the man is wise, 
and the man is unscrupulous. 

“According to this view, the meaning of man 
resulting from the man is wise should first be! 
determined and then ‘plugged’ into the second 
assertion, the man (as qualified) is unscrupulous 
[Osgood, 1963, p. 103].” Thus, the major ob: 
jection to the formula proposed by Triandis and 
Fishbein is that it does not take into account 
this step-by-step convergence of evaluation, but | 
rather, assumes that convergence occurs in one | 
cognitive operation, Although Osgood’s exten- 
sion was formulated specifically to handle the | 
prediction of complex stimuli (e.g., the, wist 
unscrupulous man; honest, friendly Mrs. Wil 
liams), it can be seen that his objection is pare) 
ticularly apropos in the present experiment snig 
the subjects actually received separate assertions 
about Mrs. Williams in the communication, The” 
general formula for the new extension is given | 
as follows: 


Í 
l 
Í 


Predicted attitude = 
redicted attitude Jai] F lanal 


where: 


|a:| and (a;) = the absolute and algebraic evaluation 


of the “i” adjective P, 
|an;_,| and (ap, 1) = the absolute and algebral 
evaluation of the noun modified by i — 1 adjectives: — 


For example, the predicted attitude for a stu? 
receiving two adjectives and a noun woul d 
a function of the evaluation of the second i 
jective and the evaluation of the noun after! 
had been modified by the first adjective. nets 
again, the subjects’ evaluations of the adjectives 
and their pretest evaluation of Mrs. William’ 
were used to compute the predicted scores. Here 
too, only those adjectives actually presented 
the subjects were included in the computations 
It should also be recalled that in Group 7 


5In the case where only one adjective was i 
sented the formula is identical to the one prese" 
by Osgood et al. (1957). 
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which received the adjective helpful twice, the 
, adjective was considered as two separate adjec- 
tives (ie, Mrs. Williams, modified by honest, 
friendly, and helpful, is again modified by help- 
ful for the final predicted score). 

The intercorrelations of the three predicted 
scores and the obtained scores may be seen in 
Table 2. All three of the predictors are signifi- 
cantly correlated with the obtained scores (b < 

001). However, consistent with previous find- 
ings, the predictions based on Fishbein’s sum- 
mation theory are significantly more accurate 

= than the predictions based on either form of the 
congruity theory (Z=2.74, p<.01 for both 
comparisons). These findings provide further 

y support for Fishbein’s theory and once again 
suggest that attitude organization and change is 

a function of cognitive summation rather than 
* „cognitive balance. Table 2 also indicates that 
Osgood’s new extension of congruity is highly 
correlated with the extension previously uti- 
lized by Triandis and Fishbein (r= .86) and 
perhaps more importantly, indicates that the 
new extension adds no predictive power over 
the original formulation. 

n Two basic objections could be raised concern- 

ing the intercorrelations presented in Table 2. 

First, with respect to the comparison between 

summation and congruity theory, it should be 
noted that in computing the summation theory 
predictions all of an individuals beliefs are 
taken into account in the computation, regard- 
less of whether or not they were actually in- 
cluded in the communication. The congruity 
formulas, on the other hand, use only the evalu- 
ation of those adjectives actually included in the 
message presented to the subject. Thus, the ob- 
tained differences between summation and con- 

TEN may be artifacts produced by the utiliza- 

ion of different amounts of information in the 
prediction formulas. Second, with respect to the 
comparison of the two extensions of congruity 


TABLE 2 


INTERCORRELATIONS OF THE VARIOUS PREDICTED 
SCORES AND THE ACTUAL POSTTEST 
EVALUATION OF THE ATTITUDE 
Oxyect (N = 100) 


i Congruity | Congruity | -Fishbei Obtained 
icti prediction BEA osttest 
prediction | Pittonded) | Prediction erination 
Y @) ) (4) 
1 f 
2 Di 86 46 38 
3 < AT 39 
i ca 


Note —, i 
&—All correlations are significant beyond the .001 level. 
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TABLE 3 


CORRELATION OF THE VARIOUS PREDICTED SCORES AND 
THE ACTUAL PosTTEST EVALUATION OF THE 
ATTITUDE OBJECT FOR Group IV 


(Ñ = 20) 
ay | Congmity | fi Obtained 
Congruit; Pati Fishbein 
prediction | prediction prediction | ,POsteess 
a) (2) @) @ 
1 — 69% -30 38 
2 A AG 34 
3 — 660 
4 = 
*p <.05. 
> <01. 


theory, it is possible that much of the correla- 
tion is spurious. Both extensions lead to identi- 
cal predictions when only one adjective is asso- 
ciated with the noun (Group II) or when no 
adjective is associated with the noun (Group I). 
Thus, 40 of the 100 scores predicted by both 
formulas are identical. 

In order to check against both of those ob- 
jections, the correlations reported in Table 2 
were computed again for only those subjects in 
Group IV (i.e, the group receiving all three 
adjectives in the communication). Thus, for 
these 20 subjects, all three predicted scores are 
obtained from the same amount of data, and 
hence maximal differences between the two 
extensions of congruity theory should be ob- 
tained. While subjects in Group V also received 
all three adjectives, it should be recalled that 
in the computation of the congruity predictions 
the repeated adjective was treated as a fourth 
adjective and thus the same problem of utilizing 
different amounts of information would be en- 
countered if Group V had been used in the 
present analysis. 

The intercorrelations of the three predicted 
scores and the obtained scores for the 20 sub- 
jects in Group IV may be seen in Table 3. The 
results of the analyses based on the smaller 
sample are virtually identical to the results 
presented in Table 2. Thus, the higher accu- 
racy of prediction obtained from the summation 
theory formulation is obviously not a statistical 
artifact of the number of adjectives included in 
the formula, Similarly, although the correlation 
between the two extensions of congruity theory 
is slightly lower (r = .69), it is still highly sig- 
nificant (p < -01). Finally, it can once again be 
noted that Osgood’s extension of congruity theory 
does not lead to an increase in predictability. 
To summarize briefly then, these results strongly” 
support a summation theory, and in particular, 
Fishbein’s theory of the relationships between 
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beliefs and attitudes. In addition they seriously 
question the utility of Osgood’s new extension 
of congruity theory, as well as the older extension 
suggested by Triandis and Fishbein. 

Effect of repetition on attitude change. Sub- 
jects in Group V were presented with the three 
different adjectives as well as a repetition of one 
of the adjectives (helpful). A comparison of 
this group’s attitude toward Mrs. Williams with 
the mean attitude of Group IV, which received 
only the three adjectives, constitutes a test for 
the effect of repetition on attitude change. Sub- 
jects in Group V were significantly more favor- 
able toward Mrs. Williams than were subjects in 
Group IV (7.10 versus 5.90, ¿= 7.31, df = 38, 
p<.001).6 Thus it appears that attitudes can 
be changed by even one repetition of the “mes- 
sage.” According to Fishbein’s theory, this 
change should result primarily from an increase 
in the strength of belief that “Mrs. Williams is 
helpful.” If presenting the adjective twice in 
Group V did serve to strengthen the subjects’ 
beliefs, then the statement “Mrs. Williams is 
helpful” should have been rated higher (i.e. 
more probable) in Group V than in Group IV. 
Consistent with this prediction, the obtained 
difference between the means (7.45 versus 3.75) 
was significant (ż = 1.79, df= 38, p < .05 using 
a directional test of significance). 

Indeed, one would expect that the main locus 
of effect of a communication would be to change 
and/or increase subjects’ beliefs about the atti- 
tude object. That is, subjects whose communi- 
cations mentioned “honest Mrs. Williams,” 
should believe that “Mrs. Williams is honest” 
more strongly than subjects who did not receive 
this information. Similarly, subjects receiving 
information about “friendly Mrs. Williams” 
Should believe that “Mrs. Williams is friendly” 
more strongly than subjects not receiving this 
information. While the overall effects suggest 
that this is true (the mean belief scores on those 
statements contained in the communications = 
6.26, while the mean belief scores on those 
statements not contained in the communications 
=5.19; t, corrected for correlated scores, = 
2.09, df= 298, p<.05), analyses of the indi- 
vidual belief statements suggest a more com- 
plex relationship between information contained 
in the communication and belief change. For 
example, although. subjects in Group II only 
had the adjective honest paired with Mrs, Wil- 
liams, this information produced a significant in- 
crease in their belief that “Mrs. Williams is 
friendly,” as well as the expected increase in 
the belief that “Mrs. Williams is honest.” Al- 


€ Unless otherwise noted, all ¢ tests are two-tailed. 
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though these findings suggest some generalization, 
the results are not indicative of any simple 
systematic effect. A fuller discussion and int 
pretation of these data must await addi 
research. 


SUMMARY AND CONCLUSIONS 


This paper presented a comparison betwee 
two competing theories of attitude organizatio 
and change. On the basis of Fishbein’s (1961 
summation theory, it was predicted that an 
vidual’s attitude toward any object would bef 
part function of the total amount of affect a 
sociated with each of the individual’s belief 
about the object. In contrast to this, Osgoodi 
congruity theory, and indeed, most theori 
based on a principle of consistency or balance 
predicts that attitude is a part function of 
mean amount of affect associated with an ind 
vidual’s beliefs. The results strongly supporté 
the summation theory hypothesis. i 

In addition, an explicit comparison of sumi 
mation and congruity theory was made. Predi¢ 
tions based on Fishbein’s summation theo 
correlated significantly higher with obtained ab 
titude scores than predictions based on congrui 
theory. This result held true for Osgood’s moi 
recent extension of congruity theory, as well a 
an earlier extension proposed by Triandis am 
Fishbein. Indeed, predictions derived from US 
good’s extension were no more highly correlati 
with the obtained attitudes than were the pi 
dictions based on the earlier extension. Furth r 
the two sets of congruity predictions were f 
correlated with each other. Thus, these resi 
add further support to the hypothesis that ati 
tude organization and change is a process 0 
cognitive summation and not a process of cog 
nitive balance, and point once again to ¥ 
necessity of reexamining the validity of M 
consistency principle in attitude research. 


i 
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THREE TECHNIQUES FOR OBTAINING SELF-PERCEPTIONS 
IN PREADOLESCENT BOYS 


CHARLES W. PERKINS np DONALD T. SHANNON 


University of Illinois 


Self-perception scores were obtained for 64 preadolescent boys on 3 different 
measures: a self-rating inventory, & projective technique, and a multiple-choice 
picture identification test. Scores were intercorrelated with each other, with 
teacher ratings of adjustment, and with measures of IQ and academic success. 
Self-perception scores were positively related among the 3 instruments and to 
teacher ratings of adjustment. Ideal-self scores on the inventory and projective 
measures were found related to IQ and academic success. 


Self-concept research has focused on two gen- 
eral problems: the development of instruments 
for assessing self-perceptions,! and the determi- 
Nation of relationships between self-perceptions 
and other behavioral variables. Crowne and 
Stephens (1961) have reviewed the literature on 
Measuring instruments, while McCandless (1961) 
sh Wylie (1961) have reviewed the literature 
ot Most self-perception instruments 
pls been of a verbal self-report type, for ex- 
its e, self-rating inventories, adjective check 
ri ues Q sorts, These are particularly subject 
fae ae of defensiveness of subjects; social 
Å T ility of items; and, especially with chil- 

' e cations of verbal comprehension. The 
ae continues for techniques which will mini- 
ae effects and at the same time ade- 

ly sample self-perceptions. 

*No distinction is made here between the terms 


self-perceptio k 
description n, reported self-perception, and self- 


The present study investigates the relationships 
among three techniques for obtaining self- 
perceptions in preadolescent boys: a self-rating 
inventory, a projective technique, and a multiple- 
choice pictorial jdentification test. It also studies 
the relationships among types of self-perception 
scores. In previous studies scores have often 
been based either on culturally defined norms for 
positive self-perception or on individual subject 
values, that is, the discrepancy between the sub- 
ject’s description of himself and his ideal. Only 
a few studies (e.g-, Worchel, 1957) have com- 
pared both types of scores derived from the same 
instrument. The present study also examines 
the relation between self-perception scores and 
teacher ratings of adjustment, IQ, and academic 
achievement. ‘Although it has been generally held 
that positive self-perception accompanies suc- 
cessful adjustment and achievement, not all 
studies support this assumption (e.g. Fiedler, 
Dodge, Jones, & Hutchins, 1958). Little is 


444 


known of the relationship between IQ and self- 
perception scores. 


METHOD 


Three instruments for obtaining both self- and 
ideal-self-perception were administered sequentially 
on 3 consecutive days. To insure that subjects would 
be unaware of the self-referent nature of the study 
when responding to the projective instrument, it was 
administered first in all classes. Presentations of other 
instruments were counterbalanced to control for 
order effects. Results from these instruments along 
with scores for adjustment, intelligence, and aca- 
demic performance and achievement were intercorre- 
lated. 


Subjects 


The subjects were 78 sixth-grade boys, with a 
mean age of 12 years, attending public schools in a 
small midwestern community. They were predomi- 
nantly from middle-class socioeconomic backgrounds, 
being sons of military and civilian military person- 
nel, and local community families. Fourteen subjects 
were eliminated from the final sample for the fol- 
lowing reasons: cerebral palsy, 1; lack of IQ scores, 
2; ethnic differences, 2; mishandling of tests, 4; 
absence from testing, 5. 


Measuring Instruments 


Self-rating inventory. A modified form of the 
Coopersmith Self-Esteem Inventory (Coopersmith, 
1959) was employed. Twenty statements covering 
content depicted in the pictorial identification test 
(see below) but not sampled in the Coopersmith 
inventory were added; response categories “Like 
Me” and “Unlike Me” were replaced by the cate- 
gories “Hardly Ever,” “Sometimes,” and “Most of 
the Time.” The final inventory (Inv) consisted of 
70 items concerned with the subject’s perceptions in 
four areas: peers, parents, school, and self. To limit 
the effects of response bias the items were randomly 
counterbalanced so that each extreme response cate- 
gory (ie, Hardly Ever and Most of the Time) 
represented highest positive self-perception for 50% 
of the statements. The subjects responded to Inv 
twice, first describing themselves, and then describing 
their ideal selves. The split-half reliability for self 
was .89; for ideal it was .92.2 Preliminary study 
had indicated that sixth graders could handle the 
three category self-description inventory (Inv) with 
a test-retest reliability of .91 over a 1-week interval: 
further test-retest reliability estimates for Iny were 
not obtained. 

Projective self-description. Projective descriptions 
of self and ideal-self were obtained by an adapta- 
tion of the Draw-A-Person Test (Machover 1950) 
Subjects were asked to draw a person “ail then 
describe how that person felt using the inventory 
described above. Preliminary study had shown that 


2 All split-half reliabilities have been corrected by 
the Spearman-Brown formula. 
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the instruction, Draw a Person, was too general, 
sometimes prompting the subject to draw some | 
known or famous person, for example, testing con 
ducted close to Lincoln’s birthday evoked drawings 
of Abraham Lincoln. Therefore, the instructions were 
modified to read: ' 


| 
Draw a person in the space below. Make the | 
person to be about sixth grade age. Boys should: 
make their person a boy. Girls should make theit 
person a girl. Draw the person any way you want 
to, but be sure it is a whole person. You will have 
five minutes to make your drawings. 


After making their drawings subjects were givel 
a form of Inv in which the first-person pronoun “MA 
had been replaced by the third-person pronoun “he? 
On this Draw-A-Person Inventory (DAPI) subjects 
described how the persons in their drawings felt 
about themselves. Then on second copies of DAPI 
they described how the persons “would really most 
like to be.” The split-half reliabilities were 92 for 
self and .84 for ideal. Retesting 22 subjects | 
after 4 weeks yielded test-retest reliabilities of # | 
for self and .74 for ideal. J 
Multiple-choice pictorial self-description. A multi- | 
ple-choice pictorial identification instrument de- 
veloped by Shannon and Shoemaker (1960) at the | 
University of Illinois was used to obtain pictorial | 
descriptions of self and ideal-self. This instrument is 
a set of 29 black and white drawings reproduced as 
$ X10 photographs (Pics). They depict three ple 
adolescent boys engaged in various activities relate 
to peers, family, school, and self. In each picture 4 
one boy represents the normative ideal, another tht 
normative “least” ideal, and a third is intermediate 
between these two. Pics were so ordered that pO 
sition of character type and theme varied random 
In each picture the principal characters were label 
A, B, and C from left to right. Pics were oe 
administered twice by projecting them onto a 52X 7 
inch screen by means of a 1000-watt Vu Lyte opaque 
projector in a semidark room. First administrati 
was at the rate of 25-seconds exposure per pe 
with 5 seconds between each; second administrat 
was at the rate of 15-seconds exposure per pic 
with 5 seconds between each. On the first presen 
tion subjects were asked, “Which one are yo 
and on the second presentation they were aS a 
“Which one would you most like to be?” Respon: f 
were made by circling appropriate letters 0n ant 
sheets. The split-half reliabilities were .81 for Ei 
and .79 for ideal. Test-retest reliabilities for 25 § 
jects after 4 weeks were .71 for self and .74 for 1 ie 
Adjustment, achievement, and intellectual Ae 
ures. Average teacher ratings of subjects on Fan 
of the Peterson Behavior Rating Schedule (Peteri, 
1960) were used as criterion of adjustment Ne 
This schedule consists of bipolar behavioral if 
teristics rated on a 5-point scale, 10 items lea 
on Factor I, General Adjustment, and 10 3 E 
loading on Factor II, Introversion-Extrave"" “te 
Each subject was rated by his homeroom be 
and a second teacher. Teachers were not told 
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their ratings would be used as a criterion of ad- 
, justment. IQ and academic achievement scores were 

obtained from the subjects’. school records. IQ 

scores were from group-administered intelligence tests 
given over the preceding 2.5 years. Average grades 
for the third school quarter were used as the measure 
of current academic performance. Median scores on 

the Metropolitan Achievement Test (Battery A) 

were used as the index of academic achievement. 

Derivation of self-perception scores. For DAPI and 
| Inv, least ideal-, intermediate ideal-, and most ideal- 
self-perceptions, as culturally defined, were repre- 
sented by numbers —1, 0, and +1, respectively (for 
Pics the respective numbers were 1, 2, and 3). Titus 
for the item “I’m pretty happy,” the response cate- 
gories Hardly Ever, Sometimes, and Most of the 
Time were scored —1, 0, and +1, respectively. 

For each instrument five scores were derived: 
total self from the subject’s self-description; total 
ideal from his ideal-self-description; and three dif- 
* ference scores, Ed, >|d|, and 2d». Self and ideal 
Scores reflected the congruence between the cultural 
norm definition of ideal-self-perception and the 
subject’s self- and ideal-self-descriptions, respec- 
tively, 2d was the sum of algebraic differences be- 
tween ideal and self-ratings (ie, ideal minus self), 
and thus indicated the agreement between the sub- 
Í ject’s _self-description and his stated ideal while 
retaining cultural norm definitions of positive and 
negative self-perceptions. Since Z|d| was the sum of 
absolute differences between self and ideal, both it 
and Zd? disregarded the cultural definition of posi- 
tive and negative self-perception with Xd? giving 
greater weight to the larger item-for-item disparities 
between self and ideal.3 

The rationale for comparing 2d with =|d| and Zd? 
arises from a potential weakness in the Zd score. =d 
is the sum of ideal minus self differences, and in gen- 
eral the numerical value of the subject’s ideal rat- 
ae for any given item will be as high or higher than 
i self-rating, thus yielding a positive difference. 
ah on some items, due either to unreliability 
RET i or to a subjective interpretation of the 
N ich deviates from the cultural norm, the 
iat ct may obtain a negative difference. In sum- 
Cae items these negative differences have 
N ect of reducing the overall ideal minus self 
ST epancy rather than adding to it. Since Zj|d| and 

eliminate negative numbers they overcome this 
Weakness, 


RESULTS AND DISCUSSION 


an relationships of self-perception scores ob- 
i ee by the three techniques are given in Table 
rel 7 general there was a significant positive 
tho ag among the three instruments. Al- 
ough the highest correlations were consistently 


3 
ie, course these scores are still, in large part, 
i ratin ced by the degree to which the subject’s self- 
more e DProaches the cultural ideal, since 07% the 
the cult individual ratings of ideal-self will reflect 
ural ideal. 


TABLE 1 


INTERCORRELATIONS OF SELF-PERCEPTION SCORES FROM 
THREE ASSESSMENT INSTRUMENTS (N = 64) 


Inv Pics 
Self 
DAPI .160* „554 
Iny AG 
Zd 
DAPI 660* DIAR: 
Inv Le 
2\d| 
DAPI 68%* 38* 
Inv .36* 
ze 
DAPI „657 423 
Inv AS 
Ideal 
DAPI Bip has AZ 
Inv Al 


* p <.01, two-tailed test. 
++ p < 001, two-tailed test. 


between Inv and DAPI, these must be viewed 
with the knowledge that both instruments were 
using essentially identical response categories. 
Although number of items and separate-day ad- 
ministration probably reduced memory effects, the 
tendency to use extreme response categories may 
have inflated these correlations. Difference in type 
of response categories used for Pics may have 
lowered the degree of its correlation with Inv 
and DAPI. Responses to both Inv and DAPI 
were on a frequency continuum, the subject 
indicating how often he would want to be like 
a given description. Responses to Pics were on 
a continuum from least to most ideal, the sub- 
ject indicating the figure he would “want to be 
most of the time.” Future research with Pics 
should determine whether correlations with Inv 
and DAPI are higher when responses to all in- 
struments are placed on the same frequency 
continuum. 

Correlations between average teacher ratings 
of adjustment and self-perception scores from 
the three instruments are given in Table 2. All 
self-perception scores from all instruments 
yielded positive relationships with adjustment 
ratings, with Inv consistently producing the 
highest correlations.* These results along with 
the intercorrelations in Table 1 suggest that the 
projective and pictorial identification techniques 
represented by DAPI and Pics, respectively, may 
be alternatives to the self-rating inventory ap- 
proach for obtaining self-perceptions. However, 

4Inspection of scatterplots for average ratings of 


adjustment with uself” scores from each of the three 
self-description instruments indicated linear relation- 


ships. 
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in their present form, neither alternative demon- 
strated superiority over the self-rating inventory. 

A major weakness of the projective technique 
was its unreliability. Test-retest reliability for 
DAPI self over 4 weeks was .43. The task ap- 
pears to have been so ambiguous that subjects 
may have projected not only their self-percep- 
tions, but also their ideal-self-perceptions, or 
even their perceptions of other persons. Such a 
weakness may not be easy to eliminate from a 
projective technique without also destroying its 
projective quality. 

Results in Table 2 indicate that the self and 
3|d| scores generally yielded the higher correla- 
tions with adjustment and were of similar magni- 
tude within each instrument. Both of these scores 
have rather clear-cut interpretations, the self 
score reflecting congruence of self-perception with 
a constant cultural norm of ideal-self-perception, 
and 3|d| signifying the congruence of self-percep- 
tion with subjective ideal-self-perception. Since 
the Sd score introduces the possibility of nega- 
tive ideal minus self discrepancies, which are 
psychologically meaningless, it is not surprising 
that this score was generally a lower correlate 
of Adj. Results of this study therefore partially 
support the observation by Wylie (1961) 


TABLE 2 


CORRELATIONS OF IDEAL AND SELF-PERCEPTION SCORES 
WITH AVERAGE RATINGS OF ADJUSTMENT, IQ, 
ACADEMIC PERFORMANCE, AND ACADEMIC 
ACHIEVEMENT (N = 64) 


A Academi demi 
dius 19 TOPS ean 
ance ment 
Ideal 
DAPI 27 4g .37* AR 
Thy 9 43 .33* AT 
Pics —.06 | —.02 —,21 mlt 
Self 
DAPI .34* 15 AS 15 
Inv 34 26 27 23 
Pics -16 07 —.06 —.10 
DAPI 10 21 
aa 05 22 
ics = 
zal 09 .02 
DAPI —.14 —AS 
ay —.34* —:25 
ze 02 05 
DAPI —.09 
‘ =A0: 
Inv —.31 —.25 
Pics 02 04 
Adjustment 67% 628 


Note-—Negative correlations obtained for diff 
also indicate positive relationships between positive self-per. 
ception and adjustment. E see Der: 
*p<.01, two-tailed test, 
5 < .001, two-tailed test. 
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that correlations between theoretically relevant vari- 
ables and Self scores . . . are at least as high and f, 
usually higher than those obtained from two-part 
indices from the same instrument . . . [p. 106]; 


they also support the assertion of Worchel 
(1957) that 


it is the amount of discrepancy [between self and 
ideal] and not the direction that is important in the 
prediction of maladjustment [p. 7]. 


Giving greater weight to the larger ideal minus 
self discrepancies by means of Xd? lowered the 
correlations with Adj on all instruments. 

IQ and academic achievement variables were 
not significantly related to the self-perception 
measures, However, they were positively related 
to ideal and Adj as shown in Table 2. It is 
possible that more intelligent subjects have a | 
more accurate understanding of the culturally 
defined ideal, or at least dare to state a higher 
ideal because of their past successes. Pics devi- 
ated from the trend set by Inv and DAPI, yield- 
ing nonsignificant negative correlations between 
ideal and measures of IQ and academic success: 
This again may be a function of the different 
response categories used for Pics, or it may 
reflect some unexplained difference between seli- 
perceptions based on pictorial stimuli and those 
based on verbal stimuli covering similar content 
areas. 

Assuming that brighter, more successful boys 
may indeed find it easier to be well adjusted, # 
positive correlation between academic success 
and adjustment does not seem unusual. Howevely 
the fact that teacher ratings of adjustment cone 
lated more highly with grades than with either | 
IQ or achievement scores suggests some hal 
effect. It may be that those who conform more 
to teacher expectations for academic performance | 
also conform more to teacher expectations ong 
classroom behavior, or at least are perceived 45 
conforming. One sobering thought for self 
concept research is that an objective measure 
ment of intelligence or academic achievement 
may be a better predictor of adjustment © 
children than any self-perception instrument. 

Results here suggest that techniques other that 
verbal self-report approaches may be suitable {og 
assessing self-perceptions. The pictorial identify 
cation technique with children has 4 novell) 
attention 
There are hints that self-perception responses 4 
pictorial stimuli may be different from tie 
made to verbal material covering similar conten i 
Some clarification of this point may be achi 
by studies which equate response categories 
both techniques. 
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ACHIEVEMENT, NURTURANCE, AND COMPETITION IN 
MALE AND FEMALE TRIADS* 


IMAT R. AMIDJAJA AND 


University of Hawaii 


W. EDGAR VINACKE 
State University of New York at Buffalo 


A previous experiment found that motivational differences influence bargaining 
behavior in an intragroup competitive game situation, when triads are composed 
of males. The present experiment replicates this study with triads of both 


sexes, each consisting of an S high in a 


chievement, 1 high in nurturance, and 1 


intermediate in these respects. The Ist experiment used the Edwards Personal 
Preference Schedule to assess motivation; this one employed the French Test 
of Insight. Results of the previous experiment are supported for males, but 


not females. Ss high in achievement play an active, initiating role, those high 


in nurturance, a less active, recipient role. 


In a game situation designed to investigate 
bargaining behavior and coalition formation in 
three-person groups, important effects of motiva- 
tional characteristics have been reported (Chaney 
& Vinacke, 1960). In this game, the relative 
Power of the three participants was varied by 
the drawing of weights which determined how far 
each person could move around a board to arrive 
at “Home.” In addition, one player was high in 
achievement, but low in nurturance, a second was 
high in nurturance, but low in achievement, and 
_ the third was intermediate in both respects. Gen- 
erally speaking, it has been found that the weaker 
4 members tend to ally against the stronger. When 
pean characteristics were varied, this 
endency still predominated, but motivational 
Me was prepared by the junior author, 
S ae on research conducted by the senior 
(Amid; to fulfill requirements for the MA degree 
0. Jaja, 1963). The generous assistance of William 
- Oakes is gratefully acknowledged. 


differences were associated with a number of 
variations in the behavior of the subjects, perhaps 
representing the kind of role that a person adopts 
in competitive situations. In particular, the sub- 
jects high in achievement tended to be most 
active and to be the principal initiators of offers 
to form coalitions; by contrast, subjects high in 
nurturance tended to be the recipients of offers 
to ally. 

A major finding of the research conducted with 
triads, under conditions of intragroup competi- 
tion, has been a systematic difference in the 
strategies employed by the two sexes. Males 
display a pattern of ruthless, cutthroat competi- 
tion, with a strong orientation towards winning. 
Females, on the other hand, more often manifest 
concern for fairness, equalization of outcome, and 
a strong orientation towards the social-interac- 
tion aspects of the game. These strategies have 
been named “exploitative” and “accommodative.” 
These and other results, together with details of 
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the game, may be found in Vinacke and Arkoff 
(1957) and Vinacke (1959, 1962). 

In the Chaney-Vinacke study, only male triads 
were investigated. In view of the sex differences 
which have consistently been found in other ex- 
periments, it would be interesting to see whether 
achievement and nurturance enter into female 
competition, as well. Furthermore, Chaney and 
Vinacke used the Edwards Personal Preference 
Schedule to assess motivational characteristics, 
and it appeared desirable to inquire whether simi- 
lar results would appear with a different instru- 
ment. For this reason, the French Test of Insight 
was chosen for the present investigation. 

The problem, then, of this experiment was to 
replicate the Chaney-Vinacke study, to compare 
female with male triads, with motivation meas- 
ured by a different test. 


PROCEDURE 
Subjects 


Thirty male and 30 female subjects were recruited 
from psychology courses, in which the French Test 
of Insight had been administered. This test is a semi- 
projective instrument intended to measure motiva- 
tional tendencies in accordance with Murray’s form- 
ulations (French, 1958). It consists of 10 statements, 
like “Bill always lets the other fellow win.” The 
task is to explain why each individual in the state- 
ments behaves as he does, In scoring, judges were 
given definitions of achievement and nurturance, and 
asked to rate each story in terms of the degree to 
which it reflected these tendencies. A total score 
consisted in the number of stories in each category. 
Each test was scored separately by three persons, 
and only those subjects upon whom all three judges 
agreed were chosen, Triads were composed by select- 
ing. three persons of the same sex, one high in 
achievement, one high in nurturance, and one inter- 
mediate in these respects (control subject). 


TABLE 1 


TOTAL OCCURRENCE OF WEAK COALITIONS IN MALE 
AND FEMALE TRIADS WHEN MOTIVATIONAL CHAR- 
ACTERISTICS OF PLAYERS ARE VARIED 
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Game Situation 


The experimental conditions were similar to those | 
reported in previous studies. A board like that used 
for games like Parcheesi contained a simple pathway 
with numbered spaces running from “Start” to 


“Home.” The object of the game was to arrive | 


Home first. Each player moved ahead the number of 
spaces corresponding to the value of a single die, 
cast by the experimenter, multiplied by his weight, 
For example, if the die came up 4, a player who 
held the weight of 2 would move 8 spaces, All play- 
ers moved after each throw of the die. The weights 
were inscribed on the colored counters used for moy- 
ing. Players took turns drawing counters from the 
set selected by the experimenter to constitute the 
requisite power pattern. The number on the counter 
drawn by a player was his “weight” or strength in 
that pattern, 

At any time, any player could propose an alliance 
(or coalition?) with either or both of the other 
players, Rejections, counteroffers, or offers to other 
players could take place, until some decision wis 
reached, that is, to establish a coalition of some kind 
or to play without coalition. In these negotiations, 
the terms involved the division of an arbitrary prizt 
of 100 points. (Thus, a final alliance was based ona 
“deal.”) The agreements reached upon coalition rep- 
resented the number of points earned if the allies 
won (as they did in all patterns except for coalitions 
between weak players in Type IV). After a coalition 
was formed, the partners “pooled” their resources by 
combining their counters (as “Kings” in checkers) 
and moving ahead the total of their acquired spaces: 
For example, if B and C in Type II were on Space 
10, they would, as a coalition, move to Space 20. 
Once settled, a coalition could not be broken fot 
that game. Players soon learned that play was’ not 
actually necessary if a decision concerning coalition 
was made, 

A “game-by-game” procedure was used, whereby 
each game was treated as a separate event, that is, 
points were not accumulated over the series of games 


Power Patterns 


For the present experiment, four of the power be 
lationships described by Caplow (1956, 1959) wet 
chosen. The following table summarizes these condi- 
tions: 


Male triads Female triads 
pe taie Type Power patterns among players A B c 
ACs | ana | cn | ac | an | cn a A=B=C 1 j j 
A<(B +C); B=C 3 

Sic nore T) 4) 7 Mt Ac@iGiarssc 4 3 i 
51 or more 3 3 7 3 eee BoC f 
not i A 0 Each triad played three series of the above four 

or less 3 types, with i i es in 

y a ie e ER A a ypes, with order randomized, making 12 gamı 


Note,—Here, and in subsequent tables, A refers to 
high in achievement, C refers to control subjects, and Noes 
to subjects high in nurturance. oa e ee 


SR AC versus CN and AN versus CN, p <.05 by Fisher exact 


RESULTS 
Although a considerable variety of data may 


be obtained in this game situation, we shall P% 


2 These terms are used synonymously here. 


(i a a 
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TABLE 2 


INITIATION AND RECEPTION OF OFFERS TO ALLY, BY Motive CONDITIONS, 
ror Games or Types II anp IM 


Male triads Female triads 
age of 3 7 
pees a c N Aie o ilan N 
Ie Ra I R I R I | R I | R I R 
31 or more> 8 3 6 6 4 8 7 5 3 5 4 7 
30 or less 2 7 4 4 6 2 3 5 7 5 6 3 
N 10 10 10 10 10 10 10 10 10 10 10 10 


For A, male, I versus R, b = “05; male I, A versus Ñ, p <.10; male R, A versus N, p = 05. 


a I refers to initiations, R refers to receptions. 


| 

l 

l 

Note —All comparisons tested by Fisher exact test. 

T Dion points are shown uniformly for all columns, 
P 


sent, here, only those that bear directly on the 

central problem of the experiment. 

_ Chaney and Vinacke (1960) reported that coali- 
“tions between the control and nurturance subjects 
were most frequent, especially in Type I and 
when partners are weak (B and C in Types II 
and III). We shall confine ourselves to these 
data, and they are given in Table 1. Percentage 
of opportunity means the occurrence of coalitions 
in relation to the times when a given alliance 
couldebe established (i.e., when the two players 
were both weak). It can be seen that both the 
pairs AC and AN significantly less often arrive 
at coalitions than the CN pair, in the case of the 
male triads, but not in the case of the females. 
Thus, as in the previous experiment, there is a 
tendency for the control and nurturance subjects 
to prefer each other as partners. 

The pattern of bargaining is revealed quite 
clearly when we examine the interactions of the 


but adjustments were made, as necessary, for the median test. 


person to whom such offers are directed (“recep- 
tions”). The subjects high in achievement are 
seen to be the most active in initiating offers, 
and the subjects high in nurturance to be most 
frequently the recipients of offers. Although the 
same tendencies appear in the triads of both sexes, 
none of the differences attains significance in the 
case of the females. These findings agree with 
Chaney and Vinacke (1960). Thus, despite the 
fact that the subjects high in achievement play a 
more aggressive role in bargaining, the actual 
coalitions established favor the other two players, 
as shown in Table 1. A more specific analysis of 
bargaining behavior is presented in Table 3, in 
which the six kinds of offers are given. The let- 
ters indicate who made an offer to whom (e.g., AC 
means that the subject high in achievement made 
an offer to the control subject). It is apparent 
that there is a marked tendency for interactions 
between the male subjects high in achievement 
and high in nurturance to take a form in which 


three players. Table 2 summarizes the incidence L 2 a fc } 
of proposals to ally (“initiations”) and of the the subject high in achievement initiates offers to 
TABLE 3 
INITIATIONS AND RECEPTIONS BY PAIRS OF Motive CONDITIONS 
Pi 22 + 432)> a 
ee ity Pe Type I (1) Type IE g I (G2 t ) Total 
which at least one 
offer occurred® 
wc lanlon| ea |wa|Ne| ac| an] enlica| wal. Ne] Ac] AN | CN co [RIS 
ies qi 4| 7| 4) 4] 2 4 
| or more 6 5| a2] 2) 4) 6] 9 6| 4) 4 
49 or less 6] sps 21 21 6) $| of 41 51 3] 8) abl ablio| 19 | 10 | 10 
N 10 | 10 | 10 | 10 | 10 } 10 9| 9|10| 9 ọ | 10 | 10 | 10 | 10 10 | 10 | 10 
Female triads 
50 or more 6| 6| 4| 4 6 
E 2) 3) 21 $140] 10 Glee (> Soh STIS Pa 
or 9) or les 9|10| 10) 3| 4} 5 
4 y 3 10 10 10 10 | 10 | 10 | 9 | 10 ọ| 9|10| 9 10 |10 | 10 | 10 | 10 10 


versus NA, p = .005, by Fisher exact test. 
° AN versus NA, p <.05, by Fisher exact test. 


à Division-points are shown uniformly, but adjustments were made, 


as necessary for the median test. 
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the subject high in nurturance. However, no such 
distinction occurs in the female triads. Again, 
results for the male triads agree with Chaney and 
Vinacke. y 

The pattern just described does not achieve 
significance for the other possible power relation- 
ships, that is, when one of the two players holds 
the strongest weight, although it is in the same 
direction for the male triads, and we shall not 
present the data. In any case, offers involving the 
two weaker players are considerably more fre- 
quent, especially for the female triads, in which 
the strong player is seldom involved. 

In general, as in previous experiments, very 
few coalitions were established by the players 
of either sex in the “All-Powerful” pattern 
(Type IV). Chaney and Vinacke (1960) found 
that the subjects high in achievement were less 
likely to permit alliances when they held the all- 
powerful weight (i.e. could win without coali- 
tion). In this experiment the same tendency ap- 
peared for the males, but not the females; the 
latter, as in other experiments, more often es- 
tablish alliances in the All-Powerful type, when 
none is necessary. In view of the low incidence 
of bargaining in this type, the data are not pre- 
sented. 


Discussion 


The results of this experiment point to an 
association between strategy in competitive situ- 
ations and motivational characteristics. In most 
of the experiments in this series, we have focused 
on the general difference between the sexes which 
we have described as “exploitative” (male) and 
“accommodative” (female). These styles of play 
are associated with a variety of striking phe- 
nomena (Vinacke, 1962), including the tendencies 
of female triads often to arrive at triple alliances 
and often to fail to form any coalition at all, to 
establish two-person alliances predominantly on 
a 50/50 basis, and to “discuss” the problem of 
who should win instead of bargaining, These same 
differences occur, also, in this experiment, but are 
not reported in the interests of brevity (see 
Amidjaja, 1963). 

In a sense, the different roles played by sub- 
jects high in achievement and high in nurturance 
resemble the differences in strategy between the 
two sexes. Subjects high in achievement are more 
active in seeking coalitions, which are necessary 
to win (in Types I, II, and III), but nevertheless 
less often succeed in arriving at an alliance. In 
the previous study, this was interpreted as a re- 
action by the control and nurturance subjects 
against his aggressive tactics, perhaps bringing 
about a kind of social solidarity between them. 
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When the two sexes were combined in triads 
(Bond & Vinacke, 1961), a similar result char- 
acterized the two females when they were in the 
majority. In this case, too, the aggressive ex- 
ploitative strategy of the male minority member 
appeared to promote alliances between the twé 
(presumably more accommodative) females, with 
a similar consequence that the male had consid- 
erable difficulty in earning points. 

There is some as yet not fully clarified dimen- 
sion of strategies in the two sexes, because the 
effect of motivational differences among the 
players is much less marked in the female than 
in the male triads. That is, the two sex roles are 
apparently not fully described by distinguishing 
achievement from nurturance. In the female 
groups, even the subjects high in achievement act 
similarly to other females. It may be, therefore, 
that it would be more fruitful to develop meas- 
ures more directly indicative of exploitative and 
accommodative tendencies and to relate these to 
behavioral measures. It may well be that it is 
such characteristics that differentiate the two 
sexes, rather than motivational ones. Thus, we 
might expect to find a relatively higher degree of 
accommodative attitude in females, regardless of 
motivational characteristics. Research in, this 
direction is now underway. 

Nor can we ignore the general features of the 
interaction situation. When groups of the same 
sex interact there may be generated an atmos- 
phere to which the individual members adapt, te 


OOOO EEE R e a 


gardless of personality variables. Thus, even fe- 


males high in achievement may act in a compara- 
tively accommodative way when interacting with 
other females. Within this general pattern, moti- 
vational characteristics may influence the roles 
that members of the group play. In this exper 
ment, such roles are clearly defined for the males, 
but not for the females. 

As a final comment, it can be said that similar 
results have been found for the French Test of 
Insight, as a measure of motivation, as previously 
for the Edwards Personal Preference Schedule. 
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10 groups solved 15 problems in each of 8 conditions: 4- and 5-man groups 
(4-m, 5-m) in “Wheel” (W) and “Circle” (C) structures under 2 levels of 
task difficulty, H = 1.6 and H= 2.4 (H is information entropy). Solution-time 
data showed: (a) no difference between 4- and 5-m in W on H = 16, H=24, 
and in C on H=24, while in C on H = 1.6 4-m was faster than 5-m; (b) W 
was faster than C in 4-m on H = 2.4 and 5-m on H = 1.6 and H = 2.4, while 
4-m on H=1.6 no differences between W and C were obtained; (c) H=1.6 
was faster than H=2.4 in 4-m in both W and C and in 5-m in W, while in 
5-m C no differences between H= 1.6 and H=24 were obtained. Error data 


showed W making fewer errors than C in both 4- and 5-m on H=1.6. 


Literature reviews of group-size effects on per- 
formance (Kelley & Thibaut, 1954; Lorge, Fox, 
Davitz, & Brenner, 1958; Thomas & Fink, 1963) 
compel the conclusion that the relationship is 
a simple nor direct. Thomas and Fink con- 
clude, 


it appears that both quality of performance and 
group productivity were positively correlated with 
group size under some conditions, and under no con- 
ditions were smaller groups superior. In contrast, 
Measures of speed showed no difference or else fa- 
vored the smaller groups [p. 373]. 


The major difficulties with research on group 


. Size, in fact, social psychology in general, may 


1The research reported in this paper was spon- 
oe by the Aerospace Medical Research Labora- 
ories, Aerospace Medical Division, Air Force Sys- 


4 tems Command, Wright-Patterson Air Force Base, 


Ohio, in part under Contract No. AF 33(657)-10456 

with Miami University. This paper is identified as 

ARL Technical Report No. AMRL-TR-65-140. 

poet reproduction is authorized to satisfy needs 
the United States Government. 


be subsumed under two headings: ambiguity of 
task properties and simplicity of experimental de- 
sign. Consider tasks: invariably tasks are used 
whose properties, for example, difficulty, are left 
unspecified. This reflects a condition that is gen- 
eral in social psychology; namely, failure to 
quantify stimulus inputs. In the context of group- 
size studies this means that without such a de- 
velopment it is impossible with some tasks to 
study different size groups without substantial 
modification of the task, thereby giving different 
size groups different tasks (e.g, Leavitt’s, 1951, 
task). On the other hand, tasks exist (Shaw, 
1954) which may be used on different size groups 
without modification, but if this is done, indi- 
vidual “work loads” in the larger groups are 
necessarily less (by some unspecified amount) 
than the work loads in the smaller groups. With- 
out question there are task properties other than 
“difficulty” and “work load” that require explica- 
tion and quantification. However, the preceding 
indicates the difficulty of obtaining results with 
existing tasks that are not confounded. 
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Turning to experimental design, Thomas and 
Fink point out that 


many of the studies proceeded on the assumption 
that size should somehow have an immediate effect 
upon some aspect of behavior, and, as a consequence, 
relevant intervening variables were almost never 
measured or varied experimentally [p. 379]. 


Tt is this same concern with single isolated rela- 
tionships between two variables that Coleman 
(1960, pp. 7-14) advances as one reason for the 
inhibited development of mathematical models 
for the behavior of small groups. 

Without notions of expected outcomes we shall 
present the results of a study that has, for the 
most part, avoided the preceding difficulties. It 
examines the performance of two different size 
groups in two communication structures under 
two levels of task difficulty, the latter being 
mathematically defined. 


METHOD 
Subjects 
Subjects were 360 Miami University men students 


who were paid for their participation, Subjects were 
randomly assigned to one of eight experimental con- 


TABLE 1 


MEAN SOLUTION TIMES AND ERRORS PER PROBLEM 
AND VARIANCES FOR PROBLEMS 6 THROUGH 15 


Group size Structure Mai 
aay eis | Mij + 
Me hing: fot 
Four-man 
1.6 
M 1.91 2.14 4 2.46 -67 
i Y .25 2i 5 2.64 -70 
im | 256 | 325 | 16 | 220 | 42 
ot 23 1.06 24 2.91 71 
Five-man 
1.6 
M 1.81 2.95 W 2.21 38 
o AL 35 (0 2 18 
24 
M 2.57 | 3.25 
o 1 -16 
Four-man 
1.6 
Errors AL 87 4 -16 50 
o 07 | 72.) 5 5 
yk 65 40 
Errors 81 94 1.6 66 
à so | os | 24 | ya | S0 
Five-man 
1.6 
Errors Kyi 1.01 W 53 33 
a 10} 49 | ¢ 3 : 
Wy 88 52 
Errors 54 69 
o? 49 29 
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TABLE 2 
GROUP Size X Task: STRUCTURE CONSTANT 
Source of MS F 
Solution time f 
Wheel 
Size (G) 1 211.60 -= 
Task (T) 1 49,843.60 | 18.20 
Bà GXT 1 =| 249.99 — 
ğ Error 36 | 2,745.76 — 
k, Circle 
gh G 1 16,933.22 | 2.83 
Ra, T 1 50,055.62 | 8.37% 
GXT 1 16,443.02 | 2.75 
Error 36 5,980.22 — 
Errors 
Wheel 
G 1 24.02 = 
vs 1 81.22 | 2.53 
GXT 1 13.22 = 
Error 36 | 32.01 
Circle | 
G 1 3.02 = 
T es. | 15.62 se 
GXT 1 38.02 < 
Error 36 54.79 
*p <0 
"p $ iooi, 


ditions (two group sizes, four- and five-man; two 
structures, wheel (W) and circle (C); two levels of 
task difficulty, H = 1.6, H = 2.4). Ten groups were 
run under each condition. 


Apparatus and Task 


The apparatus used was essentially identical to 
that described by Leavitt (1951). The only differ- 
ence was that our apparatus had an accommodation 
for the experimenter, as well as five subjects, per 
mitting each subject to pass his problem solution 
directly to the experimenter. 

The task used in the experiment is described else- 
where (Morrissette, Pearson, & Switzer, 1965); 
therefore, it will be discussed only briefly. The infor- 
mation theory measure H (entropy) is used 10 
describe task difficulty. The information source cof 
sisted of the set of symbols +, O, A, O, ne) 
with each symbol having an associated probability 
of occurrence. The number of events in a problem 
was arbitrarily set at 40 (the symbols were stampée 
on 40 cardboard squares), with the frequency of oc- 
currence of each symbol proportional to its assoc 
ated probability. Two sets of 15 problems, Wit 
levels of task difficulty H=1.6 (“easy”) and H= 
24 (“difficult”) were used. The amount of informa 
tion (+) assigned to each group member, within à 
given level of task difficulty and group size, W% 
equal. The task of the group was to determine, aol 
each problem, which symbol occurred most fre 
quently and its frequency of occurrence. A problem 
was considered solved when all members of a gr0uP 
had transmitted the correct answer to the expe 
menter. 


,, it along [italics added],” 
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Procedure 


The procedure used followed Leavitt (1951) but 
for four exceptions: the communication structure 
was described to the subjects, with each subject hav- 
ing in his booth a diagram of his group’s structure 
*and his position in it identified; Leavitt considered 
a trial complete when each subject had signaled an 
answer (right or wrong) to the experimenter. In the 
present study, in an effort to obtain a correct an- 
swer from each subject, when the experimenter re- 
ceived an incorrect solution from a subject, the ex- 
perimenter indicated this to the subject by causing 
a small light to flash in the subject’s booth; where 
Leavitt had each subject work through praetice 
problems independently, we had the group work 
through two practice problems, under a 5-minute 
time limit, with the experimenter answering ques- 
tions and clarifying procedural points; where 
Leavitt’s instructions to the subjects read, “When 
you think you have the right answer, you may pass 
this was changed to read, 
* “will pass it along.” The changes were introduced in 
an effort to reduce learning effects and variability. 

The group’s solution time score on each problem 
was the average of the time required by the subjects 
in the group to obtain and transmit the correct an- 
swer to the experimenter, with the maximum score 
on each problem being 10 minutes, the limit set on 
each problem. A group’s error score was the total 
number of erroneous answers transmitted to the ex- 
perimenter. 


RESULTS 


The solution time and error data will be dis- 
cussed separately. To minimize the effects of 
learning, only the data from Problems 6 through 
15 will be considered. 


Solution-Time Data 


The mean solution times per problem and the 
variances are shown in Table 1. An analysis of 
variance of these data showed group size, “s; 
task, p < .001; structure, p < .001 (see main ef- 
fects, Table 1); none of the interactions attained 
acceptable levels of significance, although the 
Group Size X Structure and the Group Size X 
Task X Structure interactions were p < -12. To 
Minimize the effects of the significantly different 
variance in the four-man H =2.4 C, five-man 
H =2.4 W and C groups, the data were system- 
atically subjected to three analyses of variance. 
These analyses of variances are given in Tables 2, 
3, and 4 (along with the analyses of the error 
data). In the discussion to follow we shall be 
concerned equally with the absence, as well as 
the presence, of effects. 


Group Size X Task: Structure Constant—Table 2 


Wheel. Since only task is significant, the speed 
of four- and five-man groups in W is determined 
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TABLE 3 
Group SIZE X STRUCTURE: Task CONSTANT 
Source df MS F 
Solution time 
= 1.6 
Size (G) 1 12,996.02 | 5.64* 
Structure (S) 1 46,717.22 | 20.267 
GXS 1 20,839.22 | 9.04 
Error 36 2,305.37 — 
H=24 
G 1 2.50 — 
S 1 46,922.50 | 7.31 
GXS 1 .09 — 
Error 36 6,420.61 — 
Errors 
H=16 
G 1 2.50 — 
S 1 302.50 | 8.68** 
Gxs 1 8.10 Be, 
Error 36 34.83 
H=24 
G 1 67.60 | 1.3 
S 1 19.60 — 
GXS 1 .10 = 
Error 36 51.96 
*p <.05. 
> <.01 
wet <.001 


strictly by task difficulty. The four- and five-man 
groups do not differ on the H=1.6 (four-man 
= 1,91, five-man = 1.81) and the H=2.4 tasks 
(four-man = 2.56, five-man = 2.57), though both 
are significantly faster on the H=1.6 task than 
on the H = 2.4 task. 


TABLE 4 
STRUCTURE X Task: Size CONSTANT 


Source df MS F 
Solution time 
Four-man 
Structure (S) 1 20,748.02 4,71* 
Task (T) 1 78,234.02 | 17.77 
SXT 1 5,221.24 1.20 
Error 36 4,403.17 — 
Fiye-man 
S 1 83,356.90 | 19.28%* 
T 1 | 2798410 | 6.47* 
SxT 1 5,152.90 1.20 
Error 36 4,322.81 — 
Pi 
RIAAN 1 87.02 | 1:7 
T 1 55.22 | 14 
SKE 1 27.22 — 
Error 36 49.68 — 
Tepr 1 156.02 | 4.22 
} EE e 
SXT 1 i i 
Error 36 37.12 — 
SPUN EAEE EE a A 
.05. 
aeS 
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Circle. Again task is significant, while group 
size and the G X T interaction approach signifi- 
cance (p <.10). The four-man groups are faster 
on the H = 1.6 task than on the H = 2.4 task 
(H =1.6, 2.14; H =2.4, 3.25), while the five- 
man data show no substantial difference on the 
two tasks (H = 1.6, 2.95; H = 2.4, 3.25). Thus, 
the significant task effect is derived largely from 
the four-man groups. It should be noted that on 
the H=1.6 task four-man groups tend to be 
faster than five-man groups (four-man = 2.14, 
five-man = 3.25), while on the H= 2.4 task 
there are no group-size differences (four-man = 
3.25, five-man = 3.25). 


Group Size X Structure: Task Constant—Table 3 


H = 1.6. Here size, structure, and the G X S 
interaction are significant, These analyses am- 
plify the findings presented in Table 2 by show- 
ing that, with task constant, group size has no 
effect in the W, while four-man groups are faster 
than five-man in the C. However, it is also ap- 
parent that on this task four-man groups are 
“equally” fast in the W and C (W, 1.91; Ç; 
2.14), while five-man groups are faster in the W 
than in the C (W, 1.81; C, 2.95). 

H = 2.4. Here only structure is significant, and 
again the analysis extends that of Table 2. On 
the H=2.4 task group size has no effect. The 
data, however, do emphasize the superiority of 
the W as compared with the C in both four- and 
five-man groups, a superiority that holds on the 
H=166 task in five-man groups only. 


Structure X Task: Size Constant—Table 4 


Four-man. This analysis reflects the significant 
main effects of task and structure (see Table 1). 
Performance on the H = 1.6 task is consistently 
faster than on the H = 2.4 task, However, the 
Superiority of the W over the C is derived pri- 
marily from the H=2.4 task (H= 1.6, 1.91- 
2.14 = 23; H = 2.4, 2.56-3.25 = 69), 

Five-man, Again this analysis reflects the sig- 
nificant main effects. Now, however, the superi- 
ority of the W over the C is derived primarily 
from the H=1.6 task (H =1.6, 1.81-2.95 = 
1.14; H=2.4, 2.57-3.25 = 68), a result oppo- 
site from that found in the four-man groups. 


Error Data 


A quick rundown of the error data in Table 1 
shows that the mean number of errors per prob- 
lem was less than one, but for a single excep- 
tion. This result suggests that the present task 
at least for the levels of difficulty used, is un- 
satisfactory for evaluating quality of perform- 
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ance. As a consequence, we can pass over 
error data briefly. 

The analysis of variance of these data showe 
only structure significant, with C making mo 
errors than the W (p<.05). 

The significant structure effects in Table 
the H=1.6 task (four-man W=.41, C 
five-man W = .37, C= 1.01) shows that 
errors were made in the W than in the C, 
Table 4 the significant structure effect in fi 
man groups shows the W making fewer e 
than the C (H = 1.6 W = .37, H=2.4W 
H=1.6 C= 1.01, H = 2.4 C=.69). Consi 
ing Tables 3 and 4 together, both group siz 
made fewer errors in the W than in the C on 
H=16 task; on the H=2.4 task the 
ences between the W and C are substan 
smaller. Thus, the significant structure effects 
derived largely from differences on the H 
task, which implies, at least, a weak task efi 
No evidence exists suggesting even a weak g 
size effect. 


Discussion 


The analysis of solution times shows 
things: (a) Task difficulty, irrespective of g 
size or structure, produces significance 
ences. This result indicates the validity of 
operation (H) used to define task difficulty. 
The W is superior to the C in all conditions 
one; in four-man H = 1.6 groups there are 
differences between W and C. (c) Group size 
duces effects in some conditions.? In identifi 
these conditions we shall consider four- and fi 
man groups separately and then together. 

Four-man. On the H =1.6 task there are 
differences in the W and C, while on the H = 
task the W is faster than the C. 


? When all 15 problems are considered, four-ma 
groups are significantly faster than five-man g 

One possible reason why group size did not 
stronger effects is because individual work loads 
the four-man and five-man groups differed. For 
ample, the total amount of information, kr, 
task is given by HX N, where N is sampl 
Thus, we had, for example, hr = 1.6 X 4 
The amount of information each subject had 
Process (k: =hr/n, where n is group size) 
four-man, h: = 64/4 = 16.0; five-man, hr = 64, 
12.8. This inequality in individual work loa 
groups of different size, given tasks of equal d 
culty (H) is not inherent in the task. For exam 
if for five-man groups we had made N = 50, 
hr = 1.6 X 50 = 80.0; ki = 80/5 = 16.0, the sam 
as in the four-man groups when H= 1.6 and 
40. Individual work loads were permitted to 
because we wanted to keep group work load, 
with task difficulty, under control. Controlling indi 
vidual work loads is another experiment. f 
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Five-man. Performance is faster in the W than 
in the C on both tasks. 

Four versus five. In the W there are no dif- 
ferences between four- and five-man groups, ir- 
respective of task difficulty. In the C, four-man 
groups are faster than five-man on the H=1.6 
task, but there are no differences between them 
on the H = 2.4 task. 

The solution-time data are consistent with the 
conclusions drawn by Thomas and Fink, 


measures of speed showed no difference or else fa- 
vored the smaller groups [p. 373]. 


In this context, the data show that in the W 
there are no differences between four- and five- 
man groups, irrespective of task difficulty. In the 
C, four-man groups are faster than five-man 
groups on the easy task, but there are no differ- 
ences between them on the difficult task.* 

The data for five-man groups are also consist- 
ent with results reported by Leavitt (1951) and 
Cohen, Bennis, and Wolken (1961); performance 
in the W is faster than in the C. This same re- 
sult was obtained in four-man groups on the diffi- 
cult task. However, the present data are incon- 
sistent with those reported by Shaw (1954), who 
found the C faster than the W in the four-man 
groups. The difference in results is undoubtedly 
due to differences in the tasks used. Shaw’s task, 
which consisted of three puzzle-type problems, 
seems to be more difficult than the present one. 
This conclusion is based on the fact that the 
average solution times per problem on Shaw’s 
task were 12.6 minutes in the W and 11.8 minutes 
in the C, as compared with 2.6 minutes and 3.2 
minutes in the four-man W and C, respectively, 
on the H = 2.4 task. Further, the average num- 
ber of messages per problem passed by Shaw’s 
groups were 52.1 in the W and 65.8 in the (on 
as compared with 7.5 in the W and 16.7 in the 
Con the H = 2.4 task. 

Aside from the question of difficulty, Shaw’s 
data suggest that the superiority of the C over 
the W is primarily the result of the overloading 
of the central point in the W. This conclusion 
follows from: fewer messages were passed in the 


3 Since the error data of the present task seem to 
to be a poor indicator of quality of performance, we 
shall not honor it with a summary in the text. We 
note only that there were no group-size differences. 
At least these data do not conflict with previous 
findings. 
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W than in the C; the central point in the W 
handles from three to four times as many mes- 
sages as the other points in the W (eg., see 
Leavitt, 1951, p. 45). Therefore, the longer solu- 
tion time in the W, as compared with the C, is 
a result of exceeding the effective information 
processing capacity at the Ws central point. If 
this conclusion is correct, then any class of tasks 
requiring a large number of messages will result 
in a breakdown in the effectiveness of the W, 
irrespective of group size. 

The preceding is, of course, conjecture. How- 
ever, the hypothesis may be investigated with 
the present task, given appropriate adjustments, 
for example, increase the number of events in 
the information source, say from 6 to 12 or 
more; increase the sample of events from 40 to 
80; the levels of task difficulty, H=1.6 and 
H=24, may be changed or not, and in any 
case when sample size is increased individual 
work loads will be increased. 
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DISCRIMINABILITY OF REM AND NREM REPORTS * 


LAWRENCE J. MONROE, ALLAN RECHTSCHAFFEN, DAVID FOULKES, 
AnD JUDITH JENSEN 


University of Chicago 
Several recent studies have cast doubt upon the diagnostic efficiency of EEG 


Stage 1, rapid eye movement (REM) periods in the detection of dreaming. In 
the present study it is demonstrated that trained judges can discriminate REM 


period reports from non-REM (NREM) period reports, REM period content- 
producing reports from NREM period content-producing reports, and REM 
period “dream” reports from NREM period “dream” reports. The high degree 
of success attained by the judges indicates that physiological sleep phase, REM 
or NREM, is highly diagnostic of the presence, amount, and quality of re- 


ported sleep mentation. 


Aserinsky, Kleitman, and Dement (Aserinsky 
& Kleitman, 1955; Dement & Kleitman, 1957a, 
1957b) have distinguished two alternating 
physiological phases in the sleep cycle: periods 
characterized by low voltage, random EEG ac- 
tivity (EEG Stage 1) and intermittent rapid eye 
movements (REMs); and periods of 12-14 
cycles-per-second spindle and/or high ampli- 
tude, slow delta activity (EEG Stages 2, 3, and 
4), and an absence of REMs. The two phases 
have come to be known, respectively, as REM 
periods and NREM (no rapid eye movement) 
periods. These authors have also reported that 
the physiological phases are diagnostic of reports 
of dreaming. In their most extensive study (De- 
ment & Kleitman, 1957b), for example, dream 
recall was obtained from 80% of REM period 
awakenings but from only 7% of NREM period 
awakenings. More recent studies (Foulkes, 1962; 
Foulkes & Rechtschaffen, 1964; Goodenough, 
Shapiro, Holden, & Steinschriber, 1959; Kamiya, 
1961; Rechtschaffen, Verdone, & Wheaton, 1963; 
Wolpert, 1960), however, have shown that re- 
ports of mental activity can be obtained much 
more frequently from NREM sleep than would 
have been indicated possible by the earlier work 
of Aserinsky, Kleitman, and Dement. The more 
recent studies agree with the earlier ones, how- 
ever, that dream recall is highly likely following 
REM awakenings and that it is more likely fol- 
lowing REM than following NREM awakenings, 

Nevertheless, the frequency of NREM recall 
in these recent studies (Foulkes, for example, re- 
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ported a figure of 72%) has raised the question 
of the extent to which REM periods are diag- 
nostic of dreaming. A possible resolution of the 


differences of the earlier and later studies may 


be found in the results of Foulkes (1962) and 
of Rechtschaffen, Verdone, and Wheaton (1963). 
These authors have shown that REM content 
was more likely to be labeled “dreaming” and 
less likely to be labeled “thinking” than NREM 
content. REM content was also judged, by the 
experimenter and/or the subject, to be „more 
vivid and clear, to involve greater activity, to 
include more visual imagery, to refer less often 
to recent events in the dreamer’s life, to be more 


— 
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emotional, and to be more distorted and im- 


plausible than NREM content. The findings of 
these two studies suggest that the two physio- 
logical phases of sleep may be diagnostic of dif- 
ferent kinds and amount of mentation, rather 
than just mentation versus no mentation. More- 
over, the nature of the qualitative differences 
between REM and NREM content does seem 
to restore at least some of the justification for 
the earlier position that REM periods are highly 
diagnostic of that class of mentation calle 
dreaming. 

And yet, the issues are not completely 1° 
solved by the data of the Foulkes and the 
Rechtschaffen, Verdone, and Wheaton studies: 
Both studies report some degree of oven 
between characteristics of REM mentation 4m 
those of NREM mentation. The question then 
must be asked, precisely how diagnostic © 
presence, extent, and nature of mentation are 
the two phases of physiological sleep? Perhaps 
the best way of answering this question 1$ iD 
assign judges, who have been instructed in 
nature of typically observed REM-NREM 1% 
call and content differences, the task of dis- 
criminating between REM and NREM reports 
where they have no knowledge of conditions ° 
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awakening for any of the reports. In the present 
study, we have done just this in an attempt to 

l estimate the maximal possible degree of dis- 
criminability which can be obtained between 
REM and NREM reports. 


METHOD 


The materials to be judged consisted of 143 REM 

reports and 84 NREM reports collected from a 

| total of 24 subjects by Foulkes and Rechtschaffen 

(1964). All subjects spent two experimental nights 

in the laboratory on which the following awaken- 

| ings were scheduled, although it was not possible to 
complete the schedule on all subject nights. 

1, NREM: 10 minutes after initial sleep (spindle) 


2, REM: First REM period, 5 minutes after first 


3, REM: Second REM period, 10 minutes after 
|, first REM. 
4. REM: Third REM period, 10 minutes after 
first REM. 
5. NREM: 25 minutes after final REM of fourth 
REM period. 


The two judges were both familiar with the lit- 
erature on REM and NREM recall and on the 
typical qualitative characteristics of REM and 
NREM reports. Judge B had also had considerable 
experience in other sleep-research studies, running 
subjects and obtaining reports of their REM and 
NREM mentation, while Judge A had never had 
such experience. Both judges were completely naive 
with respect to the awakening conditions for any of 
the reports being judged. They performed, in the 
order listed below, discriminations under the fol- 
lowing conditions. 

1. All 227 reports were judged. The distribution 
of REM and NREM reports was unknown to the 
judges, The question asked here was how well REM 
and NREM reports can be discriminated where the 
only information consists of the reports themselves. 

It is known that other factors besides sleep stage, 
REM or NREM, are associated with the probabil- 
ity of dream recall and the nature of dream con- 
tent. Some subjects, for instance, consistently pro- 
duce more recall and more “dreamlike” recall than 
do others, whether one considers REM or NREM 
awakenings. Similarly, Verdone (1965) has shown 
that time of night is related to dream recall and 
to certain dreamlike characteristics of sleep menta- 
tion, with reports from later awakenings having 

; less recent temporal reference and being judged 
more vivid and emotional than content from earlier 
awakenings, Finally, the quantity and quality of 
sleep mentation may be affected by any conditions 
which are unique to any one night. It thus seemed 
likely that the maximal degree of discriminability 
between REM and NREM reports might be found 
Where these three contributors to inter- and intra- 
individual variation in mental content were par- 
tialed out. Therefore, a second discrimination analy- 
sis was performed in which there were controls for 
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the effects of these variables upon judges’ discrim- 
inations. 

2. In this series, 154 reports were judged. The 
judges were presented with 77 pairs of reports with 
the following conditions met in each pair: one 
report was from a REM and the other from a 
NREM awakening; both reports were from the 
same subject; both reports were obtained on the 
same night; both were from approximately the same 
time of night, early (first and second scheduled 
awakening) or late (fourth and fifth scheduled 
awakening). The question asked here was how 
well REM and NREM reports can be discriminated 
with a known distribution and with the simultane- 
ous elimination of three possible sources of error 
variance. 

In Analysis 2, all reports that could be appropri- 
ately grouped were employed, whether or not they 
contained any definite mental content. In the fol- 
lowing analysis, however, the judges dealt with 
only those reports in which at least one item of 
specific content was produced. Discriminations in 
this analysis cannot, then, rely upon the dimension 
of presence or absence of content as a cue in dis- 
criminating sleep phase; they must rely upon the 
nature and/or the amount of the content which the 
subject produces. 

3. Judges were given 92 content reports, sub- 
grouped into 46 pairs of content reports, one REM 
and one NREM, obtained from the same subject 
on the same night and at the same time of that 
night, The question asked here was how well REM 
and NREM content-producing reports can be dis- 
criminated with a known distribution and with 
the simultaneous elimination of the subject, time 
of night, and night sources of potential error vari- 
ance. This analysis is, therefore, comparable to the 
second analysis above, except that it is limited to 
content-producing reports. 

The final analysis further reduces the number of 
reports to be judged; it included only those con- 
tent reports in which the subject labeled his mental 
content as a dream, Discriminations in the fourth 
analysis, then, cannot rely upon the subject’s desig- 
nation of his mental activity as thinking or dream- 
ing, They must rely upon the nature of the dream 
which the subject produces. 

4. The judges were given 34 reports which the 
subjects had labeled dreams—17 REM reports and 
17 NREM reports, matched for subject, time of 
night, and night. The question asked here was how 
well REM and NREM dreams can be discriminated 
with a known distribution and with the simul- 
taneous elimination of three possible sources of 
error variance. This analysis is, therefore, com- 
parable to the third analysis above, except that it 
was limited to dream reports. 


RESULTS 


The proportion of correct judgments for each 
judge for each of the four discrimination analy- 
ses is reported in Table 1. Tn all analyses but the 
first, the proportion of errors for reports judged 
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TABLE 1 
RESULTS or DISCRIMINATION ANALYSES 
Percentage correct 
Nan saa Subjects 
Judge A | Judge B | Average 
1 227 24 731 86.8 | 80.0 
2 154 23 85.7 92.2 | 89.0 
3 92 20 87.0 97.8 | 92.4 
4 34 13 88.2 | 100.0 | 94.1 


REM and for those judged NREM would have 
to be identical, due to the known equal distri- 
bution of REM and NREM reports. In the first 
analysis, 72.5% of the total errors consisted of 
REM reports which judges identifed as NREM 
reports. 

The results show, first of all, that REM and 
NREM reports are highly discriminable. Chi- 
square analyses of hits and misses for REM and 
NREM reports in the first discrimination analy- 
sis yield highly significant p values (< .001) for 
both raters, indicating success considerably in 
excess of chance. 

In order to clarify the bases of this above- 
chance performance, two “mechanical” discrimi- 
nation analyses were performed on the 227 re- 
ports, The first such analysis relied solely upon 
the fact that REM awakenings produce content 
reports significantly more often than do NREM 
awakenings. In this mechanical analysis, all con- 
tent reports were classified as “REM” and all 
no-content reports as “NREM.” The result was 
70% correct identifications of sleep phase. The 
more experienced judge, Judge B, clearly ex- 
ceeded this level, indicating that content char- 
acteristics played some role in his attaining the 
degree of success that he did, while Judge A 
made correct identifications only slightly more 
often than would be expected were he simply 
using the presence or absence of content as his 
basis for classification. 

The second mechanical analysis employed the 
content versus no-content criterion plus an ad- 
ditional salient parameter differentiating REM 
and NREM reports, the perceptual versus con- 
ceptual nature of the report. REM periods 
characteristically yield reports of perceptual 
dreams, while NREM periods yield a fairly high 
proportion of reports of thinking. In this me- 
chanical analysis, therefore, all reports which 
the subjects designated as “primarily percep- 
tual” were classified as “REM,” while all re- 
ports designated as “primarily conceptual” and 
all reports containing no content were classified 
as “NREM.” The result in this case was 74.4% 
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correct identifications. Again, Judge B clearly 
exceeds this level while Judge A closely approxi- 
mates it. This indicates that Judge B, at least, | 
successfully relied upon cues over and above the 
presence of recall and a simple perceptual-con- 
ceptual distinction in attaining his relatively 
high degree of discrimination of REM and 
NREM reports. 

The results in Table 1 further indicate the 
fact that elimination of night, time of night, and 
subject variables was accompanied by an im- 
provement in the discriminability of REM and 
NREM reports (Analysis 2). A comparison was 
made of the judges’ classification of the 154 
reports in Analysis 2, paired for these three 
variables, and of their classification of these 
same 154 reports in Analysis 1, where there was 
neither matching nor a known distribution of 
REM and NREM reports. Both judges improved } 
their discriminative accuracy from the first to' 
the second analysis at the .01 level, as assessed 
by the McNemar test for the significance of 
changes. It is not clear whether this significant 
degree of improvement is due to the partialing 
out of the three variables or to the differing 
nature of the judgment task itself (judging the 
entire group with an unknown distribution of 
REM and NREM reports versus judging within 
pairs). In either event, the second analysis Te- 
veals an extremely high discriminability of REM 
and NREM reports (89% average correct for 
the two judges) and indicates that the potential 
discriminability is underestimated by the previ- 
ous analysis. 

Finally, the results in Table 1 demonstrate 
that not only can REM and NREM reports be 
differentiated, but also NREM content reports 
can be differentiated from REM content reports 
(Analysis 3), and, as Foulkes (1962) noted in 
another context, even NREM “dreams” can be 
differentiated from REM dreams (Analysis 4). 
This last result clearly indicates that NREM 
and REM content can be discriminated quite : 
apart from the thinking-dreaming dimension. 

To what extent results of later-conducted 
analyses might represent some kind of transfer 
from earlier-conducted analyses is not entirely 
clear. The order of the analyses was arranged t0 
minimize inadvertent feedback of any sort t0 
the judges. In addition, a third judge who Pe . 
formed the fourth analysis (with reports U 
Paired) four times failed to show any improve 
ment over those four sessions. Although indi- 
vidual hits and misses varied from session t0 
session, the proportion of correct judgments 
remained absolutely constant. While this is not 
a control for transfer across different analyse 
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it does seem to indicate that familiarity with 
the materials being judged, in and of itself, was 
no particular aid in arriving at correct judg- 
ments, 
Discussion 


The results demonstrate that REM and 
NREM reports can be successfully discriminated, 
particularly: when amount and quality of con- 
tent are considered, in addition to the mere 
presence or absence of dream content; when 
possible error variance due to intersubject, inter- 
night, and intranight variables is controlled by 
presenting judges with pairs of reports, one 
member REM and the other NREM, matched 
for these three variables. Under these conditions, 
discriminability is almost total. 

* The extent to which controls for intersubject, 

! internight, or intranight variables, or the pair- 
.. ing of reports to be judged, individually con- 
tributed to increased discriminability is not 
known, However, what is most important with 
respect to the major intent of this study, that 
is, to describe the discriminability of REM and 
NREM reports, is that the discriminability un- 
der the controlled-paired conditions, being 
higher, necessarily gives a more accurate de- 
scription of the full extent to which reports from 
the two stages differ. The fact that discrimina- 
bility was higher under the controlled-paired 
conditions than under uncontrolled-unpaired 
conditions or under mechanical sorts simply 
demonstrates that all the previous research re- 
ports which relied upon mechanical sorts into 
content-no content and/or thinking-dreaming 
dichotomies, without the use of such controls, 
underestimated the extent to which REM and 
NREM stages of sleep are related to what sub- 
jects report upon awakening. The exact nature 
of the cues the judges used and the relative 
Weights given different cues in the process of 
achieving higher discriminability than that pro- 
duced by the mechanical sorts is unknown, apart 
from the description of typical differences be- 
tween REM and NREM reports given by 
Foulkes (1962) and by Rechtschaffen, Verdone, 
and Wheaton (1963). 

These results seem to complete a full circle 
started with the work of Aserinsky, Kleitman, 
and Dement. It appears that REM periods are, 
after all, highly diagnostic of a kind of sleep 
mentation which has all of the characteristics 
associated with the term dreaming, of the kind 

a vivid, distorted, elaborated, visual-hallucina- 

ory mentation which has been the major interest 
of the clinician. 

And yet, in another respect, the results sug- 
gest that a considerable advance has been made 
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since the earlier studies of sleep consciousness. 
Those studies (e.g., Dement & Kleitman, 1957b), 
having noted the association of dreaming with 
REM periods, gave little attention to the other 
kinds of sleep mentation which occur outside 
REM periods. Little interest was taken in 
“vague, fragmentary impressions of content,” 
and, where more substantial reports were ob- 
tained outside REM periods, they were as- 
sumed to represent residual REM period recall. 
It is now fairly clear, however, that NREM 
mentation cannot be explained in terms of re- 
sidual REM period recall (Foulkes, 1962), and 
it also appears that, at the very least, some of 
the mental activity subjects report on NREM 
awakenings does indeed occur during NREM 
periods (Rechtschaffen, Verdone, & Wheaton, 
1963). Thus there seems to be a continuum of 
sleep mentation ranging from relatively plausi- 
ble, thoughtlike ideation to the hallucinatory, 
bizarre productions we generally think of as 
dreams. While full-blown dreams are sometimes 
reported on NREM awakenings, the kind of re- 
port generally elicited on NREM awakenings is 
easily distinguishable from the more dreamlike 
reports elicited from matched REM awakenings. 
The existence of different types of mentation in 
physiologically different stages of sleep quite 
naturally suggests a further problem for experi- 
mental investigation: the structural and func- 
tional connections between the NREM and the 
REM mentation occurring on any given subject 
night (Rechtschaffen, Vogel, & Shaikun, 1963). 
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EFFECT OF SOCIAL ISOLATION AND PHYSICAL CONFINEMENT 
ON MOTOR AND SEXUAL BEHAVIOR OF GUINEA PIGS* 


HELENE D. GERALL 


Tulane University 


25 male guinea pigs, reared from 2 days of age, were randomly assigned to the 
following conditions of social environment and physical restraint: isolated con- 
fined, isolated nonconfined, social confined, and social nonconfined. It was hy- 
pothesized that the absence or diminution of sexual activity previously observed 
to be associated with social isolation might be a function of a confinement- 
induced increase in frequency of aberrant motor activity. Ss reared in social 
isolation and/or confinement exhibited a high frequency of aberrant response 
but no relationship between motor symptoms and sexual adequacy was found. 
In addition, confinement initiated after maturity resulted in aberrant motor 
activity but no accompanying decrement in sexual behavior. 


In recent studies of guinea pigs reared in 
social isolation (Gerall, 1963; Gerall & Dun- 
ning, 1958) it was observed that the subjects 
not only exhibited inadequate sexual behavior 
but also displayed unusual motor responses con- 
sisting of sudden twitching of the head, twisting 
of the forepart of the body, and/or leaping into 
the air, These behaviors, which will be called 
either aberrant or headshaking responses, are 
not exhibited by the mature, sexually proficient 
male when interacting with an estrous female. 
On the basis of the above observations it was 
hypothesized that the aberrant motor activity, 
regardless of its origin, might effectively compete 
with and thereby alter the normal display of 
sexual behavior (Gerall, 1963). One of the pur- 
poses of the present study, therefore, was to 
examine any possible relationship between the 
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degree of disruption of the mating pattern ang 
the rate of headshaking responses in sociall 
isolated subjects. $ 

A second purpose of the present experiment 
was to determine whether the aberrant responses 
could be induced in socially reared guinea pigs: 
The condition selected for study was physic 
confinement which Levy (1944) has shown leads 
to an apparently similar type of repetitive head 
shaking activity in chickens. If socially reaté® 
guinea pigs can be made to display these t% 
sponses, a comparison of their sexual behavidl 
with that of socially isolated subjects would 
afford another means of evaluating the relative 
importance of rearing circumstances and com 
peting reactions as related to inadequate matne 
behavior. In addition, the results show® 
provide information leading to the identifica 
tion of circumstances or variables related to %4 
development of these aberrant motor tenden 


METHOD 
Subjects 


Guinea pigs from a heterogeneous stock ™ 
tained at the University of Kansas were used 
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subjects. The 25 subjects in the main experimental 
groups were removed from their mothers at 2 days 
of age and assigned to one of four groups according 
to a random listing of the conditions drawn up 
before the start of the study. Some of the groups 
did not receive their quota of subjects because 
fewer than the anticipated number of males were 
born to females mated specifically for this study, 
and three subjects died before the beginning of the 
mating tests. A second group of 5 males and a third 
group of 12 males were weaned at 25 days of age 
and reared in large social cages under usual labora- 
tory conditions. After reaching maturity, the former 
group was confined for 6 weeks in a single ,cage 
having just sufficient floor space for each subject 
to be in a couchant position. Any subject had 
to scramble over another in order to move. The 
remaining 12 socially reared subjects lived in large 
cages and were used only to determine base-line 
oxygen (O:) consumption of subjects in the colony 
at the time of the experiment. 


Procedure 


The 25 subjects in the experimental groups were 
removed from their mothers at 2 days of age and 
randomly placed in one of four types of living cages. 
All were bottle-fed commercial milk modified by 
the addition of sugar, ascorbic acid, and vitamins 
until they were approximately 20 days of age. The 
foureliving-cage conditions were: social, nonconfined, 
N=5; social, confined, N =8; isolated, noncon- 
fined, N=5; isolated, confined, N=7. Two or 
three males lived together in the social cages while 
only a single subject was placed in an isolation cage. 
Physical confinement was produced by adjusting the 
wall panels of specially constructed boxes to the 
subjects’ body size plus approximately 1 extra 
square inch so that they were just able to stand 
and move very slightly. The nonconfined cages were 
relatively large and room for running, playing, etc., 
was available. 

For the mating tests, initiated at 60 days of age, 
each male was placed in a 24 X 24 X 12 inch cage 5 
minutes before the introduction of the estrous 
female. The test lasted 10 minutes or until ejacula- 
tion occurred. Five to 7 days intervened between 
each test. During a test, the experimenter recorded 
in sequence circling, nuzzling, abortive mounts, 
Posterior mounts with clasps, intromissions, and 
ejaculations. Also, all aberrant responses were 
recorded. 

Oxygen consumption of each of the experimental 
and control subjects was measured at 24 and 49 
Sa of age in a metabolism apparatus described 

y Peterson, Webster, Rayner, and Young (1952), 
and was used as an indication of the effect of the 
isolation and restriction conditions upon the sub- 
jects’ physiological status. 


RESULTS 


The percentage of males in each of the four 
nal conditions exhibiting the complete 
exual pattern terminating with ejaculation is 
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TABLE 1 


PERCENTAGE OF ANIMALS IN EACH GROUP EXHIBITING 
THE COMPLETE MALE COPULATORY SEQUENCE ' 
AT Least ONCE IN THREE Matinc TESTS 


Group Social Isolated M 
Nonconfined 80.0 40.0 60.0 
Confined 87.5 28.5 60.0 
M 84.6 33.3 


presented in Table 1. The similarity of the row 
means of Table 1 indicates that the presence 
or absence of physical confinement did not dif- 
ferentially affect the display of the complete 
sexual pattern. The social environment, how- 
ever, was a statistically significant factor related 
to the display of sexual behavior, with a higher 
proportion of isolated. than social subjects failing 
to exhibit all of the mature mating behaviors. 
Similar results were found when the index of 
male sexual adequacy was the presence of the 
posterior mount and clasp of the female’s flanks, 
responses necessary for achieving intromission. 
The posterior mount and clasping responses were 
performed by confined and nonconfined subjects 
on 76.7% and 74.4% of the tests, respectively. 
On the other hand, while social subjects mounted 
and clasped the female on 94.6% of tests, iso- 
lated subjects did so on only 56.5% of the tests. 

In order to analyze aberrant motor tendencies 
of the experimental subjects, the number of 
headshaking responses recorded during a test 
was divided by the duration of time the male 
interacted with the estrous female. A rate 
measure was preferred over a total count of 
headshaking activity since the time span of a 
mating test depended upon when and if the 
male ejaculated. Very little activity of any 
sort occurred during the refractory phase 
following ejaculation. Also, 15-second periods 
during the 10-minute tests in which the male 
neither touched nor approached the female were 
omitted. The mean rate of headshaking per 
minute of activity, computed as described above, 
recorded for each of the four experimental 
groups during the three mating tests is shown 
in Figure 1. A Kruskal-Wallis nonparametric 
test indicated that the null hypothesis of no 
difference among the means of the groups could 
be rejected at the 5% level which was the sig- 
nificance level adopted for all statistical tests. 
Individual comparisons between the means of 
each of the groups revealed that the two iso- 
lated groups did not differ significantly from 
each other, but each had a higher rate of head- 
shaking than either of the socially reared 
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Fic. 1, Mean rate of headshaking activity dis- 
played in each of three mating tests by the four 
experimental groups. 


groups. The rate of headshaking of the socially 
confined group was found to be reliably higher 
than that exhibited by the socially nonconfined 
group. It appears, therefore, that both social 
isolation and physical confinement conditions are 
associated with the occurrence of aberrant 
responses in the male guinea pig. 

As a further experimental check on the finding 
obtained in this experiment that headshaking 
due to confinement, especially in socially reared 
animals, was not related to and probably does 
not interfere with sexual performance, five 
socially reared, mature males were confined in 
a single cage which permitted them no move- 
ment with the exception of climbing over each 
other. Sexual tests were initiated after the sub- 
jects lived 2 weeks in this cage. The rate of 
headshaking and the type of sexual response 
recorded during each test is shown in Table 2. 
Four of the five subjects showed the complete 
sexual pattern in at least one of the three tests 
even though some of them also exhibited a high 
rate of headshaking. One subject did not achieve 
an intromission in any of the tests. To check 


TABLE 2 


RATE or HEADSHAKING (PER MINUTE) AND HIGHEST 
SEXUAL RESPONSE MEASURED DURING MATING 
TEST IN MATURE SOCIALLY REARED SUB- 
JECTS CONFINED IN A SMALL CAGE 


Test 
Subject Es 
I IL Ill IV *, 

3378 | 0.54 (E) | 138 () | 2.70 

3379 | 5.12 (E) | 6.77 (E) | 6.00 D par 

3380 | 1.50 (E) | 0.00 (E) | 0.40 (Œ) 

3381 | 1.71 (M) | 0.30 () | 0.18 (E 

3382 | 4.74 (M) | 8.20 (M) | 2.33 (M) 10.10 AD 
pee = = ejaculation, I = intromission, M = posterior 
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on whether the headshaking interfered with this 
subject’s sexual performance, it and another 
social subject from the group were placed in 
a large cage for 3 weeks and then tested again, 
The results as shown in Column IV of Table 2 
reveal that for both subjects, the headshaking 
tendency was almost eliminated, yet each ex- 
hibited the same sexual responses as observed 
in previous tests. Headshaking activity, there- 
fore, probably was not responsible for the poor 
sexual performance of the subject that had no 
intromission in the previous tests. 

A5 noted previously, 25 socially isolated and 
confined subjects were removed from theirt 
mothers at 2 days of age and fed by the experi- 
menter, whereas 12 control subjects were reared 
with lactating females for 25 days before wean- 
ing. The mean Op consumption rates of the ex- 


TABLE 3 


MEAN Rate OF Oz CONSUMPTION FOR EXPERIMENTAL 
AND CONTROL GROUPS RECORDED AT 24 AND 
49 DAYS OF AGE 


Test 


Group _—_——— 
24 days 49 days M 
Social nonconfined 85.67 69.89 11.18 
Social confined 80.75 75.25 78.00 
Isolated nonconfined | 87.08 71.02 79.05 
Isolated confined 76.17 74.78 75.48 
M 81.72 73.21 77.41 
Control 87.21 | 72.82 | 80.02 


perimental and control groups at 24 and 49 days 
of age are shown in Table 3. The results of at 
analysis of variance revealed that the differences 
among the means of Ox consumption rates of the 
four experimental groups were not statistically 
significant at 25 days of age; the mean rate of 
Os consumption for the four experimenta 
groups was slightly but reliably lower than that 
obtained by the control group. The lower than 
normal O, consumption rate was found only 1 
the confined groups. At 49 days of age, there 
was no significant difference in Oz consumption 


between the experimental and control groups 


Thus, shortly before the subjects were sive? 
their first mating test, they apparently hat 
normal physiological functioning, insofar as this 
is measured by rate of O, consumption. 
decrease in Os consumption rate between 24 an 
49 days of age was statistically significant 4 
similar to that reported in other studies using 
the guinea pig. 


The 


_., of floor space in the isolation cages. 
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Discussion 


As reported in previous studies (Gerall, 1963; 
Valenstein, Riss, & Young, 1955; Valenstein & 
Young, 1955) the sexual performance of male 
guinea pigs was found to be influenced by the 
condition of early rearing. Socially isolated sub- 
jects exhibited fewer mature mating responses 
than socially reared subjects. In those subjects 
affected by the social isolation treatment, st 
responses lower in the hierarchy such as growl- 
ing, sniffing of the anogenital region of the 
female, and circling were adequately, even,*in- 
tensely, exhibited. The posterior mount and 
clasping of the female’s flanks were incorrectly 
performed; consequently, neither intromission 
nor ejaculation was accomplished by these males. 
Both social isolation and limiting the space avail- 
able for movement in the rearing cage were found 


“+. to influence the number of aberrant responses 


emitted by the guinea pig. The frequency of 
these aberrant responses, however, was not 
critically related to sexual performance. The 
results of the present study, therefore, indicate 
that very little of the variance in sexual per- 
formance could be attributed to presence of 
aberrant responses per se. 

Afthough aberrant responses apparently do not 
directly interfere with the expression of sexual 
behavior, the conditions leading to their pres- 
ence are of empirical and theoretical interest. 
Unusual motor responses have been observed in 
many species, including chickens (Levy, 1944), 
mice (Keller & Umbreit, 1956), dogs (Thomp- 
son, Melzack, & Scott, 1956), and human in- 
fants (Bowlby, 1951) reared in physically con- 
fined or socially isolated environments. Levy has 
reported that the frequency of headshaking in 
chickens was directly proportional to the amount 
} A rela- 
tionship was also found in the present study 
between rate of headshaking and physical con- 
finement of the guinea pig. The factor common 
to physical restriction and social isolation in the 
induction of aberrant responses might be the 
diminution in freely expressed motor activity. 
While play, exploratory, and aggressive actions 
are directly limited by physical constraint, these 
behaviors are indirectly inhibited by the ab- 
sence of normally adequate evoking stimuli in 
the social isolation environment. Therefore, the 
critical variable in these studies probably is not 
physical confinement per se, but the effect of 
this environmental imposition on the motility 
of the subject. A considerable variability in 
rate of exhibiting aberrant responses was found 
among confined subjects, especially those reared 
in social groups, and the results might have 
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been more meaningful 
measured directly, rather than indirectly in- 
ferred from restriction, had been correlated 
with rate of headshaking and sexual perform- 
ance. This methodological consideration is par- 
ticularly relevant to the task of identifying the 
underlying cause of aberrant responses in socially 
isolated animals. 

The theoretical implications of aberrant motor 
patterns have been considered by Levy (1944) 
who suggested that various tics may have their 
origin in early restriction of motility, observing 
that such restraint can lead to repetitive move- 
ments in infants and animals. A broader theo- 
retical consideration of motility has been pro- 
vided by Mittelmann (1954) in his discussion 
of the role movements play during personality 
development and the implications their restric- 
tion may have for psychopathology. According 
to this view, movement has properties of drive 
and when blocked can lead to intensification 
or exaggeration of many activities in the organ- 
ism, Another interpretation applicable to aber- 
rant responses has been provided by ethologists 
who would classify them as a form of displace- 
ment activity (Lorenz, 1957), which occurs 
when there is a conflict of drives or when the 
releasing stimuli for a given behavior are only 
partially effective. Displacement responses ap- 
pear to be jrrelevant to the ongoing appetitional 
behavior and presumably involve rather common 
motor tendencies evoked by excitation “sparking 
over” from the inadequately expressed or re- 
duced drives. The observations made in the 
present study can be subsumed within either 
the psychodynamic or the ethological frame- 
works, since the data support the view that 
headshaking responses accompany rather than 
cause the inadequate sexual behavior. According 
to the ethological view, these responses are 
emitted by some of the socially isolated ani- 
mals, partly because their sexual drive has been 
aroused but not reduced by the appropriate 
consummatory responses. In the case of the 
restricted animals, hheadshaking may accompany 
the sexual pattern because of the conflict of 
concomitant arousal of sexual and motility 
drives. However, it is likely that aberrant re- 
sponses in physically confined animals are more 
directly related to the sustained frustration of 
movement rather than a competition of drives. 
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PERFORMANCE EXPECTANCY AS A DETERMINANT 
OF ACTUAL PERFORMANCE: 


DISSONANCE REDUCTION OR DIFFERENTIAL RECALL? 


ALAN S. WATERMAN anv LEROY H. FORD, Jr. 
State University of New York at Buffalo 
An experiment by Aronson and Carlsmith indicated that Ss tended to be dis- 


satisfied with a performance inconsistent with their expectancy, even when the 
expectancy was for a poor performance and the actual performance was excel- 


lent. Dissatisfaction was measured by 
performance when given the opportuni 


confirm their hypothesis, 


Aronson and Carlsmith (1962), working 
within a dissonance-theory framework, investi- 
gated the effect of the disconfirmation of an 
expectancy on subsequent behavior relevant to 
that expectancy. They hypothesized that: a per- 
Son would experience dissonance if his expect- 
ancy about his performance on a task were 
disconfirmed, even if the expectancy was for a 
poor performance and the actual performance 
was a superior one; and the person would, if 
given the opportunity, institute changes in sub- 
Sequent performance on the same material to 
bring it into consonance with the expectancy, 


the degree to which Ss changed their 
ty. The results of the present study indi- 
cated that Aronson and Carlsmith’s method of manipulating expectancy pro- 
duces differences between the high- and low-expectancy Ss in their ability to 
recall their performance and, therefore, in their capacity to change it appro- 
priately. Thus, recall is a confounding variable which may account for Aronson 
and Carlsmith’s results. An independent test, using recall scores, also failed to 


even if such changes involved lowering the 
of performance. The results of their st 
pear to provide striking support for 
Potheses; but a closer inspection of th 
cedure suggests that the results may 
differences between the high- and low: 
ancy groups in the ability of the subj 
recall their original performance and, th 
in their ability to change the performanti 
Propriately. y 

The possibility of differential recall b 
the expectancy conditions used by Arons 
Carlsmith may be hypothesized on at 
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separate grounds. The series of successes given 
to the high-expectancy group may have resulted 
in the acquisition by the subjects of a consistent 
rule or method for making their choices, while 
„the series of failures experienced by the low- 
expectancy group would tend to discourage the 
development of any consistent rule. Such a rule 
would facilitate recall since the subject could 
simply reapply the rule when attempting to recall 
his previous responses. A second possibility is 
that the poor scores reported to the low-expect- 
ancy subjects may have resulted in their be- 
coming less interested in the task than the highs. 
Thus, the lows may have paid less attention to 
the choices they were making and, therefore, 
may have been less able to recall them. 

* The experiment conducted by Aronson and 
Carlsmith involved the implicit assumption that 
all groups were equally able to recall their re- 
sponses on the fifth section when that part was 
readministered. If the groups differed in the 
degree to which the subjects correctly recalled 
their original choices, then differences among the 
groups in the number of original choices changed 
might reflect the effect of such differential re- 
call. Tf the high-expectancy subjects who received 
a high score on the last part of the task tended 
to recall correctly and stay with their recalled 
Tesponses, while the low-expectancy subjects who 
received high scores tended to recall incorrectly 
and also stay with their recalled responses, then 
the latter group would appear to have “changed” 
more of their original responses than the former 
group. Similarly, if the low-expectancy subjects 
who are given a poor score on the last part of 
the task accurately recall fewer of their original 
choices than do the high-expectancy subjects 
who are given a poor score on that part, and if 
both groups tended to change their recalled re- 


- sponses, then fewer of the original responses 


could be changed in the low-expectancy group 

and more recalled responses might inadvertently 

be changed back to the original. In this case, 

the low-expectancy subjects would appear to 

have changed fewer of their original responses 

ma would, therefore, appear more satisfied with 
eir original poor performance. 

The net effect of the differences in recall would 
appear to support the contention made by Aron- 
oa a Carlsmith that the subjects were moti- 
ee to maintain cognitive consistency. If, how- 
nee differential recall hypothesis is correct, 
a the observed results may be consistent with 
es that on a cognitive level the subjects 
i rying to retain a high score or improve @ 
ow score regardless of their expectancies. Any 
confounding of these two variables would cast 
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doubt on whatever conclusions might be drawn 
regarding the direction of the subjects’ attempts 
to reduce dissonance. 

The present study was undertaken to deter- 
mine if the experimental design used by Aronson 
and Carlsmith created a differential in the ability 
of the high- and low-expectancy groups to ac- 
curately recall their previous responses. Since 
Ward and Sandvold (1963) have suggested the 
importance of experimenter demands, these de- 
mands were minimized to test for the effect over 
and above the reported variable in their study. 


METHOD 


Subjects and Conditions 


The subjects were 40 male undergraduate students 
at the State University of New York at Buffalo en- 
rolled in the introductory course in psychology. They 
were randomly assigned to one of four experimental 
conditions: a high performance expectancy confirmed 
(high-high), 2 high performance expectancy discon- 
firmed (high-low), a low performance expectancy 
disconfirmed (low-high), and a low performance ex- 
pectancy confirmed (low-low). 


Procedure 


As in the study by Aronson and Carlsmith, the 
experiment was introduced as an attempt to corre- 
Jate interview-type personality tests with short an- 
swer paper-and-pencil tests. The subjects were told 
that they would take a few short answer tests dur- 
ing the first session and that they would be inter- 
viewed at a later date at their convenience. A short 
semantic differential self-rating scale was adminis- 
warm-up device. 

The next test to be administered was the actual 
experimental task which was the same as the one 
used by j 


asked to identify 
mounted on a 5X 8 inch index card which of them 
had been diagnosed as suffering from schizophrenia. 
There were no correct answers since all of the pic- 
tures were taken from an old college yearbook. One 
hundred such items were arranged into five groups 
of 20 cards each. Each card was presented by the 
experimenter and held up until the response was 
given. The subjects were informed that they had to 
make their choice within 10 seconds. 

All subjects were informed that some people scored 
extremely well on the test, getting as many as 85% 
correct, while others performed very poorly, getting 
as few as 20% correct. It was also pointed out that 
it was very difficult for people to judge their own 
performance on the test. 

The subjects were told that they would be given 
a 3-minute rest period after each section. During 
these periods the experimenter pretended to com- 
pare the subject’s responses with an answer key. 
Prearranged false knowledge of results, either high 
or low, was reported to the subjects after each of 
the first four sections in order to build up the ap- 
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TABLE 1 
MEAN NUMBER OF RESPONSES CORRECTLY RECALLED 


Score reported on fifth section 


Performance 

expectancy iA High 
High 14.8 15.9 
T 11.9 12.5 


propriate performance expectancy for the experi- 
mental condition to which they were assigned. Half 
of the subjects (high-expectancy groups) received 
scores of 17, 16, 16, and 17, respectively, and half 
of the subjects (low-expectancy groups) received 
scores of 5, 4, 4, ánd 5, respectively. On the fifth 
section, half of each group received scores of 17 
(high performance) and half of each group received 
scores of 5 (low performance). Subjects scored their 
own performance on this section with an answer key 
given to them by the experimenter while he pre- 
tended to prepare for the next task. In reality, the 
answer key was prepared at the time the subjects 
were responding so that they received the proper 
prearranged score, 

The procedure to this point was the same as that 
used by Aronson and Carlsmith with two exceptions. 
The experimenter demands hypothesized by Ward 
and Sandvold (1963) were minimized by not paying 
the subjects and by removing any mention of the 
reliability or validity of the test from the instruc- 
tions, 

At this time, the fifth section was readministered ; 
but instead of asking subjects to pretend it was a 
completely new set of cards, as was done by Aron- 
son and Carlsmith, they were requested to make two 
specific responses to each card, First, they were told 
to remember, as accurately as they possibly could, 
which of the faces they had selected the previous 
time, Second, they were asked to select which of the 
faces they now believed to be that of the schizo- 
phrenic. Thus, two responses were recorded for each 


subject, the one that was recalled and the final 
choice, 


RESULTS AND Discussion 


The mean accurate recall scores for each of the 
groups are shown in Table 1. A summary of the 
analysis of variance is given in Table 2. The 
mean recall scores indicate that the high-expect- 
ancy subjects were better able to recall their 
original fifth-trial choices than were the low- 
expectancy subjects (F = 14.90, p < .001). Thus, 
the implicit assumption that all groups were 
equal ïn the ability to recall their responses is 
untenable for the design employed by Aronson 
and Carlsmith. The effects of expectancy discon- 

_firmation and the ability to recall accurately are, 
“therefore, confounded in their study. Although 
both variables lead to the same results on the 
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measure of discomfort used by Aronson and 
Carlsmith, they lead to opposite conclusions con- 
cerning the appropriateness of the subjects’ be- 
havior on a cognitive level. While the need for 
cognitive consistency implies that subjects will 
actively endeavor to maintain a low performance 
consistent with a low expectancy, the effect of 
differential recall implies that although trying to 
attain a high score they are inadvertently appear- 
ing to perform poorly. 

It should be noted that the present recall 
scores may not be a valid index of the degree 
of accurate recall under the conditions of the 
Aronson and Carlsmith study since the present 
procedure of asking subjects to actively remem- 
ber their previous responses probably resulted in 
greater saliency of recall than would occur under 
the condition of their study. Thus the present 
study does not provide a means for determining _ 
the degree to which Aronson and Carlsmith’s re- 
sults may be a function of differential recall. The 
present findings do, however, indicate that the 
effects of dissonance reduction and differential 
recall are confounded in their study and that, 
therefore, no definitive conclusions can be drawn 
from their data. A further implication is that 
future studies using similar procedures willsalso 
yield equivocal results unless the design can be 
modified to avoid such confounding. 

The accuracy of recall may also be seen as 4 
measure which provides an independent test of 
Aronson and Carlsmith’s hypothesis. They hy- 
pothesized that a person would attempt to re- 
duce the discomfort resulting from a dissonant 
successful performance by “denying” the per- 
formance (p. 182). They also suggested that A 
“behavioral” measure should reflect the person's 
discomfort, even if he verbally expresses satis 
faction with the successful performance (p. 179). 
From this point of view, it would appear that oné 
means by which subjects might “behaviorally 
reduce any dissonance aroused by a performance 
inconsistent with their expectancy would be t0 
“deny” the memory of that performance by hav- 
ing poor recall of it. This would lead to the pre 


TABLE 2 


SUMMARY OF ANALYSIS OF VARIANCE 
FOR RECALL SCORES 


_ 
Source af SS MS F 
Expectancy (A) | 1 | 99.22 | 99.22 | 14.90* 
Fifth section (B) | 1 7.22 7.22 1.08 
AXB 1 63 63 09 
Error 36 | 239.90 6.66 
Total 39 | 346.97 


*p <.001. 
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diction that groups whose performance expect- 
ancies are disconfirmed will have lower recall 
scores than groups whose expectancies are con- 
firmed. However, the results show that the inter- 
action effect is nonsignificant (F =.09), thus 
“failing to support the prediction. 

In conclusion, it is suggested that the presence 
of differential recall as a confounding variable, 
and the failure of the independent test of Aron- 
son and Carlsmith’s hypothesis using recall scores, 
together cast considerable doubt on their conclu- 
sion that the tendency to confirm a low perform- 
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ance expectancy outweighs the tendency to be 
satisfied with a high actual performance. 
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1 PERSONS AS MEDIATORS: 
A FOUR-STAGE PARADIGM + 


SEYMOUR M. BERGER 


Indiana University 


© An analysis of a social learning situation is presented in 


terms of a 4-stage 


paradigm from mediational learning theory. Ss were required to learn pairs of 


of a well-known celebrity. 


lated rather than unrelated persons. 


Social learning, in the sense of learning from 
others, by definition involves the presence and/or 
actions of another person, or agent (A). In a 

By previous study (Berger, 1965), it was shown 
that A can serve as a mediator for learning. In 
accordance with a three-stage paradigm from 
mediational learning theory (Jenkins, 1963), re- 
sponses associated with the same A were learned 
Fa rapidly than responses associated with dif- 
oe A’s, The study reported below employs 

| af our-stage paradigm to investigate whether 
E ae A’s can serve as mediators if they have 
| ae prior association: A;—As, Ai— B, As 
| pa —C. If A; and Ag are associated in the 

É nod FE then in the final stage Response B 

Mediati ave some association with Response C. 
ation is inferred if the B— C association 
learned more rapidly when A, and Ag were 

ey aa ae Saye was supported by Research Grant 
nae ae the National Science Foundation. The 

. of Sand: shes to acknowledge the able assistance 

ra Johansson who served as experimenter. 


each pair had some professional relationship, 
celebrities associated with each pair had no special relationship to each other. 
The former group learned the pairs more rapidly than the latter group. Sepa- 
rate responses acquire greater coherence when they 


nonsense syllables, Each syllable previously had been associated with a picture 
For one group of Ss, the celebrities associated with 


while for another group the 


have been learned from re- 


paired rather than unpaired in the initial stage. 
This paradigm has implications for social learn- 
ing phenomena. For example, social role learning 
should be facilitated when different behavioral 
aspects of the role are learned from related 
persons (e.g. members of one’s family) rather 
than unrelated persons. 

Rather than building up relationships between 
different A’s through extensive training, this 
study takes advantage of strong existing associ- 
ations which have been brought about through 
exposure to mass media. Motion picture, tele- 
vision, and recording celebrities who have re- 
ceived considerable publicity as a team, ‘or as 
members of the same cast, should have a high 
degree of association for most college students. 
The pairs of celebrities serving as A’s were: 
Bing Crosby and Bob Hope (numerous motion 
picture and television associations) ; Ginger 
Rogers and Fred ‘Astaire (motion-picture dance 
team); Ringo Starr and John Lennon (popular 
members of the singing group called the 
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“Beatles”); Matt Dillon and Kitty Russell 
(members of the Gunsmoke television show); 
Jed Clampett and his nephew Jethro (members 
of the Beverly Hillbillies television show); 
Gomez and Morticia (members of the Addams 
Family television show); and Ben Cartwright 
and his son Hoss (members of the Bonanza 
television show). 


METHOD 


Thirty students were recruited from the introduc- 
tory psychology courses at Indiana University. 

The subjects were run individually and instructed 
that the experiment was concerned with how people 
learned new names for familiar objects. The familiar 
objects were television and movie celebrities. The 
subjects were to learn a 3-letter nonsense syllable 
name for each celebrity. Prior to beginning the 
experiment, however, the experimenter made certain 
that the subject recognized each celebrity. Slides con- 
taining pictures of each celebrity were projected 
on a screen in front of the subject. The subject was 
asked to call out the stage name of all the motion 
picture and recording celebrities, and the name of 
the character played by each of the television celeb- 
rities, The experimenter assisted the subject with 
names as necessary. Each celebrity appeared on a 
different slide with a pause separating the different 
pairs of celebrities. The list of pairs was repeated 
until the subject could name all of the celebrities 
correctly. Typically, all celebrities were named cor- 
rectly after two presentations, although a few 
subjects required as many as four. 

Following this initial familiarization stage, all 
subjects were shown the first list of celebrity- 
syllable combinations. The celebrity and the syllable 
appeared on the same slide. One member of each 
pair of celebrities appeared in the first list of eight 
items, After each presentation of the list, the subject 
filled out a test by printing the associated syllable 
next to the name of each celebrity. The order of 
presentation of the celebrity-syllable items during 
training, and of listing the celebrities on the test, 
was randomized from trial to trial. The training 
continued until the subject correctly matched the 
nonsense syllables with the celebrities on two 
consecutive trials, 

The procedure for the second list was exactly 
the same. This 8-item list contained the remaining 
member of each pair of celebrities with a different 
nonsense syllable, 

Each slide was exposed for 2.5 seconds with a 
4-second pause between slides. 

In the final stage, the subject was told that 
he would learn pairs of nonsense syllables that had 
appeared in the previous lists and that each syl- 
lable would appear on a different slide, with a pause 
between pairs of syllables. Each syllable was ex- 
posed for 2.5 seconds with a 4-second pause between 
pairs. After each presentation of the 8-item list, the 
subject was tested. The first syllable of each pair 
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asked to print the second syllable next to it. Only 
two test trials were used. 

The subjects were alternately assigned in advance 
to either the mediation condition, Group M, or the 
nonmediation condition, Group NM. Both groups 
were given the same pairs to learn in the final stage, 
The conditions of mediation were varied by re- 
arranging the celebrity-syllable items in the second 
list. In this way it was possible to have a given 
pair of syllables in the final list associated with 
related celebrities in the previous lists for Group M 
and with unrelated celebrities in the previous lists 
for*Group NM. 


was listed on an answer sheet and the subject was | 
i 


RESULTS 


Both groups took essentially the same number 
of trials to complete the first, second, and thitd 
stages of training. mh | 

Group M differed reliably from Group NM it 
the total number of items learned, summed over 
the two test trials? The mean total number of 
correct responses was 12.40 for Group M and 
6.33 for Group NM (Mann-Whitney U = 26.5, 
p<.01). 

Since the fourth-stage difference could be 
attributed to the facilitating effects of media- 
tion in Group M or to the inference effects of 
attempts at mediation in Group NM, an error 

| 


analysis of Group NM’s test trial data was pet 
formed in an effort to identify which effect 
took place. On the first test trial, 13 out of the 
15 subjects responded with one or more incon 
rect syllables.’ These errors were classified into 
two categories: mediational and nonmediational. 
An error was considered to be mediational if 
the nonsense syllables were matched in accord- 
ance with related stars from the initial stage 
All other errors in matching were considere! 
as nonmediational for this analysis. Comparing. 
the mediational errors with nonmediational er 
rors for each subject revealed that only 1 sub- 
ject made more mediational errors than E 
mediational errors (p<.01 by sign test wi 
N = 12). Only 3 out of 13 subjects made ya 
which could be classified as mediational (le 
these subjects made four mediational errot 
while the other 2 made only one each). ; 
mediational errors occurred on the second tes 
trial. i i 


2 The groups also differed reliably on each of w 
trials (p<.01 in each case), with the large, 
difference occurring on Trial 1. * the 

Even though the instructions encouraged t 
subjects to guess on the test, the subjects did ri 
always respond to every item. These omissions cout 
not be treated in this error analysis, of course. 
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Discussion 


Previously it was shown that the same indi- 
vidual can serve as 4 mediator for learning 
(Berger, 1965). The present study shows that 
different but related persons also can serve as 
“mediators. The group that learned pairs of 
“syllables which were previously associated with 
related persons learned these pairs more rapidly 
than the group which had to learn the same 
pairs of syllables which were previously associ- 
“ated with unrelated persons. This difference is 
“interpreted as a reflection of the facilitative 
effects of mediation rather than interference. 
If attempts at mediation interfered with Group 
_NM’s learning on the test trials, then their 
errors might be expected to show some tendency 
toward mediation, This was found not to be 
T the case. 
. No attempt is made in this study to identify 
the dimensions of relationships between persons 
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that can function to mediate learning. Personal- 
ity and social characteristics of the persons 
serving as mediators, as well as the personality 
and social characteristics of the individual 
learner, should provide some worthwhile parame- 
ters for further study. 

Much of the research and theory in social 
learning has been based upon reinforcement 
paradigms, The present research demonstrates the 
feasibility of applying mediational paradigms 
from verbal learning to enrich our understanding 
of social learning phenomena. 
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The findings of the APA’s Project on Scientific Information Exchange in Psychology ca 
attention to the difficulties of the scientist in establishing and maintaining contact with cu 
work relevant to his interests. Accordingly, the project is sponsoring a series of changes duti 
the next 12-15 months which are designed to enhance scientific communication. These chan 
are planned as limited trials. Moreover, they are wholly supported by the Project and 
do not cut into our page allotment. 

The Editor has agreed to make possible a test of one of the changes, that is, the titles a 
authors of accepted papers will be listed following the last articles in the journal. Such listi 
will make it possible for our readers to become aware of research many months in the advan 
of its journal publication. At the end of the year’s trial the results of the trial will be evaluat 
and further consideration will be given at that time as to whether the service is worth continui 

This listing plus those published in the preceding 1965 issues are the entire backlog of mam 
scripts accepted by this journal through March 1965. 


1965 (March) Manuscripts Accepted for Publication in the 
Journal of Personality and Social Psychology 


Individual Differences among Subjects and Experimenters and Subjects’ Self-Description: Irwin G. Sarason" 4 
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‘Film Violence and the Cue Properties of Available Targets: Leonard Berkowitz and Russell G. Geen*: Departmt 
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The Structure of Boredom: P, James Geiwitz*: Department of Psychology, University of Michigan, Ann Ath 
Michigan 48104. 
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Development and Validation of Ego Identity Status: James E. Marcia*: Department of Psychology, State U 
versity of New York at Buffalo, Buffalo, New York 14214. 
Achievement Motivation and Task Recall in Competitive Situations: Bernard Weiner*: Center for Persona 
Research, University of Minnesota, Minneapolis, Minnesota 55455. 
The Representation in Dreams of Material Presented Subliminally during Sleep: Calvin Hall*: Institute of Dit 
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Relationships of Test Anxiety and Hostility to Description of Self and Parents: Irwin G. Sarason* and 
Koenig: University of Washington, Seattle, Washington 98105. 
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Replicability of Two GSR Indices as a Function of Stress and Cognitive Activity: Lyle H. Miller and Barty 
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NOTE.—The address for Timothy C. Brock, whose article was listed in the March issue of this journal, § 


be cans to: Department of Psychology, Ohio State University, 1945 North High Street, Columbus; 
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ANTICIPATORY BELIEF LOWERING FOLLOWING 
FOREWARNING OF A PERSUASIVE ATTACK * 


WILLIAM J. McGUIRE ann SUSAN MILLMAN 


Columbia University 


It has been suggested earlier (McGuire & 
Papageorgis, 1962) that forewarning a per- 
son of an impending persuasive attack may 
have a paradoxical effect: the warning, what- 
ever jts effect on resistance to the attack 
when it comes, may itself weaken the belief 
that is designated as the intended target. The 
Prediction that forewarning of attack results 
in an anticipatory lowering of the belief prior 
to the attack can be derived from several 
alternative sets of assumptions, but in design- 
ing the present study we were guided by 
what we call “self-esteem” theory. 

This theory posits that people tend to be 
anxious about their ability to resist social 
pressure; the person suspects that he will 
prove more susceptible to influence than he 
would like to appear to himself and to others. 
Hence, when told in advance that certain 
beliefs (which he happens to accept) are 
about to be exposed to skillful persuasive 
attacks, the person worries that he will suc- 
cumb to the arguments and appear gullible. 
An „elegant self-esteem preserving (and face- 
saving) maneuver in this situation is for the 

A to decide, before the attack comes, 
hs did not really hold the target belief 
a e first place. To this end he will mod- 

rate his belief away from his initial position 
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(which is to be attacked) when he gets the 
warning. Then when the attack comes, it will 
reduce his belief to the given point, but the 
change from just before to after attack will 
be less for this forewarned person, who has 
subtly traversed part of the distance between 
the warning and the attack, than it will be 
for the unforewarned person. In Janis’ (1962) 
terminology, the individual responds to warn- 
ing of persuasive attack in a way that will 
provide reassurance, rather than by hyper- 
vigilance. 

Clearly, the psychological availability and 
suitability of this anticipatory-lowering strat- 
egy depends on a number of factors. Hence, 
we introduced several additional variables 
into the design to test some of these further 
implications about conditions likely to inter- 
act with the forewarning-of-attack variable in 
inducing anticipatory lowering of belief. First, 
we added a type-of-issue variable, since the 
believer’s need to make use of this strategy 
will be more pronounced to the extent that a 
change of belief following the attack is clearly 
indicative of gullibility rather than of some- 
thing more commendable. If the issue is a 
technical matter-of-fact one, rather than an 
emotional unverifiable one, the person could 
ascribe any opinion change after the attack 
to a commendable openness to evidence and 
ability to take in new information, rather 
than to a socially undesirable inability to 
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stand up for one’s own opinion. Second, we 
introduced a positive versus negative source- 
valence variable. The believer is likely to 
use the anticipatory lowering strategy to the 
extent he expects the impending attack to 
produce a great deal of change. If he expects 
that it will come from a highly persuasive 
source he should show more anticipatory 
change than if he expects an attack from a 
less effective, negative source.” 

The above predictions all deal with main 
or interaction effects of the forewarning of 
attacks, prior to any actual attacks. As de- 
sctibed above, self-esteem theory does not 
predict that the forewarning will mitigate the 
effect of the attack when it comes. We did, 
however, introduce the additional variable of 
attack versus no attack into the design to 
check this assumption of no effect and also 
because self-esteem theory did predict that 
the actual-attack manipulation would inter- 
act with our other two variables, type of issue 
and source credibility, in determining the 
final belief levels. 

The actual attacks should be more effec- 
tive against the technical than the emotional 
issues, since, as discussed above, changing 
one’s opinion on a technical issue can be 
interpreted as an openness to information, 
intellectual flexibility, and ability to assimi- 
late new information (all desirable traits), 
rather than to the undesirable gullibility that 
is more clearly indicated by a yielding on 
the controversial, emotional issues. It will be 
noted that the very principles that required 
our predicting that the forewarning would 
have a greatey impact on emotional than on 
the technical issues, requires our predicting 
that the actual attacks will have a lesser im- 
pact on these emotional than on the technical 
issues. It also follows from self-esteem theory 
that the attacks will have more persuasive 


2 A third relevant factor is the ambiguity of the 
believer’s initial opinion on the issue. If he is al- 
ready clear in his own mind (and especially if he is 
publicly committed) to an initial stand on the issue 
prior to the forewarning of attack, it would hardly 
be feasible to employ this face-saving anticipating 
lowering strategy. Initial ambiguity was not manip- 
ulated in the present study because there are two 
earlier studies bearing on it (McGuire & Anderson. 
1959; Sears, Freedman, & O'Connor, 1964). d 


Wut1am J. McGuire AND Susan MILLMAN 


impact if attributed to highly respectable 
sources rather than to disreputable ones, on f, 
the assumption that it is less damaging to 
self-esteem to have yielded to a reputable 
person, 


METHOD 


General Procedures \ 


Participation in the experiment required one 15- 
minute session. The 96 subjects were run in small 
groups ranging from 12 to 20 during the regular 
class meetings of the various sections of an educa- 
tional-methods course. All of the materials and 
measures were administered silently to the subjects | 
by means of preprogramed mimeographed booklets 
that were designed to allow each of the participants 
at a given administration to be serving in a different 
condition, 

The experiment was explained to the subjects a 
designed to develop a test of analytic thinking abil- | 
ity that could be used by the Federal government 
in selecting technical personnel for the Foreign 
Service, the CIA, and other sensitive agencies. To 
justify the mechanical, preprogramed administration 
procedure, the subjects were told that the study 
was being conducted cooperatively by several large 
universities and hence it was necessary for us to 
exercise care in keeping the conditions highly stand- 
ardized from college to college. To those ends, all of 
the instructions were printed in their test booklets. 
These instructions were to be read silently by the 
students and no further questions could be asked of 
the experimenter, Also, the time allowance had beet 
carefully predetermined and the subjects were 1 
quested to work on a given page only during the 
time provided for it, and not to turn ahead of 
back at any time, except when instructed by the 
experimenter to turn to the next page. 

The first pages of all booklets told the students 
that the test section of the booklet contained read” 
ing passages that had been written by skillful Pe 
cialists on the topics being discussed, Who w 
endeavoring to convey their points of view. The 
students would be required to read these passag” 
analytically, and then their own opinions 0n the 
issues would be measured, The instructions con 
ued by stating that a wide variety of passages Wi 
being pretested and that different passages -jbl 
go to different people, but four of the poss! i 
topics were listed to give them an idea of the WP 
of issues to be discussed. This listing constitul 
the forewarning of attack: for any one subject p 
additional items went unmentioned and constitute 
the “no-forewarning” condition. The four mention : 
issues included two technical ones and two 
tional ones; the four unmentioned issues Were sim 
larly subdivided. i. 

Next followed the prior description of the son 
Subjects in the high source-credibility condi 
were told the passages would come from editors i 
learned journals in the areas discussed, the pass 
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being taken from reports (urging the actions indi- 
cated) that they had made as chairmen of Presi- 
dential Advisory Committees to which they had 
been appointed by either President Eisenhower or 
President Kennedy. The other half of the subjects 
(in the low source-credibility condition) were told 
the passages come from highly skilled criminals 
who had been convicted of rather distasteful crimes 
which carried heavy penalties, the passages being 
taken from part of their skillful defense testimony, 
arguing for the point indicated. 

The subject then read persuasive messages; On 
four different issues, each of a Jength that filled a 
single-spaced letter-sized sheet. Before each message 
was a short description of its source (depicting him 
in detail as a presidential advisor and editor, or as 
a skilled criminal, depending on the source condi- 
tion for that subject) which the subject received 1 

l minute to read. Four minutes were allowed for 
reading the persuasive message itself, and to pick 
out and underline the principle clause in each of its 

-*three paragraphs (these underlining tasks having 
been introduced to assure attention to the pas- 
sages). The four messages received by any one sub- 
ject include two on forewarned issues (one technical 
and one emotional) and two on issues that had not 
been mentioned in advance (again a technical and 
an emotional issue). The subjects were then asked 
to fill out an opinionnaire designed to measure their 
personal opinions on all issues, their responses to 
this opinionnaire providing the dependent-variable 
scores, 


Materials 


The eight issues used included four emotional and 
four technical ones. The four emotional ones were 
as follows: the high likelihood of further Communist 
takeovers in Latin America; the difficulties of devel- 
oping a cure for cancer; the probability of a serious 
economic depression in the United States; and the 
growing likelihood of a third world war. The four 
Noncontroversial technical issues included: the grow- 
as shortage of laboratory animals for experimental 
ee the failure of the earth sciences to aid in 

nding oil deposits; the continuing need for pro- 
planes in commercial aviation; and the likely 
AN the sales tax. Prestudy revealed that 
ae opinions of college students were preponder- 
Rea the opposite sides of the issues from those 
tien above. Since in the forewarnings of attack 
4) essages were represented as taking an extreme 
tical Pi the unpopular sides of the issues, prac- 
h all of the subjects could anticipate receiving 
their sad arguing for a position far removed from 
Bic wn initial positions on each of the forewarned 
(eras on each issue was about 600-words 
“graph Ae en into three paragraphs. The first para- 
that a eied the thesis of the message and stated 
1 the ae people were largely in agreement with 
At why the expressed. Two of the “many” reasons 
Mm Were th experts were in agreement with the thesis 
i en stated, and the following two paragraphs 
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TABLE 1 
Fwar BELIEF LEVEL IN ALL CONDITIONS 
TOE Emotional Technical 
nia 
oN Positive | Negative | Positive | Negative 
source source source source 
No warning, 
no attack 9.29 9.53 9.79 9.55 
Warning, 
no attack 8.60 8.76 9.66 9.39 
No warning, 
attack 7.87 8.73 6.83 7.67 
Warning, 
attack 7.11 8.48 7.00 7.94 


Note.—Scores are the means based on a 15-point scale, with 


the attacks aimed at lowering the mean, Each cell mean based 
on 48 subjects. 


developed these reasons in detail. The tone of pres- 
entations was scholarly and the arguments used 
were rational ones making use of statistics and 
technical analyses of the situation. 

The final opinionnaire that furnished the depend- 
ent variable scores contained 50 items, 6 dealing with 
each of the eight issues and 2 repeated items to allow 
a reliability check. Each item was in the form of a 
statement paraphrasing the thesis of the message on 
that issue. The subject was asked to indicate his 
agreement with each statement by circling a number 
between 1 and 15 on a scale immediately beneath 
the statement. The 6 statements dealing with any 
issue were counterbalanced for acquiescence when 
presented to the subjects, but in analyzing the data, 
these raw scores were converted so that in the pres- 
ent paper, the closer to 15,00 the reported condition 
mean, the less successful was the attack or warning 
in reducing the belief. 


Design and Subjects 


The study included four two-level experimental 
variables in a 2X2 x2X2 design, including: 
warning versus no warning, attack versus no attack, 
emotional versus technical issues, and high- versus 
low-credibility sources. The first three were within- 
subject variables. Hence, each subject served in 
eight conditions, each on a separate issue, so that 
eight issues were required (four technical and four 
emotional). The specific issues were systematically 
rotated around the experimental conditions from 
subject to subject. The fourth (source-credibility) 
variable was the only between-subject manipulation 
of theoretical interest, and 48 subjects served in 
the high-credibility condition and another 48 in the 
low. 

The subjects were 99 students taking an educa- 
tional-methods course required for a teacher’s cer- 
tificate. Most of them were recent college graduates 
in liberal arts who had gone into elementary or 
secondary school teaching and needed this and other 
educational-methods courses to qualify for a perma- 
nent teacher’s certificate. The experiment had not 
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been announced in advance and all students who 
came to the regular class meeting (about 90% of 
total enrollment) were used in the study, except 
that the data of the last three students who would 
have constituted a part of a thirteenth replication 
and resulted in unequal n’s among conditions, were 
excluded to make statistical analyses more con- 
venient, thus reducing the total N from 99 to 96. 
Some characteristics of the 96 subjects used were as 
follows: 88% were females; the age range was 
from 20 to 55 years. 


RESULTS 
Main Effects of Warning 


Our main hypothesis was that the warning 
per se would produce an anticipatory lower- 
ing of the beliefs, prior to the actual attacks. 
The appropriate data for testing this predic- 
tion are the mean belief levels in the “warn- 
ing, no-attack” row of Table 1 as compared 
with the mean of the “no-warning, no-attack” 
row. The difference between the respective 
means, 9.10 and 9.54, is significant at the .05 
level. (We used the residual within-subjects 
variance as the error term.) The effect of 
warning on the impact of the actual attack is 
negligible: the belief level in the “warning 
and attack” row is 7.78, which is almost 
identical with that in the “no-warning, at- 
tack” row, 7.77. Each of these “attack” 
means is significantly (p < .001) lower than 
the corresponding “‘no-attack” mean. 

Hence, as predicted, the warning itself re- 
duces the belief and so does the actual attack, 
but the two effects are not additive. The be- 
lief ends up after attack about 1.77 points 
below the prewarning level, whether or not it 
was preceded by a warning. What the warn- 
ing seems to do is to move the believer in 
advance part of this distance, about .44 
point, so that when the attack comes he has 
only to move the remaining 1.33 points. The 
forewarned person, by subtly adjusting his 
belief prior to the attack, has to face the fact 
that he was swayed from before to after the 
attack only three-fourths the distance he 
would have to admit changing had he not 
been forewarned. 


Interaction with Type of Issue 


Self-esteem theory yielded the predictions 
that the forewarning of attack would produce 
more anticipatory lowering on the emotional 
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than on the technical issues, but that the 
attacks themselves would produce more of 
an effect on the technical than on the emo- 
tional issues. The results in the no-attack 
conditions confirm the prediction about the | 
effect of the warning. After the warning (as 
can be seen in the “warning, no-attack” row 
of Table 1) the belief levels on the emotional | 
issues go down to 8.68; while on the technical 
issues they go down only to 9.52, the differ- 
ence between these two levels being signifi- 
cant at the .01 level. This difference in post- 
warning levels cannot be attributed to initial 
difference in belief levels for the two types of 
issues. The actual change from before to after 
the warnings is .73 point for the emotional 
issues (p< .05), while the corresponding 
change for the technical issues is a statisti“ 
cally trivial .15 point. $ 

As predicted, the direction of this effect is 
reversed as regards the impact of the actual 
attacks, The most appropriate data for test 
ing this effect is found in the “no-warning, 
attack” row of Table 1 (though the slightly 
less-relevant “warning, attack” rows also give 
clear results, as can be seen in the table). The 
attacks reduce the belief levels on the tech 
nical issues to 7.25 while reducing the belief 
levels in the emotional issues only to 8.30, a 
difference significant at the .01 level. Because 
of this predicted reversal between effect of 
warning and effect of attack, the warning 
reduces the emotional beliefs 66% of the dis- 
tance that the attacks will eventually reduce 
them; while with the technical beliefs, the 
warnings reduced the distance they &%. 
changed by the attacks by only 6%, the dif- 
ference in absolute magnitudes being signifi- 
cant at the .001 level. Obviously, as the self- 
esteem formulation would indicate, mut 
more use was made of the warnings to reduce 
the apparent amount of yielding on the emo 
tional than on the technical issues. 


Interactions with Type of Source 


According to self-esteem theory, both the 
warning and the actual attacks should be 
more effective in reducing the beliefs wher 
the attacks are attributed in advance to 1% 
spectable rather than to disreputable sources: 
The predicted source differential was not coh 
firmed with respect to warning, as Can 
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ane from the “warning, no-attack” row 
of Table 1. The belief level after forewarning 
‘of attacks by reputable sources was 9.13 
which is only trivially different from the 9.07 
, level in the disreputable source condition. 
T This was the only prediction from self-esteem 
“theory to fail to receive confirmation. This 
“exception will be discussed further below. 
| The prediction about the effect of this 
“source variable on the persuasive impact of 
“actual attacks can be tested most appropri- 
ately in terms of the data in the “no-warn- 
“ing, attack” row of Table 1 (though it can be 
‘seen that the results in the slightly-less-rele- 
vant “warning, attack” row are almost identi- 
dal). When the attacks were attributed to 
“high-credible sources they reduced the beliefs 
+ to 7.35 and where attributed to low-credibil- 
‘ity sources, only to 8.20, a difference in the 
predicted direction, significant at the .05 level. 
(While residual within-subject variance was 
used to test the significance of effects involv- 
ing the previously discussed within-subject 
“variables, we here, to test the source-credi- 
bility effects, employed the larger between- 
“subjects residual variance as the error term.) 
As can be seen in the “no-attack” rows of 
Table 1, these postattack levels represent 
changes from almost identical preattack lev- 
els in the two credibility conditions, tending 
to rule out their being attributable to chance 
A 


differences in the initial levels. 
p 


DISCUSSION 


Several points about the present study call 
for further discussion, We shall comment, in 
turn, upon a revision in our initial theoretical 
formulations that is suggested by the failure 
to confirm one of the source-credibility pre- 
dictions; upon some peculiarities in the pres- 
ent experimental design that may have had 
an effect on the outcome; and finally upon 
some related earlier findings and alternative 
theoretical formulations relevant to the pres- 
ent results, 


Source Credibility and the Effect of Warning 


In retrospect, it seems possible that the 
reason our  source-credibility manipulation 
had no impact on the effectiveness of the 
warning was that we failed to consider suf- 
ficiently analytically the various components 
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of source credibility. Specifically, we made 
our theoretical derivation in terms of the be- 
liever’s expectation of the persuasiveness of 
the source, but our actual manipulation af- 
fected also, and perhaps especially, the be- 
liever’s perception of the respectability of the 
source, What we would predict on the basis 
of self-esteem theory is quite different for 
these two components, perceived effectiveness 
and perceived respectability. 

In practice the two components are prob- 
ably highly correlated, in that a person prob- 
ably expects typically that he will be more 
influenced by respectable than by disrepu- 
table sources; hence, there is some justifica- 
tion for our attempt to manipulate perceived 
effectiveness by varying the predesignated 
respectability of the source. Nevertheless, 
the source inductions we used here were un- 
fortunate for two reasons, First, our descrip- 
tions stressed the persuasive effectiveness of 
all our sources, even the disreputable ones. 
The latter were represented as unsavory crim- 
inals, it is true, but were also called ‘skill- 
ful,” and “highly intelligent and specialists 
in the areas they were discussing,” and 
“highly motivated to get their views across.” 
The source descriptions were phrased in this 
manner in order that the warnings might 
arouse, as required for our main prediction, 
an appreciable threat to the belief. We may, 
in accomplishing this aim, have negated the 
differential in anticipated effectiveness of the 
two types of sources. A second theoretical 
problem arising from our “respectability” 
manipulation is that it introduces another 
consideration suggested by self-esteem theory, 
namely, the question of how embarrassing it 
would be to yield to a given source. Obvi- 
ously, it should be more painful to self-esteem 
to admit one had been persuaded a given 
amount by an unsavory criminal than by a 
scholarly presidential advisor. 

Hence our specific manipulation of source 
credibility may inadvertently have set two 
considerations raised by self-esteem theory in 
opposition to one another, On the basis of 
their perceived effectiveness, we would pre- 
dict more impact of warning with the positive 
than negative sources. On the other hand, 
equating for expected persuasiveness, we 
would predict less anticipatory change with 
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the positive source, since any anticipated 
change due to the attack would be less ego 
dystonic if produced by the positive rather 
than by the disreputable sources, In the 
present study, our failure to find any differ- 
ence due to the source manipulation may have 
resulted from the mutual cancellation of these 
two factors, A study could be designed to test 
this revision by orthogonally varying these 
two components of source valence: high- 
versus low-respectability and high- versus 
low-anticipated persuasive effectiveness. Our 
revised self-esteem formulation would predict 
that the effect of the warning would be in- 
versely related to the former and directly 
related to the latter.* 


Possible Effects of the Present Experimental 
Design 


We have already indicated in Footnote 2 
why we used the less sensitive after-only ex- 
perimental design, comparing the end-of-ex- 
periment belief levels on forewarned versus 
nonforewarned issues. The before-after design 
might have eliminated some individual dif- 
ference and issues variance from the error 
term, but it would have had the undesirable 
effect of taking away some of the believer’s 
saving ambiguity about his initial belief lev- 
els, and ambiguity is required to make psy- 
chologically feasible the use of the anticipa- 
tory-lowering strategy as a self-esteem saving 
device. 

A second aspect of the present design that 
calls for some comment is our introduction of 
the warning manipulation as a within-subject 
variable, All subjects served in a forewarning 
condition on some issues, and in a no-fore- 
warning condition on others. It is possible 
that our warning all subjects of forthcoming 
attacks on at least some issues would tend to 
produce a generalized wariness in the subjects 
that would to some extent wipe out the effect 
of the specific warning variable on one issue 


3The study by Sears et al. (1964) mentioned in 
Footnote 2 reports one finding relevant to this 
issue. Their uncommitted subjects responded with 
less anticipatory change when forewarned of ex- 
posure to a two-sided discussion rather than to a 
debate. According to Sears et al. this difference 
occurred because the believers perceived the debates 
as more likely to be effective than the two-sided 
discussions, 
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versus another. We considered the alte 
design of introducing forewarning as 
tween-subject manipulation with some 
jects in a forewarning condition and 0 
in a no-forewarning condition. However, 
felt that too much ambiguity is introdu 
by the need to interpret the induction u 
for the no-forewarning subjects (to bala 
the forewarning induction used for the of 
subjects) as in the Allyn and Festin 
(1961) study, The difference between 1 
such inductions tends to be open to al 
tive interpretations, as Festinger has him 
recently admitted (Festinger & Maccol 
1964). Hence, we decided to risk the los 
sensitivity entailed by the within-subject 
nipulation, in return for more clear-cut 
parability of the warning and no-wal 
conditions, It may be that the lack of ef 
of warning on the persuasive impact of 
attacks is due to this design decision. AS 
as self-esteem theory is concerned, no 
effect is predicted, but it is predicted by otht 
theorists, Allyn and Festinger (1961) 
Freedman and Sears (1965) have been abie! 
derive such a prediction from disson 
theory, though Festinger and Maccoby 
presented an alternative distraction exp 
tion, The “overheard” versus intent-to7 
suade study of Walster and Festinger (1 
makes a similar analysis, but again an 
native explanation is possible since 
sources always argued in a “desirable” 
tion. The above studies found some sup 
for the prediction, but it was usually co! 
to highly involved believers. A theory 
forewarning lowers the credibility of thi 
tacking source (Hovland & Mandell, 1 
likewise could predict an attack-miti 
effect of forewarning. 


Relations to Other Theories and Results 


When we originally commented on the poi 
sibly belief-weakening effect of forew: 
of attack (McGuire & Papageorgis, 1962) 
accounted for it in terms of dissonance th 
We still feel dissonance theory could acct 
for the present results. The two formulatio 
dissonance theory and self-esteem thi 
have rather similar logical status, each 
a guiding principle of heuristic value 14 
than a formalized system. The basic m 
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of dissonance theory is that people tend to 


behave in such ways as will minimize discrep- 


ancies among their cognitions; the basic no- 
tion of self-esteem theory is that people be- 
have so as to maximize their self-esteem, Each 
could without violence be made a special case 
of the other: thus within dissonance theory, 
behaving to maximize self-esteem could be 


` Jooked upon as a minimizing of discrepancy 


between the real and ideal-self; alternatively, 
within self-esteem theory, behaving so as to 
reduce cognitive discrepancies could be re- 
garded as maximizing one’s concept of one- 
self as internally consistent. Worchel and 
McCormick (1963) find the two theories 
useful in conjunction. 

Dissonance theory is, of course, worked out 


‘in much more detail than is self-esteem 


theory: there is certainly nothing in the lat- 
ter so thoroughly developed as the trivium of 
“importance, commitment, and volition,” as 
elaborated by Brehm and Cohen (1962) to 
supplement the basic notion of discrepancy. 
Still the notion of self-esteem has not been 
neglected either, and the raw materials for its 
development lie conveniently at hand in 
Wylie’s (1961) review. Beginnings of an 
opposition of a self-esteem theory to disso- 
nance theory are found in the work of 
Deutsch (Deutsch, 1961; Deutsch, Krauss, 
& Rosenau, 1962; Deutsch & Solomon, 1959). 
We do not, by any means, wish to suggest 
that self-esteem theory should be further de- 
veloped to the level of dissonance theory. One 
such theory is quite enough. 

., We avoided using dissonance theory in de- 
riving the present prediction because some of 
its proponents. (Brehm & Cohen, 1962; Fes- 
tinger, 1964) have chosen to oppose disso- 
nance with conflict theory by saying that the 
former requires that the cognitive change 
occur after the decision, while conflict theory 
(Janis, 1959a, 1959b) requires that the cog- 
nitive change be predecisional. We regard 
this distinction as a rather uninteresting one 
and quite difficult to apply to the present 
situation, Brehm and Cohen themselves point 


„out that this difficulty of application will 


oe arise. However, if the distinction be a 
P testable one (as is indicated by some 
nteresting studies in Festinger, 1964) it 


> st : : * 
eemed somewhat inappropriate to use disso- 
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nance theory to predict an “anticipatory” 
change. One could, of course, attribute the 
postwarning change to the discrepancy be- 
tween the believer’s initial opinion and the 
information contained in the warning that an 
expert source holds a very different opinion, 
rather than an anticipatory change to avoid 
the coming dissonance that would otherwise 
result from the discrepant information to be 
contained in the attack. According to this 
reinterpretation, there should be as much 
belief lowering if we merely mention to the 
believer that the expert source holds the 
discrepant opinion, as if we forewarn him 
that this source is moreover about to present 
an attack on the belief. Dissonance theory 
predicts an equal change in the two condi- 
tions and self-esteem theory predicts more 
change with the additional forewarning of 
attack, The point remains to be tested. 

A further reason why we avoided using 
dissonance theory in this study was that it 
seemed more equivocal than self-esteem 
theory as regards its predictions in the pres- 
ent situation (though, as we have seen in our 
discussion of the lack of source effect, self- 
esteem theory is not free of equivocality 
either). We were unable to decide, for ex- 
ample, if dissonance theory would predict 
that the effect of warning would be greater 
with emotional beliefs or with technical ones. 
On the one hand, beliefs on emotional issues 
would tend to be tied in with more cognitions 
and hence any change would introduce more 
discrepancies and higher dissonance (Fes- 
tinger, 1957) than a change on the cognitively 
more isolated technical issues. Hence, perhaps 
we should predict that with the more cogni- 
tively articulated emotional issues, there 
would be less postwarning lowering of beliefs 
and perhaps even a postwarning strengthen- 
ing of the belief to bolster it against the com- 
ing attack (Abelson, 1959; Cohen, 1962). 
On the other hand, perhaps dissonance theory 
implies the reverse, because the emotional 
issues would tend to be more important and 
involving to the believers than would the 
technical issues. It would, therefore, be more 
dissonant to hold opinions discrepant from 
those of significant other people on such is- 
sues (Zimbardo, 1960) and hence the fore- 
warning would have more pelief-lowering im- 
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pact with these emotional issues. We were pears that during the prehistory of a 
unable to decide which of these two opposing even equivocal theories offer heuristic 
processes suggested by dissonance theory F 
would have the greater influence under the 
present experimental conditions. AseLsoN, R. P. Modes of resolution of 
Nor was it clear what predictions would lemmas. Journal of Conflict Resolution, 
follow from dissonance theory about the effect aoe es _ Lee 
of forewarning on the persuasive impact of o taiiva communication 
the attacks themselves under our experi- of Abnormal and Social Psychology, 196%) 
mental conditions, Both theory and findings 35-40. 
tend to be ambiguous on this point, as de- Bremm, J., & Comen, A. R. Explorations in c 
scribed earlier in this section. Dissonance five dissonance. New York: Wiley, 1962. 
theory would also readily yield predictions Comex, A. R. A dissonance analysis of the boom 
of interactions between source respectability sas, effect. Journal of Personality, 1AA 
and the emotional versus technical issue Deurscm, M. The interpretation of pra 
variable in determining the impact of the criticism as a function of their social 
warnings and of the attack. However, as can Journal of Abnormal and Social Psychology, 
be seen in the various rows of Table 1, there __ 6% 391-400. 


is no sign of any such interaction in the ob- Devrscy, M, Krauss, R. M, & Rosewavy 
Dissonance or defensiveness? Journal of 


tained data, : ality, 1962, 30, 16-28. 

We have already pointed out that self- Deurscm, M., & Sotomon, L. Reactions to 
esteem theory is itself equivocal about what ‘tions by others as influenced by self-eval 
effect our source manipulation should have on p E ERA Bem te) ais 
the impact of the warning. It must be ad- eR Lie Aeon “aa eere E 

itted further that it is al hat equi eas Conftcs, decison 
mitte n is also somew: at equiv- Fesrivcer, L. (Ed.) Conflict, decision and 
ocal regarding the other variable, emotional "ance. Stanford: Stanford Univer. Press, W 


versus technical issues. We predicted (and Festrncer, L., & Maccosy, N. On resistance tot 
found) more anticipator: i R suasive communications. Journal of Abnormal 
) ipatory lowering after fore- Social Psychology, 1964, 68, 359-366. 


warning on the emotional issues, because i 

hatan ) Freeman, J. L., & Sears, D. O. Warning, 
yielding to the attack on these would be more tion, and resistance to influence. Journal 
damaging to the self-esteem. But on the other sonality and Social Psychology, 1968, 1, 
hand, one’s initial, prewarning belief might Hovtann, C. I, & Manvext, W. An expt 


tend to be clearer on these moi 3 comparison of conclusion-drawing by th 
re thought municator and by the audience. Journal 
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EFFECT OF CHILD-REARING ATTITUDES AND 
BEHAVIOR ON CHILDREN’S RESPONSES TO 
HYPOTHETICAL SOCIAL SITUATIONS * 


ROBERT S. WYER, Jr? 


University of Iowa 


35 preschool children were asked to respond verbally to hypothetical social 
situations by selecting for each situation an outcome that was similar to 
what would happen to them. Decision time in making these selections corre- 
lated positively with teachers’ ratings of social inhibition in these children and 
was therefore interpreted as a measure of uncertainty in social situations, The 
assumed attractiveness of each child’s selections was also recorded. Parents of 
these children were administered questionnaires that assessed child-rearing 
attitudes and behaviors. Indexes of social uncertainty were positively corre- 
lated with (a) the magnitude of the absolute difference between parents in their 
acknowledged child-rearing attitudes and behavior and (b) the degree to which 
the mother exceeded the father in warm personal contact with the child. Scores 
by both parents on factors indicating lack of interest in the child related posi- 
tively to the frequency with which attractive outcomes to social situations were 
selected by children, while scores by mothers on factors indicating expression 
of interest and love and encouragement of contact related negatively to the 
frequency with which attractive outcomes were selected. 


An individual may manifest uncertainty in 
a given situation if he feels unable to predict 
future events in which he is involved. At 
least two general classes of such situations 
are of interest: social situations, in which an 
individual is unable to predict the manner in 
which he will be responded to by other per- 
sons and nonsocial situations that call for prob- 
lem solving or motor performance of the 
individual which has not previously been re- 
quired of him, A similar distinction has been 
made by Ausubel (1958) in analyzing execu- 
tive (instrumental) and volitional (social) 
dependence. In neither of these two types of 
situations would the individual’s previous ex- 
perience have led to the development of clear 
expectancies for the future. However, since 
uncertainty is hypothetically a function of 
the previous experience of the individual, 
uncertainty in social and nonsocial situations 
need not necessarily be correlated. One hy- 
pothetical manifestation of uncertainty is 
relatively long latency in responding to situ- 


1Thanks are extended to Malcolm Bransky and 
John Love for their assistance in planning and col- 
lecting data for this study, and to Gordon Cantor 
for his helpful comments regarding preparation of 
the paper. 

2Now at the University of Illinois at Chicago 
Circle, 


ations producing the uncertainty. The re- 
search reported here was in part an attempt 
to determine some of the antecedent condi- 
tions that produce uncertainty in children 
when responding to general social situations. 

A child’s expectations about how other 
persons outside the home will respond to him 
may derive from his previous experience 
with his parents. Differences between parents 
in their responses to their child’s past be- 
havior may prevent the child from developing 
a generalized expectancy for how other per 
sons will react toward him in the future. 
These differences may therefore create 4 
temporary .cautiousness on the part of the 
child in responding to new interpersonal 
situations. Such cautiousness, which may be 
reflected by relatively long response laten- 
cies, should be manifested until sufficient 
experience has been gained to develop Spe- 
cific expectancies for others’ behavior in the 
situation; once this behavior has become 
fairly predictable, uncertainty should de- 
crease, 

Differences between persons in their Te 
sponses to another’s behavior should not be 
confused with inconsistency in the response 
of a single individual to this behavior. If ê 
parent does not respond consistently to m 
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child across similar situations, the latter is 
unable to predict effectively the consequences 
of his behavior in the home environment even 
after substantial experience has been gained. 
The child may therefore come to perceive 
that his physical or emotional well-being is 
in continual danger. Delineation of the effects 
of inconsistent parental behavior on social 
uncertainty is beyond the scope of this study; 
these effects may differ substantially from 
those of interparent differences suggested 
above. 

Previous experience in social relationships 
may affect not only the decisiveness of re- 
sponses to socially relevant stimuli but also 
the content of these responses. Persons’ ex- 
pectancies of how others respond to them in 
hypothetical social situations outside the 
home may in part be a reflection of how oth- 
ers have responded to them in the past. To 
this extent, a positive relationship should 
occur between the frequency with which 
gratifying social contacts have occurred in 
the past and the frequency with which they 
ate expected in the future. On the other 
hand, some interpretations of hypothetical 
experiences may contain aspects that as a 
result of actual past experiences have come to 
elicit anxiety or avoidance tendencies. These 
interpretations would tend to be avoided in 
favor of less aversive alternatives. More sim- 
ply, the frequency with which potentially 
gratifying or attractive outcomes to a hypo- 
thetical situation are selected may increase 
to the extent that aspects of less attractive 
: outcomes have become cues for avoidance re- 
actions, Similar reasoning has been outlined 
by Brown (1961) in his discussion of learned 
Tesponses as sources of drive. Therefore, if 
children whose parents avoid contact with 
them or reject them have relatively stronger 
generalized avoidance tendencies associated 
with unpleasant interpersonal interactions, 
these children may tend more to interpret 
hypothetical social situations outside the home 
in a way that would be gratifying to them. 

: In this study, the following hypothesis was 
ested: children’s uncertainty in the domain 
of social relationships will be related posi- 
tively to the difference between parents in 


their attitudes and behavior toward their 
child, 
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In addition, an investigation was made of 
the relationships between relevant back- 
ground factors and the degree to which ob- 
jectively gratifying or attractive interpreta- 
tions were made of hypothetical social inter- 
action situations, A specific hypothesis con- 
cerning the nature of this relationship was 
felt to be unwarranted. 


METHOD 
Subjects 


The parents of 40 children enrolled in two classes 
of the Institute of Child Behavior and Develop- 
ment Preschool were sent questionnaires assessing 
child-rearing attitudes and behavior. From these 
children, 18 females and 17 males between the ages 
of 3.5-5.5 years of age whose parents had responded 
to the questionnaires were selected for use as sub- 
jects. 


Assessment of Social Variables 


Procedure. Each subject was exposed to two 
experimental sessions designed to assess variables 
pertaining to his perception of how other persons 
tend to respond to him in social situations. Six 
hypothetical social interaction situations were de- 
vised. In each situation three alternative outcomes 
were provided. These outcomes were selected so 
that one would be very pleasant or gratifying if it 
occurred in actual experience, a second very un- 
pleasant, and a third between the others along this 
dimension. The three outcomes of each situation 
were depicted in cartoon drawings which were used 
primarily to maintain the subjects’ interest in the 
experiment. The experiment was administered in 
two sessions. In each session, each of the six social 
situations was presented three times in the form of 
a story. Each time, the subject was requested to 
choose from a different two of the three possible 
outcomes the one which would be like what 
would happen to him, Following his selection, the 
subject was shown a cartoon drawing depicting 
the outcome he had chosen. 

The following initial instructions were given: 


I am going to tell some stories and I want you 
to help me. When I tell you a story, I want you 
to tell me what would happen to you if you 
were the boy [girl] the story is about, After 
picture of the 


truck? 


This story was used to familiarize the subject 
with the procedures and was not scored. Following 
the subject’s choice, the experimenter showed him 
a picture of a boy (girl) engaged in the activity in 
question. Of the six social situations and outcome: 
—very pleasant (+), very unpleasant (—), ©! 
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moderate (0)—used to assess experimental varia- 
bles, the following two are Presented as illustra- 
tions: 


1. Let's pretend you ask someone to play with 
his [her] toy, 
a. Would he [she] give you the toy? (+) 
b. Would he [she] not listen to you? (0) 
c. Would he [she] Push you away and not 
let you play? (—) 
2. Let’s pretend someone has a bag of candy and 
you ask him [her] for some. 
a. Would he [she] give you a lot of candy 
(+) 


b. Would he [she] give you a piece of candy 
and keep the rest for himself [herself]? 
(0) 

c. Would he [she] keep you from having any 
candy? (—) 


Wt SEO, 6(=0), 1), 2(+;-), 3+0). 


ond, One story, which described the interaction be- 
tween Parent and child, Was not included in analyses 
since responses to this story were irrelevant to the 


+), moderate (0), and hi y unattractive (— S 
spectively, and then summing the values assigned 


to each subject’s selections over the 30 respon 
made during the two experimental sessions, 
possible ambiguities in the Perceived attractiven 
of the moderate alternative threw doubt on 
advisability of considering selections on trials 
which it was involved, a second measure of thi 
characteristic (As) was the number of times hat 
the subject selected a very attractive (+) alterna. 
tive over a very unattractive (—) one, In the se 
ond score, only 10 of the subject’s 30 responses if 
the two sessions were considered. 
Social uncertainty: Teachers’ evaluations. A § 
ond measure of the subjects’ uncertainty in social 
situations was obtained by asking preschool teach. 


defined as follows: 


1. Completely uninhibited in social play; ter 
to plunge into group situation and become i 
mediately involved, 

4. Average. Cautiousness dependent upon fi 
miliarity of other children in group; will usually 
Participate if asked. q 

7. Extremely cautious in joining into group 
activity despite desire to do so; tends to remaif 
on the periphery of group as an onlooker, 


Interjudge reliability of these ratings was 74 fo 
the group of subjects selected from one preschoi 
class and .75 for the group selected from the sec 


were assessed through use of a two-part ques 
naire, The first Part consisted of 63 items from 
Parental Attitude Research Instrument 


oped by Thurstone (1947, 1954)? Eight rotate 

factors were extracted, several of which differed 

nature from the PARI scales described by Scha 
ell. 


d (eg, “I play with my child,” “I am strict an 
firm with my child,” etc.), The remaining itemi 
assessed parents’ interpretations of their child’s 
Sponses toward them (eg, “My child is easy 
manage,” “My child resents the discipline I 
a 


3 The analysis has been Programed for the : 
7040 by Harold Bechtoldt at the State Univers 
of Towa, 

_A complete description of the test and facto 
extracted from it may be obtained from the autho 
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him,” etc.). All items were responded to using a 
6-point scale. Factor analysis of these items also 
' resulted in eight rotated factors (see Footnote 3). 
Both parents responded individually to the same 
set of questions; the order of the questions on the 
smother and father forms was made to differ in 
order to discourage comparison of responses by the 
parents. Two measures of the degree of difference 
between parents’ responses toward the child were 
used, Differences between parents’ general attitudes 
toward child rearing were assessed by summing the 
absolute differences between parents’ responses to 
the 63 items taken from the PARI. The difference 
between parents’ acknowledged behavior toward the 
child was assessed by summing the absolute dif- 
ferences between parents’ responses to the 34 items 
of the Radke questionnaire pertaining to this be- 
havior. The sum of the absolute differences between 
parents’ responses to the 20 items which indicated 
parents’ interpretations of their child’s activity were 
also calculated; however, this sum was not expected 
~ to enter into hypothesized relationships between 
interparent differences and the child’s social un- 
certainty, 

To assess background factors possibly related to 
Ai and As, scores were computed on three factors 
which were assumed to reflect the degree to which 
gratifying parent-child relationships existed in the 
home environment, and on two additional factors 
which, were assumed to indicate the degree to which 
rigid standards for behavior were enforced. ‘These 
factors and two of the principle items loading on 
them are: 


A, Lack of interest, rejection: 
1, Parents often feel they can’t stand their 
children a moment longer. 
2, nine children is often a nerve-wracking 
job. 
B, Expression of interest and love: 
1, A parent should express openly his love and 
tender feelings for his children. 
2. Children are better behaved when parents 
show an interest in their affairs, 
C. Encouragement of contact, acceptance: 
1. I prefer a quiet child to a chatterbox (nega- 
tive loading). 
2. I play with my child. 
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D. Enforcement of rules; 
1. My child is expected to be neat and tidy 
in the home. 
2. I expect unquestioning obedience from my 
child 


E. Authoritarian control: 
1. Children should be allowed to criticize rules 
their parents make (negative loading). 
2. Children should be taught to abide by firm 
rules, 


Factor scores for parents of each subject were 
computed by converting the responses to the prin- 
ciple items loading on each factor to standard scores 
and summing them after taking into account the 
signs of the factor loadings of the items involved. 


RESULTS 


Preliminary analyses of the subjects’ re- 
sponses to hypothetical social situations indi- 
cated that some subjects tended to respond 
on the basis of order of presentation alone. 
Subjects manifesting such a response set were 
eliminated. The criterion for elimination was 
either a proportion of responses to the second 
alternative given which was significantly 
(p< .05) greater or less than chance, or, a 
greater or lesser number of runs of responses 
to the same (first or second) alternative than 
would be expected by chance using the Wald- 
Wolfowitz run test (Siegel, 1956). Five sub- 
jects showed response set during the first 
session and were eliminated from all analy- 
ses. Six additional subjects showed response 
set during the second session and were elimi- 
nated from analyses pertaining to A; and Ap. 

The means, ranges, standard deviations, 
and intercorrelations of the experimental 
measures of response time and the attractive- 
ness of selections are shown in Table 1. Cor- 
relations of these variables with social un- 
certainty as assessed by means of teachers’ 
ratings are also shown, The correlation be- 


TABLE 1 
MEANS AND INTERCORRELATIONS OF EXPERIMENTAL VARIABLES 


I EUS BT) 


(n = 24) 
. N M SD Range 
if itic 
A B EAA ES 
A. Respons 
time | 30 1.79 52 2.0 = —324 | —.256, 6318" 

ek 24 3462 | 840 | 280 wi {339 016 

ae 24 6.58 2.77 9.0 — i 

àn = 30, 


*p <.01, 
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TABLE 2 


CORRELATIONS BETWEEN INTERPARENT DIFFERENCES 
AND SOCIAL UNCERTAINTY 


Response time | Teachers’ ratings 


Interparent differences ages ty eb 
Attitudes .350* 34 
Behavior toward child .330* 389 
Interpretation of 

child’s behavior 113 303 


* p < .05, one-tailed, 


tween teachers’ ratings of social inhibition 
and response time suggests that the latter is 
a reasonable experimental index of uncer- 
tainty in social situations. These measures 
were both unrelated to A; and Ap. 

Correlations between the three measures of 
interparent differences and the two measures 
of social uncertainty are shown in Table 2. 
The hypothesis that the discrepancy between 
parents’ acknowledged child-rearing attitudes 
and behavior toward the child would relate 
to social uncertainty was supported in all 
four cases in which it was tested. As ex- 
pected, parental differences in the interpreta- 
tion of their child’s behavior did not relate 
to the experimental measure of social uncer- 
tainty; however, it approached significance 
(p < .10, two-tailed) in its relationship to 
teachers’ ratings of social uncertainty. 

Correlations of A; and Ag with the five 
parental attitude and behavior factors de- 
scribed above are shown in Table 3. Measure 
As, in which responses involving moderate 
alternatives were not considered, appeared to 
be related somewhat more consistently to the 
parental factors investigated than Ay. Cor- 
relations between attractiveness of selections 
and the three factors assumed to be related 
to it (Factors A, B, and C) show that in 
all cases subjects whose mothers expressed 
lack of love or interest in their child tended 
to report other children as responding to 
them in a more gratifying manner, This rela- 
tionship was less clear where fathers were con- 
cerned and in one case was in opposition to 
the tendency suggested by other results, A 
significant negative relationship occurred be- 
tween Az and the degree to which mothers 
acknowledged strict enforcement of rules 
within the home environment, 
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Discussion 


Although the hypothesized relationships 
between indicators of social uncertainty vari. 
ables and interparent differences in attitudes 
and behavior toward their child received 
consistent support, additional questions were 
raised, To determine whether the hypothe- 
sized relationship was a general one or was 
confined to particular aspects of parent-child 
interaction, absolute differences between par- 
ents’ responses were summed separately for 
each factor extracted from the factor analy- 
ses described earlier, The correlations be- 
tween response time and parental differences 
were in the predicted direction on 12 of 13 
factors which reflected parents’ attitudes and 
behaviors toward their child (p < .01). On 
the other hand, only two of these correlations 
taken separately reached significance (Table 
4). The contention that the effect is specific 
to the behavior of the parent toward the 
child was supported by the low correlation 
between interparent differences in interpret- 
ing their children’s behavior and social 
uncertainty, \ 

A second question arose as to whether 
social uncertainty was related to between- 
parent differences per se without regard to 
their direction or whether the relationship 


TABLE 3 


CORRELATIONS BETWEEN BACKGROUND Factors AND 
SELECTION OF ATTRACTIVE OUTCOMES TO 
HYPOTHETICAL SOCIAL SITUATIONS 


(n = 24) 
Attractiveness of selection 
Factor — 
Ai ag 
a 
A. Lack of interest, rejection 
Mother À 490" | 49 
6 aes .448* 502 
- Expression of interest, love 
Mother —.602** | — a" 
Father —.124 | =.269 
C. Acceptance, encourage- 
ment of contact 1* 
Mother —.259 | —Alty 
Father 403" | 403 
D. Enforcement of rules ee 
Mother 1389 | eT 
Father —.277 -3 
E. Authoritarian control 3 
Mother —.282 am ae 
Father =.253 |2 


* p <.05, two-tailed. 
** p <.01, two-tailed, 


: 
| 
l 
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TABLE 4 


CORRELATIONS OF RESPONSE TIME WITH PARENTAL DIFFERENCES AND PARENTS’ SCORES ON 
SELECTED Factors (n = 24) 


j Interparent differences Individual factor scores 
‘aly 2 Signed 
Factor difference 
Absolute | (mother's | Mother | Father | Multiple R* 
father's 
score) 
1, Lack of interest, rejection —.128 —.551*** | —.564*** | —.067 .610* 
2. Acceptance, encouragement of contact .328* A67*** .239 — 404" .604* 
3. Avoidance of communication 310° | —.413** | —.217 384" .540* 
4, Encouragement of communication .369* .231 .149 —.360 -398 
P 5. Accentuation of parental status .135 | —.406** | —.264 .212 .478 
4 ó. Demand for achievement .006 .392** 379** | —.012 400 
a Ros = „498. 


*p <.05, one-tailed. 
**+ p <.05, two-tailed. 
we >) <.01, two-tailed. 


"was the result of more specifiable unidirec- 
‘tional differences between parents’ attitudes 
and behavior. To clarify this issue, numerical 
(signed) differences between the scores of 
mothers and fathers on relevant factors were 
computed. Correlations between signed factor 
differences and response time were higher in 
11 of 13 cases (p < .01) than corresponding 
relationships involving absolute differences 
between parents’ responses. Five of these 
correlations were significant (Table 4). 
These results suggest that the signed differ- 
ences between parents’ child-rearing attitudes 
are generally better predictors of uncertainty 
as measured in this study than are absolute 
differences, Specifically, uncertainty may be 


most pronounced among children from homes 
in which the mother greatly exceeds the 
father in the degree to which warm personal 
contact with the child is encouraged. An 
effective predictor of social uncertainty may 
also be determined by computing multiple 
correlations based upon the mothers’ and 
fathers’ individual factor scores on these 
variables (Table 4). Therefore, although the 
major hypothesis of this study was sup- 
ported, these additional findings throw some 
doubt as to the adequacy of the rationale on 
which the hypothesis was based. 

In this regard, correlations between un- 
certainty and individual parent factor scores 
suggest that either a high degree of avoid- 


TABLE 5 
A MEAN ATTRACTIVENESS OF SELECTIONS AT THREE LeveLs or Factors A AND B 
Level 
Factor — F F MSe 
Middl Hi 
one's) (n= 8) (n = 8) 
A. Lack of interest, rejection 
mar 39.9 3.14 55.5 
1, 32.1 i 3. ! 

As 3538 5.50 8.00 2:79 6.28 

a 40.8 3.66" 57.1 

4 34.5 f ; 57. 
B As 25 6.75 8.25 432" 5.67 
» Expression of interest, love 
Mother 4 

` Aa 40.6 30.9 31.5 4.70* 50.6 

ET 8.25 5.38 5.25 3.99 5.76 
T ather 

A 35.6 34.5 31 74.8 

A eat 6.88 5.75 119 757 
*p <.05, 
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ance of contact and communication with the 
child by the father or a high degree of. 
interest and acceptance of the child by the- 
mother tends to create uncertainy in social 
situations, Social uncertainty may in part be 
a consequence of the failure to break the 
initial dependency ties which normally exist 
between a child and his mother. These ties 
may prevent effective social relationships 
from being formed outside the home. The 
strength and duration of these bonds should 
increase if the mother manifests excessive 
contact with and general interest in the child. 
On the other hand, a rejecting father may 
also prevent the child from severing initial 
maternal dependency ties by forcing him to 
depend exclusively upon the mother for satis- 
faction of his needs. If this reasoning is 
correct, both of these factors could contribute 
to social uncertainty, as results suggest. 
Further research is required to determine the 
adequacy of these interpretations. 

Two additional questions regarding the 
generality of these findings cannot be an- 
swered on the basis of data reported here. 
One concerns whether uncertainty as as- 
sessed in this study (response latency) is a 
domain specific characteristic or whether it 
generalizes into domains other than those 
involving social relationships. The second 
concerns the extent to which uncertainty 
manifested in different domains is a conse- 
quence of similar antecedent conditions, As 
noted previously, uncertainty involving inter- 
personal relationships is not necessarily re- 
lated to uncertainty in problem-solving situa- 
tions or situations involving physical and 
motor skills. Research dealing specifically 
with these issues is currently being per- 
formed. 

Maternal enforcement of rules (Factor D) 
was negatively related to As (Table 3). This 
suggests that a strict home environment de- 
creases the tendency for children to manifest 
gratifying expectancies for social relations. 
On the other hand, relationships between 
indicators of warm social contacts in the 
home environment and the frequency with 
which children selected attractive alternatives 
were also fairly consistently negative, indi- 
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cating that selections were not based upon 
the similarity between alternatives and pre- 
vious experiences. These results could be due 
either to a tendency for rejected children to 
avoid selecting unattractive alternatives or to 
a tendency for accepted children to select 
unattractive outcomes. To explore nonlineati- 
ties in these relations, the mean acknowl- 
edged attractiveness of selections was calcu- 
lated for subjects whose parents fell into 
the upper, middle, and lower thirds of scores 
on Factors A and B (Table 5), One-way 
analyses of variance were significant with 
regard to Factor A among fathers and Fac- | 
tor B among mothers. Both measures of 

selection attractiveness appeared linearly re- 

lated to Factor A among fathers. However, 

A; and As among subjects whose mothers 

were either high in Factor A or low in Fac- 

tor B were both significantly higher than 

corresponding scores among the residual of 

subjects whose mothers fell into the other 

two levels of these factors (p< .05 in all 

cases), This lends support to the interpreta- 

tion that the selection of attractive outcomes 

to hypothetical social situations may be due 

to avoidance tendencies elicited by less at 

tractive alternatives and furthermore, that 

these tendencies are created by a home en- 

vironment in which attempts by the child to 

interact with the parent are avoided of 

responded to negatively. 
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Some studies of the social behavior of children are used to illustrate 2 
methods for investigating interaction sequences. One method is that of multi- 


variate informational analysis; 


Studies of interaction, whoever the sub- 

“jects and whatever the situations, presume 
communication whereby the participants in 
an interchange influence one another. The 
fact of such influence permeates our daily 
lives, We greet someone and expect and most 
often get a greeting in return; we ask for 
directions and most often get a reasonable 
reply? we shop and expect that money will be 
accepted for our purchases, Our lives as social 
beings are premised on some degree of abil- 
ity to change the behavior and attitudes of 
others by means of our own actions. 
_ Social interaction as an influence or change- 
inducing process involves several features. 
First is the fact that interchanges occur in 
sequences over time. Second, the process of 
change presupposes variation and constraint. 
A third feature may be posed as an issue of 
balance between efficiency and flexibility. 

1. Sequential aspects. Interchanges occur 
sequentially. A sequence may be confined to 
a single act by A followed by a single re- 
sponse by B. More often, it involves a chain 
of interchanges, By means of chaining, mul- 
tiple contingencies may be taken into ac- 
count, and, even for binary events, given a 


ae those who have contributed most to the 

q uh ent of this paper, Richard A. Littman of 

the University of Oregon and Allen Te Dittmann of 

ie gratatory, of Psychology at the National Insti- 

> ther Mental Health deserve very special mention. 
AR major contributors to the work and ideas have 
Renee Arasteh, Donald S. Boomer; Robert 

per, ae and my colleagues, noted later in the pa- 
2N om the Institute for Social Research in Oslo. 

ow at the University of Michigan. 


paper suggests processes of control and organization by which situational and 
personal variables influence social interaction. The methods and concepts are 
thought to be applicable to the study of the temporal chaining of events within 
other groups and within paired relationships. 


long enough chain, any level of complexity 
may be achieved (Ashby, 1958), The adap- 
tive properties of such a chaining process have 
been noted by Bonner (1960). An effort to 
study the process of interchange and the 
chaining of interchanges, rather than single 
behavioral acts, gave rise to the studies dis- 
cussed here. 

2. Variation and constraint. Like chaining, 
variability allows for possibilities by which 
organisms can be sensitive to multiple con- 
tingencies (Beach, 1960; Campbell, 1959). 
Social interaction may be thought of as a 
process of constraint or modification of con- 
straint on initial variability. That is, A’s ac- 
tions may be said to have an effect if they 
modify the distribution of actual responses 
from B’s total potential repertory. A series 
of interchanges may thus be characterized 
by successive transitions over which the be- 
havioral variation among participants under- 
goes modifications of constraint until a point 
of equilibrium is reached. The present studies 
investigate the extent of constraint produced 
by the action of one person on the immedi- 
ately subsequent acts of another, the degree 
and manner in which some situational and 
personal factors constrain social interaction, 
and the nature of sequence chains and transi- 
tions as these appear in interactions of sev- 
eral different groups of children. 

3. Efficiency and flexibility. The greater 
potential for variation and chaining in hu- 
mans as contrasted with lower species con- 
tributes to the greater complexity and lower 
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Fic. 1. Relation between “sent” and “received” acts 
with peers, 


predictability of human behavior. So, even 
at the very simplest levels of perception in 
humans, there appears to be a residual “er- 
ror” or uncertainty between stimulus and 
response (Brunswik, 1944; Garner, 1962), 
and more complex human transactions lack 
the efficiency that characterizes the spinning- 
off of a pattern of behavioral events in, for 
example, an insect mating ritual. However, 
since a system is modifiable only to the ex- 
tent that some uncertainty exists between 
the classes of events, the inefficiency permits 
a flexibility which would not be possible if a 
given event a, always led to a given event b,. 
That such flexibility has evolutionary adap- 
tive advantages is suggested by Russell and 
Russell (1961) who contrast “exploratory” 
with “automatic” systems. The flexibility, 
associated with residual uncertainty between 
classes of events, allows diversity in forms of 
control and organization of behavior. 


CONTROL AND ORGANIZATION 


Human interaction can thus become sensi- 
tive to a variety of constraints. Some of these 
are considered below. More generally, how- 
ever, two major modes of constraint may be 
distinguished, One is that of control; the 

. other is that of organization or patterning. 


Control 


Participants, in interacting, generally exer- 
cise some degree of control on one another’s 
behavior. That is, B’s actions will vary some- 
what according to what A does. For some 
behaviors B’s responses to A tend to be com- 
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plementary. Gellert (1962), for exam 
that a child’s dominance tends to 
when he is paired with the less a 
two peers; similarly, Raush, Ditt: 
Taylor (1959a) suggest a compl 
relation between dominance by one ch 
submission by another. Other kinds 
havior tend to lead to similar ra 
complementary responses. This appe 
be so for affectional behavior, in 
friendly action given is associated 
friendly action received. This latter 
ism is illustrated in Figure 1, which | 
marizes a few results from observati 
studies of several groups (cf. Raush, 
man, & Llewellyn, 1960). The figure 
that the proportion of friendly or hos 
havior which children receive from their p 
reflects rather closely the behavio 
“send” to them. The studies, however, 
demonstrate directly the contingen 
tween the actions of one person and th 
another, nor do they indicate their degre 
contrast, the present report deals wi 
contingencies through which the ani 
action of one child exerts control on thi 
sequent actions of another. i 
Tt is obvious, however, that the an 
behavior of another is not the sole sout 
control on social behavior. For exam] 
potency of situational effects has bet 
onstrated in the ecological studies of 
and his colleagues (Barker, 1963; 4 
Wright, 1954). Similarly, previous mvé 
tions with the subjects of the present 
showed the influence on social beha 
erted by both situational and individ 
tors (Raush, Dittmann, & Taylor, 
Raush et al., 1960). k 
Each of the components mention 
actions of others, the situation, 1m 
factors—illustrate classes of constr 
control. In addition to its direct effects 
control may also have indirect effec 
trol is direct in terms of its immediat : 
ence. For example, as suggested in 
lier studies, a given situation e may 1p 
the probability of social acts a; in Co 
a; similarly, a given individual A 
general tend to favor a; acts. So too, 
in the information transmission ané 
low, antecedent actions have an 1m 
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effect; thus, antecedent act a, may increase 
the probability of responses bı .. . bs and 
decrease the probability of responses bg. -- 
bio: 

, A class of events may, however, also have 
remote or indirect effects. That is, its effects 
may be mediated by intervening events. This 
would be the case if, for example, situation @ 
increased the probability of act a, and t, 
in turn, increased the likelihood of response 
by, In this case, the situation would have a 
direct effect on a and an indirect effect on b. 
Similarly, an initial act may affect not only 
the immediately subsequent response, but, by 
this, may exert influence over more remote 
responses. The transition analyses below illus- 
trate the chain of events, that is, the indirect 
effects, which may flow from the contingen- 
cies set by initial controls. Constraint by di- 
rect control thus refers to modifications of 
initial probabilities of certain actions out of 
the total repertory of possible actions; if the 
influence on initial probabilities then affects 
more remote events, we may speak of indirect 
contral, 


Organization 


When more than two variables are present, 
possibilities for organization arise. Organi- 
zation corresponds to statistical interaction. 
Thus, whereas control would be evidenced if 
Variable Z influenced the particular values 
of Variables X and/or Y, organization would 
be evidenced if Variable Z influenced the re- 
lationship between X and Y. An example of 
such organization may be referred to from 
previous studies (Raush et al., 1960). First, 
as noted above, the nature of the situation in- 
fluenced the social behavior of children. But 
the relation between situational effects and 


„Social behavior was in part a function of a 


pee’ which might be thought of as re- 
TE the psychological status of the group. 
‘hat is, the behavior of normal children was 


highly sensitive to situational variations, 


a the behavior of very disturbed chil- 
en was much less sensitive to these varia- 


‘tions; as the disturbed children improved in 


t * i * 
course of residential treatment, their be- 


. havi A t 4 
vior became more sensitive to situational 


con: i x : 
_ Constraints, The connection between situa- 
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tion and behavior is thus not invariant, but 
rather is affected by a third component. 

Although the present report is concerned 
with the contingencies between the antecedent 
social acts of one child and the subsequent 
acts of another, two other variables are con- 
sidered. One is the nature of the situation— 
for example, whether it is that of a mealtime, 
or a structured game, or an unstructured so- 
cial event; the other is the nature of the 
group—whether it consists of disturbed hy- 
peraggressive children (Redl & Wineman, 
1957) or of matched well-adjusted children. 
We thus have conditions for exploring issues 
of both control and organization. According 
to our previous definitions, control is demon- 
strated if, for example, situation œ increases 
the probability of a acts (direct control), 
which may, in turn, be followed by an in- 
crease in the probability of bı acts (indirect 
control), whereas situation 8 is associated 
with greater likelihood of a acts followed by 
increase in likelihood of bə acts. Organiza- 
tion is demonstrated if in situation œ anteced- 
ent act a, is associated with greater likeli- 
hood of subsequent act b1, whereas in situa- 
tion B antecedent act % yields a higher 
probability of ba acts. Control would thus 
affect the “level” of the variables, whereas 
organization would affect their patterning. 
The information transmission analyses con- 
sider issues of control and organization for 
immediate contingencies between antecedent 
and subsequent actions. The transition analy- 
ses consider issues of control and organiza- 
tion for chains of interactions. 


GENERAL METHODS 


Situations and Subjects 


Each child, plus those with whom he interacted, 
was observed twice, systematically over six situa- 
tions—breakfasts, other mealtimes, structured games, 
unstructured group activities, group instructional 
situations, and snack periods just before bedtimes. 
The children were in one of a number of groups: a 
group of hyperaggressive boys at two stages of resi- 
dential treatment 1.5 years apart (Raush et al., 
1959a, 1959b); groups of normal socially well-ad- 
justed boys, who were matched to serve as controls 
(Raush et al., 1960) ; and more recently a group of 
Norwegian boys, studied in Norway. This latter 
group was matched in age and roughly in intelligence 
and socioeconomic status (lower to middle) with 
the younger American normal group. Like the 
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American controls, the Norwegian children were se- 
lected by teachers as socially well-adjusted and the 
situations for observation were designed to be simi- 
lar. The adult staffs were, however, quite different as 
was the general environment, the American subjects 
having been studied while living on a hospital ward 
for several weeks during the summer, the Norwegian 
sample having been studied in a rather isolated 
farm-resort for a period during the winter.” In all 
cases a group was composed of six boys, The early 
patient group and the Norwegian children were 9-11 
years of age; the other groups were composed of 
10.5-12.5 year olds. Because of some limitations in 
the earlier phases of data collection, together with 
the fact that the present analyses were not planned 
in the original design, the early patient group could 
not be used for all analyses. Apart from this, there 
were 12 observations for each child and 12 for each 
situation, which, with six children, made a total of 
72 observations in each group. 


Data Entry 


Dictated raw protocols of observations were tran- 
scribed and then coded according to the interper- 
sonal behavior categories devised by Freedman, 
Leary, Ossorio, and Coffey (1951), and elaborated 
by Leary (1957). The manner of data gathering and 
the coding scheme have been described in the previ- 
ous papers. In order, however, to analyze sequences, 
a particular method is useful for entering codings. 
The technique is simply to record each interaction 
coding in the natural order in which it occurs, and 
to include in each code a notation of who does what 
to whom—A hits B, B hits A, A runs away, B shouts 
an insult at A, etc. The codes are entered onto lined 
data sheets, The rows of the sheet may be used for 
the successive acts in an observation, and a column 
may be assigned to each participant. For the analy- 
ses presented below, actions were classified dichoto- 
mously, as either friendly or unfriendly. 


Sequencing 


The previous studies employed frequencies irre- 
spective of order, For the present studies it was 
necessary to refer back to both codings and the 
original raw protocols in order to determine se- 
quences, A sequence was defined as occurring with 
any interchange in speech or action between two or 
more participants, The unit for a sequence was simi- 
lar to one used by Barker and Wright (1954). A 
sequence continued so long as there was obvious 
continuity or contingency in the response of one sub- 
Ject to another; a break in the contingencies termi- 
nated a sequence. Farbman (1961) presents some 


2 The author was a Fulbright scholar at the Insti- 
tute for Social Research in Oslo in 1959-60. He and 
his Norwegian colleagues, Claus Fasting and Sissel 
Berggrav are indebted to the American Social Sci- 
ence Research Council for support of the study. We 
are grateful also to Ingrid Jacobsen, Jacob Nord- 
land, Torild Skard, Berit Aas, and the Institute staff 
for their great help. j 
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longer episode units (cf. Barker, 1963; Barker & 
Wright, 1954). For example, within a breakfast ob- 
servation there might occur an episode of several 
minutes in which the children discuss a movie they 
had seen; within the episode may be several se. 
quences, as when A, B, and C talk about a particular 
scene, or when A asks C to pass the milk, or when 
D comments to A about his favorite actors. Although 
such episodes might be treated as supersequences, we 
have not done so here. In the observations, sequences 
of interaction occurred between children and other 
children and between children and adults. This re- 
port focuses on the analyses of sequential interac- 
tions in affectional behavior among children. 


Analytic Methods 


Despite the temporal sequential aspects of behav- 
ior, noted cogently by Barker (1963), formal schemes 
for the analysis of the process have been, until re- 
cently, difficult to come by. Two such schemes att 
used here. Both derive from information and com- 
munication theory and deal with stochastic proc- 
esses in which sequences of events are represented 
by probabilities. One technique, presented by Smith 
(1953), McGill (1954), and Garner (1958), is that 
of uncertainty or multivariate information analysis 
A second scheme for analyzing sequential data 1 
the transition probability model, described by Ashby 
(1958) and by Kemeny, Snell, and Thompson 
(1956) .4 


further details and examples. Sequences occur within | 


MULTIVARIATE INFORMATION ANALYSES 


Garner (1962) summarizes the experimen- 
tal literature which uses multivariate infor 
mation analysis, and he presents the theo 
retical implications of this mode of approach 
to psychological data. The application of the 
method is most clearly discussed by Attneave 
(1959). Since Attneave describes the analys 
with sufficient examples so that it can be fol- 
lowed by the mathematically unsophisticaté 
reader, it is not necessary to go into any fe 
tail here. A general comment by Ashby (1962) 
is perhaps appropriate: 


= 
O a ee, 


[this approach] gives us a method for the treatment 
of conditionality that is not only completely rigoro 

but is also of extreme generality. Its great ge 

and suitability for application to complex beht ily 

lies in the fact that it is applicable to any arbi 

defined set of states. Its application requires BO" 
linearity, nor continuity, nor a metric, nor even i 

ordering relation. By this calculus, the degree of eo 


Robert G. Ryder, of the Child Research Branch 
at the National Institute of Mental Healthy $ 
worked out generalized computer programs for ity 
multivariate information and transition pre abil 
analyses. 
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ditionality can be measured, and analyzed, and ap- 
portioned to factors and interactions in a manner 
exactly parallel to Fisher’s method of the analysis of 
variance; yet it requires no metric in the variables, 


only the frequencies with which the various com- 
, binations of states occur [p. 256]. 


Ashby looks forward to the time when these 
methods will become the accepted language 
for the quantitative treatment of complexities 
‘of relation in nonmetric variables of psychol- 
“ogy and biology. 

Multivariate information analyses were 
used previously for describing the relations 
among individual differences, situational in- 
“fluences, and behavioral change in treatment. 
Here, however, the method is used for ana- 
lyzing contingencies between the behaviors of 
“one person and another, and for describing 
| how these contingencies are affected by the 

natures of the situation and the group. The 
"findings may be considered from several per- 

Spectives. One is from the point of view of 

enhancing the predictability of behavior; an- 
“other is from the point of view of studying 

‘Communication processes; & third perspec- 
tive) relevant here, comes from the question, 
raised earlier, of control and organization.’ 


Main Effects 


Effect of the situation on antecedent acts. 
= Table 1 presents results on information 
 Yielded about classification of antecedent acts 

-from classification of the situation. T, in this 
‘and in the tables which follow, is the infor- 
ration metric, in bits, from which an esti- 


. | Another perspective is noted by Stuart A. Alt- 
“Mann of the Department of Zoology of the Univer- 
s of Alberta (Altmann, 1965). Altmann suggests 

wee it information analysis of behavior sequences is 
Televant not only to issues of prediction and com- 

A nication, but also to the issue of social memory. 

at is, the number of intervening events over which 

an act by A continues to influence B’s behavior is 
‘Operational measure of B’s social memory for 
an act, This issue, which relates to the temporal 
nt Of what is here called indirect control, cannot 
tudied by the present information analyses, since 
e data were sufficient to enable only an analysis 
amediate contingencies. Although the transition 
` RAN p since they involve predicting the ultimate 
e of an initiating event, are related to this issue, 


om 


Would need more complete data for tackling the 
ion properly. It is worth noting that Altmann’s 
ion of the social memory perspective intro- 
Possibilities for comparative analyses across 
cultures, social classes, or personality types. 
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TABLE 1 
EFFECT OF THE SITUATION ON ANTECEDENT ACTS 


% reduction of 
uncertainty of 
antecedent acts 


854 | .092 11 


Group f T» 


Hyperaggressive (later 
ein plus American 
well-adjusted controls 


Norwegian well-adjusted | 1,237 .081 13 


a T is the information, in bits, transmitted between the classi- 
fication of the six situations and antecedent friendly or un- 
friendly actions, Values are statistically significant, » <.001. 


mate of statistical significance can be de- 
rived. The last column in the table is the co- 
efficient of constraint, D (cf. Attneave, 1959, 
pp. 35-36, 59-60), multiplied by 100 to 
yield the percentage of uncertainty—of, in 
this case, the antecedent act classification— 
accounted for—by, in this case, the situa- 
tional classification. It represents the propor- 
tion of information shared between the two, 
and is analogous to the amount of variance 
accounted for. The table indicates that by 
distinguishing among the six situations we 
may reduce the uncertainty of antecedent 
affectional acts by some 12%. The values 
seem relatively consistent for the two analy- 
ses. An example from the data may illustrate 
the finding. For the American normal chil- 
dren, 42% of antecedent acts were unfriendly 
in game situations °; however, only 5% of 
antecedent acts were unfriendly at mealtimes. 
Thus, if we know the social situation, we 
can improve, considerably beyond chance, 
our guesses about the relative likelihood 
of friendly or unfriendly behavior.” From a 
communication viewpoint, the situation sets 
a tone, influencing the likelihood that certain 
acts will occur. 

Effect of the group on antecedent acts. The 
group variable also related to antecedent acts. 
As one might indeed expect, hyperaggressive 
boys, as compared to normal boys, produced 


6 Acts which were an intrinsic part of the games 
were eliminated from the analyses. 

"This and following statements about statistical 
significance refer, it should be noted, to populations 
of acts rather than to populations of children; one 
may consequently argue that statistical requirements 
of independence are not met. I have therefore been 
cautious about interpreting results; unless the dif- 
ferences seem sizable, and unless they were in some 
way replicated, they are not commented on. 
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TABLE 2 
EFFECT oF GROUP ON ANTECEDENT ACTS 


| % reduction of 


Group Í Tə | uncertainty of 
antecedent acts 
Hyperaggressive (later 854 | .087 10 
group) plus American 
well-adjusted controls i 
Hyperaggressive (both 514 | .061 7 


groups) plus American 
well-adjusted controls ; 
incomplete but 
matched data 


aT is information yielded about antecedent actions from 
classifying children as to hyperaggressive or normal, Values are 
statistically significant, p <.001. 


fewer friendly acts as stimuli for their peers. 
For the American normal children, antecedent 
acts were coded friendly in 89% of the in- 
stances; for the Norwegian boys, the value 
was 85%; the later patient sample produced 
only 58% antecedent friendly acts, and the 
earlier group, for whom only partial data 
were available, showed only 51% such acts.® 
As shown in Table 2, by distinguishing be- 
tween the clinical group and their matched 
normals, we may account for about 10% of 
the variance of antecedent affectional behav- 
ior. Like the situation, the nature of the group 
and/or its pathology sets a tone for the com- 
munication process. 

Effect of antecedent on subsequent acts. 
Table 3 shows that the major determinant of 
an act was the immediately preceding act. 
That is, if you want to know what Child B 
will do, the best single predictor is what 
Child A did to B the moment before. For the 
three groups for which there were complete 
data, the results indicate that by classifying 
A’s acts we can account for about 30% of 
the variance of B’s immediately following be- 
havior. The more limited sampling of data 
available for the younger, more disturbed pa- 
tient group yields somewhat different results. 
The prior act was still the best single pre- 
dictor, but in this case it accounted for only 
7% of the variance of subsequent acts, Un- 
fortunately, we cannot say here whether this 
difference is related to age, pathology, or to 
the nature of the data; the method, however, 

$ Data on reliability of coding and on questions of 


' »xater bias are presented by Dittmann (1958) and by 
Raush et al, (1959a, 1960). a 


Harotp L. RausH 


allows and suggests such investigations for 
studying development and psychopathology, 


Control and Organization 


Antecedent social acts thus exercise a meas- 
ure of direct control on the subsequent be- 
havior of others. Since the nature of ongoing 
situations and of particular social groups 
exert a degree of direct control on antecedent 
acts, they also indirectly control the behav- 
iors which follow. 

One advantage of multivariate information 
analysis is that it allows us, in examining the 
relation between two variables, to partial out 
either single or joint effects of other variables. 
The ability to hold variables statistically con- 
stant allows us to test for what has been 
called here organizational effects. Organiza- 
tion would exist, if, let us say, the situational 
variable exerted constraint on subsequent 
acts, when antecedent acts were partialed out. 
We would then have evidence that in situa- 
tion a antecedent act a, has greater likelihood 
of leading to subsequent response bı, whereas 
in situation £, antecedent act a: has greater 
likelihood of leading to subsequent responst 
bs, and this would meet our definition of or- 
ganization. 

Organizational effects of the situation. 
After eliminating the effect of antecedent acts, 
the situation had a reduced, but still statist 
cally highly significant (p < .001) effect on 
subsequent acts. The same was true when 
both group and antecedent act variables weré 
partialed out. An example selected from the 


data may give concreteness to the finding. In, 


TABLE 3 
EFFECT OF ANTECEDENT ON SUBSEQUENT ACTS 


% reduction K i 


Group ey Ta | uncertainty ol 


subsequent acts 
Hyperaggressive (later, 472 | .280 28 
after 2 years treat- 
ment) 
American well-adjusted | 382 .192 31 
Norwegian well-adjusted | 1,237 | .192 a 
Hyperaggressive (early, 181 | .068 
after 5 year treatment; 
incomplete data) 
it 
* Tis the information, in bits, transmitted between anteceden 


rst 
acts and subsequent responses, both classified as friendly fr, 
unfaeealy All values in the table are statistically S18 
p < . 


INTERACTION SEQUENCES 


zame situations among the American normal 
boys, friendly acts led to unfriendly responses 
n 31% of instances, whereas at mealtimes, 
friendly acts led to unfriendly responses only 
4% of the time; similarly, among the Nor- 
wegian children, friendly acts were followed 
by unfriendly responses 20% of the time at 
games and only 5% of the time at meals. 
Thus, the situation not only controlled the 
initiating events in interaction contingencies, 
but it modified the patterns of contingent re- 
lations. : 
Organizational effects of the group. The 
comparison between hyperaggressive boys 
later in treatment and matched socially ad- 
justed children similarly illustrates not only 
control by the group or pathology variable, 
‘but also organization. Eliminating the influ- 
ence of antecedent acts, or of both anteced- 
ent acts and situations, yielded a significant 
(p < .001), though reduced, group variable 
influence on subsequent acts. The results thus 
indicate that the two groups of children re- 
acted somewhat differently to the same 
stimuti, We may again turn to the data for 
concrete cues as to what goes on. The hyper- 
aggressive and the normal boys responded 
rather similarly to unfriendly antecedent acts, 
with 80% and 77% unfriendly responses. 
Data for the early patient group, studied 
after 6 months of residential treatment, could 
not be submitted to an exactly comparable 
information analysis, but they were consist- 
ent in that this latter group responded to un- 
friendly acts with 75% unfriendly responses. 
Where the groups differed was in their re- 
Sponses to friendly antecedent acts. Among 
the normals, the friendly acts of one child 
were followed by unfriendly behavior on the 
part of the recipient in 8% of instances; the 
group later in treatment reacted to friendly 
» acts in an unfriendly fashion in 19% of in- 
stances; and for the early patient group, this 
value is 45%. It is not clear to what extent the 
effects are due to differences in personality 
A Pathology; or to differences in group for- 
EA or, in the case of the earlier group, 
‘nica differences. It does, however, seem of 
ae fe practical significance that the 
< tile ps differed not in their response to hos- 
k gestures, but rather to friendly ones. 
ere is the further implication that the dis- 


493 


turbed children were less able to differentiate 
friendly from unfriendly acts. From the view- 
point of organization, it is apparent that the 
groups responded differently to antecedents 
which were coded as equivalent. The group 
variable thus exhibited an organizing effect. 
Like the situational variable, it not only con- 
trolled the initiating events, thus influencing 
subsequent acts, but it also modified the pat- 
terns of contingencies. 


Relative Contributions of Components 


Estimates of the relative contributions of 
each variable to behavior appear in Table 4. 
The table presents estimates of the propor- 
tion of remaining variance that is accounted 
for by a particular variable, when the con- 
tribution of other components is removed.’ 
Although the figures in the table cannot be 
evaluated for statistical significance, they sug- 
gest roughly how much of the possible vari- 
ance is attributable to a given component in- 
dependently. Both control and organizational 
effects are indicated. As noted earlier, the 
data were insufficient to examine organizing 
effects of antecedent acts, such as would oc- 
cur, if, for example, act bı, anteceded by 
act a, led to act c1, whereas bı, anteceded 
by a, led to c2. But as determiners of im- 
mediately subsequent acts, antecedent acts 
play the predominant role. The situation and 
group variables contribute secondary inde- 
pendent effects. Other results indicate that, if 
all three components—antecedent acts, situa- 
tions, and groups—are taken into considera- 
tion, some 40% of the variance of immedi- 
ately subsequent behavior is accounted for. 
Individual difference factors, which could not 
be dealt with in these analyses, would, from 
indications of the earlier studies, increase 
this value. 

Some further qualifications should be noted. 
One consideration involves the number of 
categories used. This is likely to affect spe- 
cific values. A replication of an analysis, 
using eight categories for interactive behav- 
jor rather than the bivariate friendly versus 


9 The general formulae are: for three variables, 
D-(y:8)/1 — D(x:2); for four variables, Dwo(y:2)/ 
1—D(w, x:2)- The procedure was suggested to me, 
in personal communication, by Fred Attneave of the 
University of Oregon. 
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TABLE 4 4g 
ESTIMATED CONTRIBUTIONS OF VARIABLES TO BEHAVIOR Mf 
7 
Accounts for When— 
Classification | 22 of the or In sample(s) Be? ak 
his vars o @ (8) 3 
1. Antecedent act 23 Subsequent Hyperaggressive Situation and Ce 
response (later) plus group 
American controls 
2, Antecedent act 25 Subsequent Norwegian Situation 
response 
3. Situation 15 Antecedent oe gressive Group 
act ter) plus 
American controls 
4. Situation 13 Antecedent Norwegian == 
act 
5, Grou 15 Antecedent Hypera; ive Situation 
E act Yaten plus 
American controls 
6. Situation 10 Subsequent Hyper ressive Group and 
response (later) plus antecedent 
American controls act 
7. Situation 6 Subsequent Norwegian Antecedent 0 
response act 
8. Group 7 Subsequent Hyperaggressive Situation and (0; 
response “Tater plus antecedent 
American controls act 


Note—The table is to be read across each row. Thus, Row 1 says that classification by antecedent act accounts 
of immediately following responses in th 


the potential remaining 


fects of situational and group differences are removed, This illustrates control on subsequent responses by ani 


Formulae for Column 2 are presented in the text. 


unfriendly categorization, yielded somewhat 
different specific values. Nonetheless, the or- 
derings discussed above were maintained. One 
should also note that for other kinds of inter- 
action sequences than those involved in affec- 
tional relations among peers, the same order- 
ing of results need not, and, indeed, does not 
hold, For example, children’s friendly or un- 
friendly responses to adult behavior were de- 
termined relatively less by the specific ante- 
cedent acts of the adult, and relatively more 
by situational and group components. None- 
theless, similarities do appear. For example, 
as in the case of peer relations, the hyper- 
aggressive children did not differ appreciably 
from the normals in reacting to unfriendly 
adult actions, but differed considerably in 
their responses to friendly actions. As com- 
pared to the data reported here on affectional 
behavior, studies of status behavior yielded, 
in general, a lower level of predictability and 


je hyperaggressive and well-adjusted chil 


less consistency between American 4 
wegian data. It was clear, howe 
among all groups of children, submis! 
cedent acts of one child were strongiy 
ciated with immediately subsequent doi 
acts of another; dominant antecedent! 
ior tended to be followed by submis: 
sequent acts, but at a much lower 
probability. Different kinds of beha 
different sources for, or recipients of | 
are thus likely to yield different 0 
effects. What should be noted, hov 
that the method enables us to stu 
orders and to infer from them the 
of communication as these relate 
circumstances of interaction. 


TRANSITION PROBABILITY ANAL) 


The information analyses consid 
gencies between antecedent and. 
actions. In addition to furnishing ! 


INTERACTION SEQUENCES 


about sequential relations, the analyses yield 
precise measures of the contributions of other 
identified components. The transition analy- 
ses which follow do not yield such measures. 
But unlike the information approach, they 
do treat chains of interactions. In this way 
they can provide an insight into the process 
of interaction from its initiation to its natural 
termination. 

The coding and entry of data are as dis- 
cussed above. As a final notation, however, 
each chain of acts composing a sequence is 
listed, From the set of sequences one can 
then obtain initial probabilities and probabili- 
ties for each subsequent step along the series. 
Given two adjacent steps, we can determine 
the probability of any category of behavior 


¢ on the part of one person leading to a given 


category of behavior on the part of another 
person, From this empirical relation, by mak- 
ing the assumption that these contingencies 
will continue, a subsequent course of events 


. can be predicted. Finally, one can compare 


these probabilities, obtained from the assump- 
tions of reiterated events, with the actual 
empirical probabilities. Thus, the method en- 
ables comparison between the events pre- 
dicted from a given transition and the actual 
events as they occur. In a sense, then, we 
may compare the expected fate of a set of 
actions with the actual fate. 

Early patient group. Figure 2 shows the re- 
sults for the hyperaggressive boys early in 
treatment. We may note that in interactions 
among peers 70% of the first acts were 
friendly. By the second step, that is, the re- 
Sponse by another to the first act, the pro- 
SNe of friendly acts dropped to 48%. The 
| aaa curve, represented by the dotted 
peers the theoretical continuation from 
ae initial matrix of probabilities. If succes- 
a actions followed the hypothetical curve, 
a eeortions woul stabilize at the fourth 
ne t that point only 34% of the children’s 
eee be expected to be friendly. The 
ee ata for successive actions are repre- 

ed by the solid line. The frequencies of 


_ Tesponses for the limited data from the early 


Sea group are too small to warrant much 
lence, but so far as the results are to be 


pnt 
rusted they suggest two trends. The first is 


for h 
hostile responses to increase in successive 
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Fic. 2. Peer interactions of hyperaggressive boys 
early in treatment. 


steps; the increase is quite sharp—by the 
second act less than half of the responses are 
friendly. More interesting is a suggested trend 
for hostility to increase beyond its expected 
theoretical course. A similar trend, not shown 
here, appeared in the interactions of these 
children with adults. It is as though in this 
group of very disturbed aggressive children, 
as interaction continues, the course of hos- 
tility is progressively accelerated. 

Later patient group. The results for these 
same boys after an additional 1.5 years of 
residential treatment are shown in Figure 3. 
The figure shows that first acts in interaction 
chains were coded as friendly in 63% of in- 
stances; thus, in their initiations, the children 
were no friendlier toward their peers at this 
stage than they were at the earlier time. But 
the difference appears in the slope of the 
curve. The drop in the proportion of friendly 
acts at the second step was to 56%, consid- 
erably less sharp than occurred in the earlier 
period. The predicted course of events from 
the initial transition suggests that the process 
would stabilize at the fourth step with 47% 
friendly acts, as compared to the theoretical 
34% for the earlier period. Comparing the 
obtained and hypothetical curves suggests 
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Fic. 3, Peer interactions of hyperaggressive boys 
later in treatment. 


another difference from the earlier phase. 
The obtained data—and here frequencies are 
somewhat more respectable—suggest that 
whereas hostility may increase as interaction 
continues, it does not, unlike the earlier situa- 
tion, accelerate beyond its expected course as 
given by the theoretical curve. Successive acts 
are perhaps even somewhat less unfriendly 
than we would expect on the basis of the first 
transition, Data on interactions with adults 
showed some differences from the above pic- 
ture of interactions with peers: as compared 
to their earlier performance, the later group 
was considerably friendlier in initiations of 
sequences; furthermore, successive transitions 
became somewhat less friendly than was ex- 
pected theoretically. But a major similarity 
held. As in the case of peer relations, the 
sharply progressive increase in hostility which 
characterized the earlier pattern of interac- 
tion was attenuated in the relations of the 
later group with adults. 

Normal American children. As we may note 
from Figure 4, the control children showed a 
higher proportion of friendly initiations with 
peers than did either patient group (94%). 
As with the other groups, there was a de- 
crease in friendliness at the second step. The 
matrix of probabilities from the initial tran- 
sition indicates an expected stabilization by 
the fifth act in a chain with 63% friendly 
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acts. Thus, in contrast with the patient 
groups, a chain of interactive events with 
these well-adjusted children would be ex- 
pected to wind up in a friendly fashion more 
than half the time. A further difference be- 
tween patients and controls occurs in the dis- 
crepancy between theoretical and obtained 
data. The obtained data for the early pa- 
tients tended to show a higher proportion of 
hostile actions than theoretically expected; 
the later group showed proportions close to, 
or perhaps slightly less than expected; the 
control children deviated from expectations 
in a direction clearly opposite that of the | 
early patients. At each successive step, the | 
normal boys showed greater friendliness than 
would be expected from the hypothetical 
curve based on the initial transition matrix. | 
Relations with adults yielded similar indica- 
tions, although high initial proportions of 
friendly acts limit the differences. For the 
control group, then, there is the suggestion | 
of a corrective factor inhibiting the expected 
course of increase in hostility. In contrast to 
the acceleration of hostile actions found in 


10 Analysis indicated that these results are not an 
artifact of differences in the length of sequences. 
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the very disturbed boys, a kind of negative 
feedback on hostile responses seems to occur 
in the interaction stream of the well-adjusted 
boys. 

Normal Norwegian children. A partial rep- 
lication is found in the data for the Nor- 
wegian boys. Despite cultural differences and 
differences in the environments in which the 
studies were conducted, the two socially ad- 
justed groups show similar processes in se- 
quential chains of interactions (Figure 5). 
Both groups started with a high proportion 
of friendly acts which dropped somewhat at 
the second act. For both groups the point of 
expected stabilization is on the friendly side 
of the ledger, although it is at a higher level 
for the Norwegians (759%). As with the 
: American controls, the empirical data deviate 
consistently from the hypothetical curve, and 
the direction of deviation is the same for the 
two groups. That is, in the course of interac- 
tion the Norwegian boys also maintained a 
_ higher level of friendliness than was to be 
expected on the basis of the starting points. 
For «this group too, then, a corrective fac- 
tor seems to operate to inhibit the expected 
course of degradation of interaction toward 
greater hostility. 

The results from the transition probability 
studies thus suggest an organizational com- 
ponent in the flow of social interaction. In- 
teraction does not proceed in an automatic 
fashion based on its beginnings. The process 
of interaction appears rather to be modified 
systematically by a component which differs 
among the groups studied. From knowledge 
of the groups and of their selection, we would 
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suggest that one variable organizing the se- 
quential process of social interchange is what 
has been called ego control (Redl & Wineman, 
1957). 


Discussion 


We have demonstrated two methods, that 
of multivariate informational analysis and 
that of transition analysis, for studying the 
sequential aspects of social interaction. In a 
sense the findings tell us what we have al- 
ways known: that among peers, affectional 
relations are strongly contingent—that is, 
what A does affects B’s response; that the 
same kind of act will have different conse- 
quences, depending on the situation; and that 
the same kind of act is apt to be interpreted 
and reacted to somewhat differently by differ- 
ent groups. Nor would the patterns indicated 
by the transition analyses be particularly 
novel to anyone who spent much time with 
these groups of children. Those who worked 
with the disturbed children were indeed aware 
of the aptitude of these boys for moving 
rapidly from an atmosphere of friendliness to 
one of wildly chaotic aggression. And after 
working with hyperaggressive boys these 
same workers were impressed by the con- 
trasting capacity of normal children to catch 
themselves up—to reorganize and correct for 
threats of disruption in group interaction. 

The specificity of the methods allows us, 
however, to achieve something more than the 
clinically obvious. Given the situation, given 
the nature of the group, we can, knowing the 
actions of one child, predict rather well what 
another child will do. Even without consider- 
ing individual differences—which on the ba- 
sis of previous studies are a potent contributor 
to interaction—these three components yield 
a prediction of peer behavior corresponding 
to an R of more than .60. Specifically, we can 
detail which situations are likely to induce 
which behaviors and how particular situa- 
tions are likely to modify the contingencies 
between stimulus and response acts. Further- 
more, whereas earlier studies suggested that 
situations were not as discriminable for very 
disturbed children, the present data suggest 
additionally that behavioral events them- 
selves are less differentiated by the very dis- 
turbed children. There was evidence that the 
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groups of children differed rather little in 
their responses to unfriendly acts; rather it 
was acts thought by coders to be friendly 
which differentiated the groups, the disturbed 
children more often responding as though 
such acts had hostile meaning. 

To fully confirm some of these suggestions 
would require more adequate data than were 
available, but what is clear is that the meth- 
ods yield a lever for a more specific under- 
standing of normal and pathological interac- 
tion processes. For example, an extension of 
the methods of analysis to the interactions 
of autistic children—or for that matter, to 
schizophrenic adults—suggests several alter- 
native hypotheses: that contingencies between 
antecedent and response acts are highly fixed, 
albeit unusual and socially inappropriate (cf. 
Wallace, 1961); that contingencies are highly 
uncertain, fluctuating from moment to mo- 
ment; that contingencies are fixed or uncer- 
tain, according to subgroups; that contin- 
gencies are primarily idiosyncratic; that it is 
not contingency relations which distinguish 
the interactions of these populations, but 
rather the range and/or nature of antecedent 
and consequent acts in themselves. The de- 
tails of who and what are responded to and of 
modes of response might also provide more 
adequate bases for theory and clinical prac- 
tice. 

In a more general sense, the analyses point 
to two broad issues, one of differentiation or 
discrimination learning, the other of timing. 
Concerning differentiation, the data suggest 
that patterns of interaction are organized by 
contextual circumstances and by the psycho- 
logical status of the child. Yarrow (1963) 
similarly notes that parent-child behaviors 
are likely to differ according to the situation, 
the developmental level of the child, the psy- 
chological state of the mother, etc., and she 
comments that few parents are uniformly 

nurturant or hostile; consequently, she sug- 
gests that the distribution of parental behav- 
iors in relation to specific developmental 
stages and specific circumstances are likely 
to be more relevant for the child than are 
` averaged occurrences. In this sense, a trait 
approach to personality and its learned as- 
pects is, as we suggested earlier (Raush et al., 
1959b) apt to be misleading. Children are, 
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for example, no more taught not to be de 
pendent than they are taught not to urinal 
What they are taught are right and v 
times and places. For some parents and 
some cultures, as compared to others, there 
are more wrong places, and in that se 
one might speak of quantitatively greater 
lesser amounts of expression or inhibitio 
But this is a rather different matter than 
sumed quantities of traits. On a resea 
level, our data imply that it would be 
for’questionnaire and experimental, as well ¢ 
observational studies to take into account t 
matter of sampling of situations called for b; 
Brunswik (1947)."* 
Turning to the matter of timing, the tran 
sition probability analyses indicate that th 
position of an act in a sequence chain make 
a difference. Experience thus appears to 
organized temporally. The data suggest ta 
normal preadolescent children are able | 
foresee things going from bad to worse 
are able to pull themselves up from the 
pected course of events. There is, to be sure 
nothing magical about this “rescue” obu 
interaction process. On the basis of a pi 
clinical analysis, Blanche Sweet and I (Ra 
& Sweet, 1961) suggested the sensitivity 
these children to the potentials of interact 
and we illustrated some techniques by wal 
healthy preadolescents foreclose and ameli 
rate the dangers in a potentially destruct 
course of events. A shift of topic, a Tee 
nition of the other’s feelings, a conventio 
encouragement, for example, “nice try,” © 
vey a sensitivity to the potentials of the othe 
and of the situation. In contrast, the very di 
turbed children not only failed to “resctl 
the interaction process, but even failed to p! 
serve its presumed course. I would sū 
that this failure is less a matter of pleas 
in malice, and more a matter of loss of í 
ready tenuous controls. p 
There is, however, another aspect to timui 
As is familiar to every therapist-patient, Pi 
ent-child, husband-wife pair, when somethl 


11 This is true also for cross-cultural studie 
example, our data showed no differences 1% ge 
between the peer behavior of Norwegian 
matched American children, but there was 45% 
cally significant difference in which settings prodi 
which behaviors. : 


INTERACTION SEQUENCES 


is done is often as important as what is done. 
For example, an interposition between quar- 
reling children can come too early so that its 
relevance is lost, or it can come too late when 
the situation is past all except heavy-handed 
“control. Given sufficient data, the transition 
probability model can enable us to under- 
stand the kinds of interventions and the tim- 
ing of interventions which are effective in re- 
lation to specific goals. Finally, the present 
data suggested that chains of interaction se- 
quences move toward stable equilibria. The 
achievement of such equilibria in interaction 
thus becomes a variable subject to study. Too 
little is known, but it is reasonable to sup- 
pose that time to reach equilibrium and the 
end points of equilibrium will vary with the 
«character and the deyelopment of particular 
human relationships. 
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EFFECTS OF PRELIMINARY POSITIVE AND NEGAT 
INTERACTIONS AND DELAY CONDITIONS ON 
CHILDREN’S RESPONSIVENESS TO 


SOCIAL REIN 


HERSHEL BERKOWITZ anp EDWARD ZIGLER 
Yale University 


The satiation and valence positions 


outlined and certain hypotheses derived from them were tested. 80 2nd graders 
were administered a socially reinforced tisk following exposure to 1 of 3 
experimental treatments: positive social contact (P), negative social contact 
(N), and no social contact. Of the Ss in the P and N treatments, 14 were 


given the criterion task immediately 


a delay of 1 wk. The significant interaction between the delay and treatment 


variables (p < .005) indicated that th 


sistency in earlier findings. The significant ordering of treatments in the delay 
condition (p < .005) lent support to the valence position. The study indicated 
that children’s positive and negative 
important factors in determining his 


A number of investigators have attempted 
to assess the influence of various types of 
social contact between experimenter and sub- 
ject on the subsequent effectiveness of the 
experimenter as a reinforcing agent (Berko- 
witz, 1964; Erickson, 1962; Gewirtz & Baer, 
1958; McCoy & Zigler, 1965; Shallenberger 
& Zigler, 1961). In general, the findings have 
indicated that, compared to no or negative 
social interaction, a preliminary period of 
social contact with a warm or rewarding ex- 
perimenter reduced the effectiveness of social 
reinforcers. A view frequently advanced to 
explain this finding has been that the sub- 
jects’ drive for social reinforcers is relatively 
satiated during the preliminary period of 
social interaction (Berkowitz, 1964; Erick- 
son, 1962; Gewirtz & Baer, 1958). 

McCoy and Zigler, however, have argued 
that the satiation position alone is incapable 
of parsimoniously encompassing certain find- 
ings reported in the social reinforcement 
literature (e.g., Stevenson, Hickman, & 
Knights, 1963). McCoy and Zigler, elaborat- 


1This research was conducted while the first 
author was on an NIH postdoctoral fellowship 
(1-F2-MH_ 20, 434-01). It was also supported by 
Grant MH-06809 from the United States Public 
Health Service. The authors wish to express their 
thanks to the members of the Meriden, Connecti- 
cut, elementary school system for their cooperation 
y and assistance in the course of this study. 


FORCEMENT ' 


concerning reinforcer effectiveness were 


after preliminary contact, and % after 


e delay variable was 1 source of incon- 


reaction tendencies towards E were 
effectiveness as a reinforcing agent. 


ing on earlier work (Shallenberger 
1961; Zigler, 1963), have asserted 
typical experimental situation in wh 
adult verbally reinforces a child mi 
thought of as a complex interaction im 
the adult has both positive and mi 
characteristics for the child. The magi 
of these positive and negative valence 
viewed as an outgrowth of the child’s p 
lar experiences with adults. McCoy 
ler’s position generated the predictio! 
preliminary period of warm interacti 
tween experimenter and child would 
the experimenter’s positive valence 
tenuate his negative valence. This wouk 
crease his reinforcer effectiveness OV! 
found following a neutral interaction, 
in turn would result in greater eff 
than that found when the experime 
a stranger to the child. These p 
were confirmed by McCoy and Zi 
findings being inconsistent with 
ported in support of the satiation ] 
(Berkowitz, 1964; Erickson, 1962; 
& Baer, 1958). } 
This inconsistency may have been 
a significant methodological differen 
the earlier studies the effectiveness 
reinforcement was assessed immediat 
lowing the preliminary interaction, 
McCoy and Zigler measured reinfor 
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tiveness after a week’s interval. It is possible 

. that preliminary positive interaction serves 
both to satiate drive for social reinforcers 
and to strengthen the experimenter’s posi- 
į tive valence. Satiation effects, however, might 
be expected to dissipate over time, so that 
the primary factor influencing performance 
after a week’s delay would be the valence 
that the reinforcing agent held for the child. 
The delay factor may also be of crucial 
importance in regard to a condition, not em- 
ployed by McCoy and Zigler, in which the 
child has had negative experiences in his pre- 
liminary interaction with the experimenter. 
The valence position would generate the pre- 
diction that such a negative condition would 
reduce the effectiveness of the experimenter 


** as a social reinforcer. However, Stevenson 


and Snyder (1960) and Shallenberger and 
Zigler (1961), employing the no-delay pro- 
cedure, found that reinforcer effectiveness 
was enhanced following a negative preliminary 
. condition. These latter investigators at- 
tributed this finding to motivational effects 
introduced by the contrast or interaction be- 
tween the negative preliminary and the posi- 
tive test conditions, It is also a possibility 
that when a positive condition immediately 
follows a negative one, the child has consid- 
erable difficulty in evaluating the experi- 
menter, that is, in determining just what 
valence to assign to him, It may be that with 
a week’s delay the interaction effects be- 
tween negative preliminary condition and 
subsequent positive reinforcement would dis- 


~ Sipate and the experimenter would acquire a 


definite valence which would then become the 
primary factor influencing the child’s per- 
formance. In the present study, positive and 
hegative preliminary interactions, in con- 
: junction with the no-delay and the week-delay 
procedures were employed to test the hy- 
Potheses advanced above. A control condi- 
tion, in which the experimenter had no pre- 


liminary contact with the subjects, was also 
` utilized, 


METHOD 
Subjects 


au subjects were 80 second-grade children, 40 
eee and 40 girls, obtained from the Thomas Hooker 
fe ic school in Meriden, Connecticut. The mean 
ge of the 80 subjects was 7.3 years. There were 
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three conditions of experimenter-subject interaction 
prior to criterion task administration. Thirty-two 
subjects met individually with the experimenter 
under a condition of positive prior contact (P), 32 
under a condition of negative prior contact (N), 
and 16 had no prior contact (C). Of the P and N 
subjects, half were tested immediately after initial 
contact (I) and half were tested after a 1-week 
delay (D). Assignment to the five groups (PI, PD, 
NI, ND, and C) was made randomly, with the 
stipulation that each included an equal number of 
boys and girls. The mean CAs of the five groups 
were quite comparable (range: 7.2-7.3 years). 


Apparatus 


Three games (similar to those employed by 
Shallenberger & Zigler, 1961) and a painting activ- 
ity (similar to that employed by McCoy & Zigler, 
1965) were used during the preliminary social inter- 
actions. The criterion task was the Marble-in-the- 
Hole game employed by McCoy and Zigler. 

The Card game consisted of three sets of 242X3 
inch cards. One set contained cards with red rec- 
tangles on them, one set cards with black rec- 
tangles, and one set had 12 of each. The Nail Board 
game consisted of a gold board inclined at a 50- 
degree angle, having two small boxes at the bottom, 
one green and the other black. Nails placed on the 
surface of the board formed several paths, all of 
which led to the green box. The rigged nature of the 
game was very difficult to discern. Another small 
box was used to hold 12 marbles. The Nursery School 
game consisted of two small wooden houses and 12 
pictures of young children. The materials for the 
painting activity consisted of paper and pastels in 
12 colors. 

The Marble-in-the-Hole game consisted of a grey 
box, 14 inches high X 20 inches deep X 19 inches 
wide. On the side facing the subject were two holes, 
1 inch in diameter, placed 6 inches apart, and be- 
neath them, a semicircular opening 2 inches in diam- 
eter. The interior of the box contained a felt-lined 
ramp, funneling marbles from the two holes on 
top to the opening beneath. This opening was filled 
with yellow and green marbles, thus assuring the 
subject a steady supply. 


Procedure 

The experimenter met with each subject indi- 
vidually, both during the preliminary interaction 
and during the criterion task administration. Upon 
meeting the subject, the experimenter said, 


Im Mr. Berkowitz. We're going to play 
games today. These are fun games and I 
that you will like them. 


some 
think 


The experimenter then conducted the child to the 

imental room, chatting with the subjects in 
the P condition, but remaining silent with the 
others, except to give minimal answers to their 
questions. The experiment was conducted in a light 
airy room, which contained two large work tables 
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and two child-sized desks surrounded by adults’ 
and children’s chairs. The subject was seated at a 
desk, and the experimenter sat beside him. The ex- 
perimenter immediately gave instructions for the 
first preliminary game to the subjects in the P and 
N conditions, or the instructions for Marble-in-the- 
Hole for subjects in the C condition. 

Preliminary games. In the Card game, the experi- 
menter showed the subject the faces of the cards in 
the deck containing both red and black rectangles. 
He then distracted the subject’s attention and sub- 
stituted a deck containing only red rectangles (P) 
or only black ones (N). The cards in this deck 
were spread face down on the table. The subject 
was instructed to draw one card at a time from the 
cards on the table and try to pick cards with red 
faces. After each red card was exposed, the experi- 
menter said “right,” after each black card, “wrong.” 
.The subject was allowed to pick 12 cards. 

In the Nail Board game, the subject was shown 
the apparatus and told that he was to start a 
marble at the top of the board. If it rolled into the 
right box he “won.” As the marble always went 
into the green box, subjects in the P group were 
told that the green box was correct. Those in the N 
group were told that the black box was correct. The 
subject was told “right” or “wrong” for the first 6 
trials, Each subject was given 12 trials. 

In the Nursery School game, the subject was told 
that the houses represented nursery schools attended 
by the children in the pictures. The subject was 
instructed to place each of the 12 pictures in the 
schools he thought the children attended and the 
experimenter would tell him if he were right or 
wrong. After each response, subjects in the P group 
were told “right,” those in the N group, “wrong.” 

After the three games were completed, the experi- 
menter said, 


Why don’t you try some drawing now? I'll give 
you some of these pastels and some paper and 
you can make a picture of anything you like. 


All subjects in the P and N groups were given the 
preliminary tasks in this order and played them for 
a total of 20 minutes. 

Relationship conditions. In the P treatment, the 
experimenter’s manner was warm and friendly, and 
he praised the subjects’ performance on all the 
games. Responses of the subjects in this condition 
always met with success, the games being prear- 
ranged so that they could make only correct re- 
sponses. These subjects received both verbal and 
nonverbal support (ie., smiles and nods) during 
the games. Verbal support consisted of four rein- 
forcing statements, made in random order after 
every ninth response (counted across games). The 
reinforcing statements employed were: “That’s fine,” 
‘Very good,” “You really know how to play this 
game,” and “You really know how to play [name 
of game] don’t you?” During the painting activity, 
the experimenter complimented the subjects and 
encouraged their activity, freely dispensing ap- 
proyal both verbally and nonverbally. 
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The experimenter’s manner in the N treatment 
was aloof and businesslike. He refrained from sup- 
portive behavior of either a verbal or nonverbal i 
type. He remained silent except to answer questions 
or to deliver criticisms and negative reinforcement, 
The experimenter employed a flat, indifferent tone of 
voice throughout this treatment. Responses of the 
subjects in this treatment always met with failure 
on the three experimental games due to the pre- 
arranged nature of the tasks. In addition, subjects 
were exposed to verbal disapproval, in the form of 
the following statements, dispensed in random 
order after every ninth response: “You really don’t 
play [name of game] very well, do you?” “You're 
not doing very well,” “You arent getting any 
right,” and “You don’t know how to play this game 
very well.” During the painting activity, the ex- 
perimenter made three statements of disapproval. 
These statements consisted of brief but appropriate 
criticisms of the drawings, such as, “The house 
doesn’t have any window,” “That’s the wrong color | 
for a tree,” “The man has no arms,” etc. (After the 
administration of the criterion task, subjects im 
this treatment were asked to play the preliminary 
games again, at which time they were lavishly praised 
for their performance. This procedure was carried 
out in an attempt to reduce any negative feelings 
which were not dissipated during the administra- 
tion of the criterion task.) k 

Subjects in the C group received no preliminary 
interaction and were given the criterion task im- 
mediately. 7 

Delay conditions. At the end of the 20-minute 
preliminary period, P and N subjects in the I cons 
dition were introduced to Part I of the criterion 
task. Those in the D condition were returned to 
their rooms. One week later, these subjects again 
met with the experimenter, who told them, 


Hello . You remember me, don’t you? 
Im Mr. Berkowitz. We're going to play some 
more games today. 


the experi- 


These subjects then were brought to ie, 
act 


mental room and introduced to Part I of thi 
terion task. 

Criterion task. All subjects were introduced to 
the criterion task in the same manner and receive 
identical instructions and reinforcement. They were 
told: 


a 
This is a game we call Marble-in-the-Hole. 
tell you how to play it. You see these marb i7 i 
[the experimenter points to lower opening]. oom 
of them are green and some are yellow. They a 
in these holes [the experimenter points to upp! 
two holes]. The green ones go in this hole, an 
the yelow ones in this one [the experimen 
points to the appropriate holes]. Now you i. 
me a green marble. Put it in the hole it goes 
Now show me a yellow marble. Put it in the 
it goes in, You can put as many marble ip 
holes as you want to. You can stop when A 
you want to. You tell me when to stop. Rem? 
ber, when you want to stop, just tell me- 
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After every tenth marble was inserted, the experi- 
‘enter randomly made one of the following four 
‘statements: “You really know how to play this 
“game,” “That’s fine,” “You really know how to 
play Marble-in-the-Hole, don’t you?” and “Very 
good.” He also smiled and nodded at the subject 
throughout the game. The subject played until he 
“indicated that he wished to stop, either by telling 
the experimenter that he wished to do so or by not 
“inserting a marble for 30 seconds. At this point, the 
“experimenter instituted Part II of the game, saying: 


Now I'll tell you how to play this game. This 
time we put the yellow marbles in this hole and 
the green marbles in this one [opposite holes to 
those employed in Part I]. Put a yellow marble 
where it goes. Now put a green marble where it 
goes. You can put as many marbles in the holes 
as you want to. You can stop whenever you 
want to. You tell me when you want to stop. 
Remember, when you want to stop, just tell me. 

» OK? Ready, begin. 

F The same reinforcers and schedule employed dur- 
ing Part I were used during Part II. The subject 
was allowed to continue until he told the experi- 
menter that he wished to stop or did not insert a 
marble for 30 seconds. 


RESULTS 
As în earlier studies (Shallenberger & Zig- 
ler, 1961; Zigler, 1961; Zigler, Hodgden, & 
Stevenson, 1958) the total time (Part I+ 
Part II) spent playing Marble-in-the-Hole 
Was used as the measure of reinforcer effec- 
tiveness. The mean scores of the five groups 


are presented in Figure 1. 


| 


aal 
2 


The effects of experimental treatments, 
delay conditions, sex of subject, and the in- 
teractions were assessed in a 2 X 2 X 2 fac- 
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MEAN TIME IN SECONDS 


POS NEG 
DELAY 


POS NEG 
IMMEDIATE 


NO 
CONTACT 


Fig, 3 
1. Mean time played on game by each group. 
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TABLE 1 


Raw AND TRANSFORMED MEAN PLAYING 
TIMES IN SECONDS 


Logarithmic 


Raw scores transformation 
Group $ IOTER 

M SD M SD 
Positive delay 493 | 403 | 2.5854 | .3251 
Negative delay 155 | 165 | 1.9869 | .4357 
Positive immediate 337 | 305 | 2.4501 | .4001 
Negative immediate | 579 | 498 2.5799 | .4474 
No contact 332 | 361 | 2.3579 | .4649 


torial analysis of variance which excluded 
the C group. Since the means and variances 
of the scores were found to be correlated, 
a logarithmic transformation was performed 
for this analysis. The raw and transformed 
scores are presented in Table 1. Significant 
treatment (F = 4.84, df=1/56, p< 05) 
and delay (F = 5.07, df= 1/56, p < .05) 
main effects were found as well as a signifi- 
cant Treatment X Delay interaction (F= 
12.24, df = 1/56, p < .005). The sex main 
effect and all other interactions resulted in 
Fs <1. 

As can be seen in Figure 1, the significant 
Treatment X Delay interaction primarily re- 
flects the shorter playing time of N subjects 
in the D as compared to the I condition, 
Furthermore, an examination of the differ- 
ences between the four groups by means of 
individual £ tests disclosed that the main 
effects found in the analysis of variance are 
for the most part attributable to the rela- 
tively brief time scores of subjects in the ND 
groups. 

The valence point of view would generate 
the prediction that in the D condition, play- 
ing time would be longest in the P, shortest 
in the N, and intermediate in the C treat- 
ments, An analysis of linear trend performed 
on the log-time scores of these three groups 
confirmed this prediction (F = 16.81, df= 
1/45, p < .005). Individual ¢ tests revealed 
that the PD and ND groups differed signifi- 
cantly (f= 4.10, p < .005), the C and ND 
groups differed significantly (f= 2.54, p < 


(01), and the PD and C groups differed at a 
borderline level of significance (t= 1.56, P 
< .10). 
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Differences in the time scores of the C, PI, 
and NI groups were assessed by individual 
t tests, Differences at a borderline level of 
significance were found between the NI and 
PI groups (¢ = 1.66, p < .10) and between 
the NI and C groups (¢=1.61, p< .10). 
No significant difference was found between 
the PI and C groups (¢ < 1.00). 

Earlier studies (McCoy & Zigler, 1965; 
Shallenberger & Zigler, 1961) have employed 
a difference score (time played on Part II — 
time played on Part I of Marble-in-the-Hole) 
as a measure of the subject’s wariness of the 
experimenter, with greater wariness being 
reflected in a larger score. Employing the PI, 
NI, PD, and ND groups, a Treatments X 
Conditions X Sex of Subject analysis of vari- 
ance revealed no significant effects (overall 
F < 1.00) on this measure corrected for total 
amount of time played. The earlier studies 
have suggested that a clearer indicator that 
a child is reluctant to interact with the ex- 
perimenter is his playing Part II of the game 
longer than Part I. Eight children in the P 
groups and 12 children in the N groups 
showed such a pattern; although in the ex- 
pected direction, this difference is not a sig- 
nificant one (x? = 1.16, p< 30). It is of 
some interest to note that the percentage of 
children in the C group (31%) showing such 
a pattern falls midway between the percent- 
ages found in the P (25%) and N (37%) 
groups, and that the effect was most pro- 
nounced in the ND group (44%). 

A correlation between individual playing 
times on the two parts of the game proved to 
be significant (r = .72, p < .005). As in ear- 
lier studies, the total time spent on the game 
was highly related to the total number of 
marbles inserted (r = .95), 


Discussion 


The present study clearly indicates that 
given a 1-week delay, the type of relation- 
ship established between an adult and a child 
influences the adult’s effectiveness as a posi- 
tive reinforcing agent. The finding that rela- 
tive to a no-contact condition, a positive re- 
lationship enhanced the adult’s effectiveness 
= while a negative relationship attenuated it, is 
’ consistent with the valence position and find- 

ings of McCoy and Zigler (1965). It should 
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be noted that the absence of a negative con- 
dition in the McCoy and Zigler study made | 
it difficult for these investigators to deter. 
mine whether the differences between treat- 
ments were due to changes in both positive, 
and negative tendencies or to changes in posi- 
tive tendencies alone. The extremely attenu- 
ated performance of the ND children in the 
present study provides evidence of the effects 
of an experimentally strengthened negative f 
tendency. It is surprising that so little atten- 
tion has been paid in the social reinforce- 
ment literature to the negative aspects of 
adults as reinforcers, since it is quite appar- 
ent that the history of children is such that 
adults have been frequently paired with nega- 
tive as well as positive reinforcers. 

The general results of the present study” 
lend support to the view that the inconsist- 
ency between the findings of McCoy and 
Zigler and earlier investigators was due to 
the delay variable, In the delay condition, the 
positive treatment resulted in significantly 
longer playing times than the negative treat- 
ment, while the reverse was true (at a border- 
line level of significance) in the immediate 
condition. A 

The children’s performance in the immedi 
ate condition was consistent with certam 
earlier findings, and inconsistent with others. 
The tendency for the negative treatment to 
result in longer playing times than the post- 
tive treatment is reminiscent of the findings 
of Shallenberger and Zigler (1961) and St 
venson and Snyder (1960). One explanation 
of this finding may be derived from the coi 
tention of Walters and Ray (1960) that | 
social reinforcer effectiveness scores are posk 
tively correlated with the level of anxiety ™ 
duced in the child. In respect to the present 
study, this explanation would involve 
assumption that the children’s reaction to 
their anxiety was to persist in playing i 
game rather than simply to quit and retut! 
to their classrooms. The ease with vo 
children terminated play in the ND cont | 
tion attests to the implausibility of this E 
sumption. As noted by earlier investigato | 
(Shallenberger & Zigler, 1961; Stevens? 
Snyder, 1960) it appears that the enhance 
effectiveness of social reinforcers give? ia t 
mediately after negative social contact 15 
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to a contrast effect in which positive rein- 
forcers subsequently acquire a greater value 
for the child. 
The satiation position generates the hy- 
“pothesis that in the immediate condition 
positive preliminary contact would result in 
decreasing reinforcer effectiveness beneath 
that found in the no-contact condition. The 
failure to obtain support for this hypothesis 
is somewhat perplexing in light of the positive 
findings of Gewirtz and Baer (1958). The 
failure to replicate the findings of Gewirtz 
and Baer may have been due to differences 
in the dependent measures of the two studies 
as well as to differences in the absolute 
amount of social reinforcement received by 
the child during the preliminary contact. In 
‘respect to the dependent measures, Gewirtz 
and Baer employed a simple learning meas- 
ure while a duration measure was utilized in 
the present study. Several investigators have 
noted that the several measures that have 
-been employed to gauge social reinforcer ef- 
fectiveness are not functionally equivalent 
(Gewhttz, 1954; Patterson & Hinsey, in press; 
Zigler, 1963, 1964). It is conceivable that 
the satiation manipulations employed in the 
social reinforcement studies differentially af- 
fect the length of time that the child will 
maintain his interaction with the experi- 
menter as opposed to his performing particu- 
lar responses in order to elicit the experi- 
Menter’s praise. 
The present study certainly calls into 
question the position that social reinforce- 
Ment effects are wholly explainable in terms 
of Such drive operations as the short-term 
deprivation and satiation of social reinforcers. 
Within such a position the child is viewed as 
4 passive recipient of social reinforcers that 
_nexorably shape his behavior. The present 
ae, Suggests that the child’s attitude toward 
tl € reinforcing agent must be viewed as medi- 
a the effectiveness of the social reinforcer 
ea dispensed on the one hand and deter- 
_ Mig responses emitted on the other. 


f 
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SELECTED PERSONALITY CORRELATES OF MODE 
OF CONFLICT RESOLUTION * 


EUGENE L. RINGUETTE* 
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The general hypothesis underlying the present study was that meaningful 
relationships exist between modes of conflict resolution and personality char- 
acteristics. Using a motor conflict apparatus, Ss were presented with conflict 
situations and grouped according to th 
These groups were then compared in terms of diagnosis, reaction time, and 
scores on the Barron Es and Welsh A and R scales. Statistically significant 
differences obtained with respect to diagnosis, reaction time, and scores on 
the Es scale were judged to represent strong support for the initial hypothesis. 


Conflict has enjoyed wide use as a theo- 
retical concept and has also been the focus of 
considerable experimental investigation. Gen- 
eral types of conflict have been defined (Hov- 
land & Sears, 1938; Lewin, 1935), and re- 
sponses to experimental conflict situations 
have been categorized (Hovland & Sears, 
1938). In addition, well-grounded formula- 
tions regarding the nature and dynamics of 
conflict have been advanced (Lewin, 1935; 
Miller, 1944). Little has been established, 
however, with regard to the organismic fac- 
tors related to response to conflict situations. 
The general hypothesis underlying the present 
study was that meaningful relationships exist 
between modes of conflict resolution and per- 
sonality characteristics. In an effort to assess 
this hypothesis, subjects were presented with 
conflict situations and were grouped accord- 
ing to their modes of resolving these conflicts. 
These groups were then compared with re- 
gard to their scores on several personality 
measures. 


1The author gratefully acknowledges the con- 
structive help of his academic committee, chaired 
by John M. Hadley. Gratitude is also expressed to 
the Veterans Administration Hospital, Indianapolis, 
Indiana, and to Carl Wright, Chief Psychologist, for 
aid in securing hospitalized subjects, Finally, spe- 
cial gratitude is extended to the members of the 
American Legion in the Indianapolis area, Joe Cas- 
sidy and Kenneth Trueblood in particular, for their 
helpful cooperation in securing a comparison group. 

*Now at the Neuropsychiatric Institute, Univer- 
sity of California, Los Angeles, Center for the 
Health Sciences. 
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eir modes of resolving these conflicts. 


METHOD 


Both hospitalized and nonhospitalized subjects 
were used in the study. The former group was com 
posed of 79 male patients from the Neuropsychiatric 
Service of a Veterans Administration General Medi- 
cal and Surgical Hospital. These subjects were taken | 
from the admission ward of the hospital, and no 
selection criterion other than their ability to com 
plete the experimental procedure was employed. | 
The mean age of the hospitalized group was 386 
years, and the mean number of years of education 
was 10.9 years. The 30 nonhospitalized subjects 
were male veterans who never had been hospital | 
ized for psychiatric reasons and were recruited from | 
a veterans’ organization and from among employee 
on other services of the hospital at which the study 
was conducted. The mean age of these subjects wi 
36.5 years, and the mean number of years of edu- 
cation was 11.6 years. The differences between 
groups on the variables of age and education welt | 
not statistically significant. 

The motor conflict apparatus introduced by Hov 
land and Sears (1938) was used to present the fow 
classical conflict situations to the subjects. The ap 
paratus was equipped with two sets of lights. j 
light and a green light were set into the board e 
the left side, and a red light and a green light m 
set into the board on the right side. A contro! 
boatd having a switch for each of the lights ant 
master switch enabled the experimenter to activa 
the lights in any simultaneous combination. 
standard 6-volt chronometer was wired to i 
cuit so that it was activated and deactivated W 
the lights on the board by closing the circuit ig 
the master switch. A roll of 5-inch white ai 
tape was fed across the apparatus below the W 
sets of lights. The subject’s task was to draw 2 a 
with a pencil from a notch at the bottom of tt 
board up toward the lights at the top- F 

The conflict situations were defined as follow’: 


= 


k re" 
Type I (approach-approach)—a green light PI 
sented simultaneously on both sides. i 


, in 


Mope oF ConFLict RESOLUTION 
. 


Type II (approach-avoidance)—a red light and 
a green light presented simultaneously on the same 
side. 

Type III (avoidance-avoidance)—a red light pre- 
sented simultaneously on both sides, 

-> Type IV (double approach-avoidance)—all four 
lights presented simultaneously. 


Each subject was given the following instructions: 


As I mentioned before, this is a study in prob- 
lem solving. You'll notice that there are two sets 
of lights at the top of the board in front of you— 
one on the left side and one on the right. In each 
set there’s a red light and a green light. I can 
turn these lights on from over here. As we go 
along, I'll be doing that and also recording times 
from this timer. Because I'll be doing these things, 
„I won't be able to watch you very easily so I’ve 
put paper in the problem board in order that, later, 
I can see what everyone has done. 

Im going to give you two basic instructions 
regarding what you should do in response to the 
lights, First, when a green light comes on, I want 
you to draw a line, starting from this notch, as 
quickly as possible up to the side where that 
green light is. Second, when a red light comes on 
I want you to draw a line as quickly as possible 
up to the side away from that red light. In other 
words, the basic idea is to go toward the side 
whéte a green light is but away from the side 
where a red light is. 

Now, just for practice, put your pencil in the 
notch and draw a line up to one set of lights or 
the other; it doesn’t matter where. 

Fine. Now as we go along, there will be times 
when you will have to use your judgment as to 
what you should do. When we're all through, I'll 
be glad to answer any questions that Pm able to. 


The subjects’ responses to the conflict situations 
were classified into four modes of resolution. The 
descriptive terms assigned to these modés and their 
definitions were as follows: 

A Compromise—defined by the subject’s drawing a 
ine from the notch at the base of the board up the 
middle between the two sets of lights. 

Arbitrary—defined as a line drawn directly to one 
or the other set of lights. 
no duivocation—defined as lines drawn from the 
otch to both sets of lights. 

Blocking—defined as no response with the pencil 
to the lights. 

P previous study (Patton, 1952) indicated that 
pasties trials were optimal with regard to the 
ia fatigue and the establishment of the ap- 
this g and avoidance responses to the lights. In 
ea: each subject was given 16 practice trials 
` count i color of light and the side of presentation 
ieee alanced. Following this, each type of con- 
te presented four times for a total, includ- 

Bite Th 16 practice trials, of 32 trials for each sub- 
-type f e types of conflict were ordered so that each 
ollowed every other type and itself an equal 
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number of times, and the same order was followed 
with each subject. The latency of response was re- 
corded on each trial, and the mode of resolution 
was later recorded from the paper tape in accord- 
ance with the definitions noted above. 

Each subject was assigned to a mode of resolu- 
tion group according to the mode which he adopted 
on more than half of the conflict trials. For exam- 
ple, a subject drawing a line up the center of the 
board on 9 of the 16 conflict trials would be as- 
signed to the compromise group. Of the total num- 
ber of subjects completing the procedure, 11 could 
not be classified in this manner and were excluded 
from the study. 

In addition to reaction time, which previous re- 
search indicates is a reliable measure of disruption 
in the conflict situation (Arkoff, 1957; Atthowe, 
1960), the Barron Ego Strength (Es) scale (Bar- 
ron, 1956), diagnosis, and the Welsh A and R scales 
(Welsh, 1956) were included as measures for com- 
parison of the mode of resolution groups. The Es 
scale was chosen as the primary measure because of 
its theoretical relevance to conflict resolution and 
because it represents a continuum presumably ap- 
plicable to every individual in the population. Diag- 
nosis was chosen as a rough measure of general 
behavioral patterns, and the anxiety and repression 
measures were utilized because of the general im- 
portance assigned to them in clinical thinking. 

The MMPI, from which the Es, A, and R scores 
were derived, was administered routinely to pa- 
tients entering the hospital. Nonhospitalized sub- 
jects completed the experimental procedure, includ- 
ing both the MMPI and the conflict apparatus, 
outside the hospital usually in space made available 
at the club rooms of the organization from which 
they were recruited. It was judged that the experi- 
mental conditions were generally similar for all sub- 


jects. 
RESULTS 


For the purpose of analyzing the data, sub- 
jects were grouped according to two factors, 
mode of conflict resolution and level of emo- 
tional integration. As described above, there 
were four mode of resolution groups which 
were assigned the descriptive terms com- 
promise, equivocation, arbitrary, and block- 
ing, in accordance with the predominant mode 
of resolution adopted by the subjects com- 
prising each group. Another grouping was 
based on levels of emotional integration. 
These were characterized as hospitalized 
psychotic (HP), hospitalized nonpsychotic 
(HN), and nonhospitalized (NH). Assign- 
ment to these groups was based on psychi- 
atric diagnosis. Five psychiatrists contrib- 
uted a total of 79 diagnoses, and a nonsig- 
nificant chi-square indicated that diagnostic 
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TABLE 1 


AND ExpECTED (E) FREQUENCIES OF SUBJECTS CLASSIFIED AccoRDING TO MODE oF 
CONFLICT RESOLUTION AND LEVEL OF EMOTIONAL INTEGRATION 


OxsErvED (O) 


EucenE L. RINGUETTE 


Mode of resolution 


Level of 


integration Compromise Arbitrary Equivocation Blocking Total 
o E o E o E o E 

HP 7 6.8 12 11.9 4 7.7 8 4.6 31 

HN 11 10.6 15 18.5 19 11.9 3 7.0 48 

NH 6 6.6 15 11.6 4 7.4 5 44 30 

Total 24 42 27 16 109 


Note.—x? = 14.54, p <.05. 


bias was not operating with regard to assign- 
ment to the level of emotional integration 
groups. It was also determined that the co- 
efficients of correlation relating age and edu- 
cation to the measures used were not sta- 
tistically significant. 

Table 1 presents the observed and expected 
frequencies for subjects grouped according to 
mode of conflict resolution and level of emo- 
tional integration. 

Table 2 presents the analysis of variance 
summary of Es scores of the mode of resolu- 
tion and level of emotional integration 
groups. It was necessary to reduce the num- 
ber of levels of emotional integration from 
three to two, that is, hospitalized and non- 
hospitalized, because the F ratio obtained 
from a 3 X 4 factorial design with unequal 
n’s would have been unreliable (Winer, 1962). 

The mean Es scores of the compromise, 
blocking, arbitrary, and equivocation groups 
were 42.05, 40.45, 36.29, and 34.50, respec- 
tively. Application of the ¢ test indicated that 


TABLE 2 


ANALYSIS OF VARIANCE SUMMARY OF SCORES ON THE 
Eco STRENGTH SCALE OF SUBJECTS CLASSIFIED 
ACCORDING TO LEVEL OF EMOTIONAL INTE- 
GRATION AND MODE OF CONFLICT 


RESOLUTION 
Source SS df MS F 

Emotional integra- ‘ 

tion (A) 2,902,686] 1 | 2,902.686 | 61.02* 
Mode of resolution 

(B) 709.592} 3| 236.530} 4.98* 
AXB 211.900} 3] 70.633| 1.48 
Error 4,804.608| 101 | 47.570 

*p <.05. 


the differences between the means of the 
compromise and arbitrary, compromise and 
equivocation, and blocking and equivocation f, 
groups were statistically significant. 

An analysis of variance of scores on the | 
Welsh A scale of the mode of resolution and | 
level of emotional integration groups indi- 
cated that the only significant difference on 
the anxiety measure was between hospitalized 
and nonhospitalized subjects, with the, hos- | 
pitalized subjects scoring higher on the Ay 
scale, No significant differences were obtained | 
with regard to scores on the R scale. 

The Kruskal-Wallis one-way analysis of 
variance by ranks (Siegel, 1956) was used to f 
test for differences among the mode of resolu- | 
tion and level of emotional integration groups 
on the variable of reaction time. Because, the 
mean reaction times for subjects adopting the | 
blocking mode of resolution were distorte 
by the fact of their not responding, these sub- 
jects were excluded from the analyses involv- 
ing reaction time. Hence the total V was © 
duced from 109 to 93 for these analyses. The 
H value obtained for the differences among 
the mode of resolution groups was 65.38 an 
that for the level of integration groups Wai 
55.13, both of which exceed the value of 5.2) 
which represents significance at the .05 level 

The median ranks of the level of emotion? 
integration groups on the reaction-time a 
ure were: HP = 68, HN = 49, NH = os 
Application of the Mann-Whitney U t 
(Siegel, 1956) indicated that the difference 
between the HP and HN, and HP and , 
groups were statistically significant. The E 
dian ranks of the mode of resolution groui 
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TABLE 3 
FREQUENCIES OF THE MODES OF RESOLUTION ON ALL Trrars oF Each Type OF CONFLICT 


Type of conflict 
I II Il IV 
Mode of 
resolution 
Trials Trials Trials Trials 

1 air ens 4 1 | 2 | 3 4 | 1 2 3 4 1 2 3 4 

(8 2 15 17 | 14 4 j| 10 8 9 | 20 | 26 | 27 | 26 | 25 | 25 | 25 | 25 

A 74 | 54 | 52 | 51 | 74 | 64 |, 64 | 66 | 45 | 37 | 35 | 39 | 43 38 | 40 | 38 

E 28 | 27 | 31 | 31 | 24 | 22 | 24 | 23 | 31 28 | 27 | 26 | 28 | 26 | 25 | 27 

B 5 13 9 | 13 7 13 | 13 | 11] 13 | 18 | 20 | 18 | 13 | 20 19 | 19 


on the reaction time measure were: com- 
promise = 69, arbitrary = 55.5, equivocation 


` = 38. Application of the Mann-Whitney U 


test indicated that the differences between 
the compromise and equivocation, and arbi- 
trary and equivocation groups were statisti- 
cally significant. 

The type of conflict situation eliciting the 
longest mean reaction time for each subject 
was noted. The number of subjects exhibiting 
the longest mean reaction time for each of 
the conflict types was as follows: Type I= 
12, Type II = 15, Type MI = 30, Type IV 
= 36, The chi-square obtained for these dif- 
ferences was 17.53 which exceeds the critical 
value of 9.84 representing significance at the 
05 level. 

In order to compare the frequency of oc- 
currence of the modes of resolution in re- 
Sponse to each conflict type, it was necessary 
to choose one of the four trials on each type 
of conflict as representative of a subject’s re- 
Sponse to that type, because including all four 
Tesponses to each type would inflate the sam- 
ple # and invalidate a chi-square. Table 3 
Presents the frequencies of the modes of 


: resolution on each trial of each conflict type. 


Although there is a relative similarity, it 
ae clear that the subjects’ responses on 
tial 1 of each type are not representative 


s O 7 . 
f their responses on subsequent trials; more- 


Over, the discrepancy is greater for types of 


_ lesser difficulty than for those of greater diff- 


am Because Trials 2, 3, and 4 are more 
X Presentative of the subjects’ responses, Trial 
Was chosen to represent the subjects’ re- 


- Sponses to the conflict types. 


The relative frequencies of the four modes 
of resolution on Trial 4 of each type of con- 
flict were tabulated and are shown in Table 4. 
The chi-square obtained for these differences 
was 27.04 which exceeds the critical value of 
21.67 representing significance at the .01 
level. 

Because the types of conflict, in addition 
to the mode of resolution groups and level of 
emotional integration groups, differed on the 
variables of reaction time and relative fre- 
quencies of the modes of resolution, chi- 
square was used to test whether these dif- 
ferences were independent of one another. 
Using the conflict type eliciting the highest 
mean reaction time of each subject (again 
omitting the blocking group), the mode of 
resolution groups and types of conflict were 
cross-tabulated. The chi-square obtained for 
this tabulation was 1.52 which falls short of 
the 12.59 needed for significance at the .05 
level. The same procedure was followed in 
comparing the level of integration groups 


TABLE 4 


FREQUENCIES OF THE MopEs or RESOLUTION ON 
TRIAL 4 OF THE TYPES OF CONFLICT 


Type of conflict 


Mode of Total 

resolution 
Il ll IV 

Cc 14 9 26 25 74 

A 51 66 39 38 194 

E 31 23 26 27 107 

B 13 11 18 19 61 

Total | 109 109 109 109 436 


Note—x? = 27.04, p < .01. 
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with regard to type of conflict eliciting the 
highest mean reaction time for each subject. 
The chi-square obtained for this tabulation 
was 4.60 which again falls short of the criti- 
cal value of 12.59. 

Finally, treating each mode of resolution 
separately, the relative frequencies of the 
modes of resolution among the level of inte- 
gration groups for each type of conflict were 
tabulated, that is, the differences previously 
found among the level of integration groups 
and among types of conflict on the variable 
of mode of resolution were tested for inde- 
pendence. Each mode of resolution was tested 
separately, and the chi-squares obtained 
were: compromise = 3.21, arbitrary = 1.98, 
equivocation = 1.14, blocking = 1.95. All of 
these values fall short of the critical value of 
12.59 which represents significance at the .05 
level. 

In summary, the results indicate the fol- 
lowing: 

1. The types of conflict differ in terms of 
the reaction times of the subjects and in terms 
of the relative frequencies of the modes of 
resolution. 

2. The mode of resolution groups, that is, 
groups formed on the basis of the subject’s 
predominant mode of response on the 16 con- 
flict trials, differ on the variables of Es score, 
reaction time, and level of emotional integra- 
tion, and the differences in reaction time are 
independent of type of conflict. 

3. The level of emotional integration 
groups, in addition to the differences in mode 
of conflict resolution indicated above, also 
differ on the variables of Es score and reac- 
tion time, and the differences in mode of con- 
flict resolution and reaction time are inde- 
pendent of type of conflict. 


Discussion 


Sears and Hovland (1941) reported that 
the proportion of blocking was about five 
times as great in the Type III conflict situa- 
tion as in the Type I. Arkoff (1957) found 
reaction time to be significantly longer in 
the Type III than in the Type I situation. 
Brown’s (1942) study, using rats as subjects, 
indicated that Type I conflict produced the 
least withdrawal behavior while the Type III 
conflict produced the most. 
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The present results seem consistent with 
the results of these previous studies. The | 
types of conflict differed in difficulty, as meas- 
ured by the subjects’ highest mean reaction 
times, and the Type III situation elicited the . 
highest mean reaction time significantly more 
often than did the Type I. In addition, the 
frequencies of the modes of resolution varied 
as a function of conflict type, with the com- 
promise and blocking responses tending to in- 
crease and the arbitrary responses tending to 
decrease in the Type III as compared with 
the Type I situation. In contrast to this, the 
number of equivocation responses is quite 
stable over conflict types. With regard both 
to number of highest mean reaction times 
elicited (see p. 509) and frequencies of the 
modes of resolution (Table 4), however, the 
differences among the types of conflict are 
not equal. Types I and II are very similar 
with respect to these variables as are Types 
III and IV, so that the great share of the dif- 
ferences found are accounted for by the dif- 
ference between Types I and IT pooled and 
Types III and IV pooled. Thus, althougn the 
terms, approach-approach, approach-avoid- 
ance, avoidance-avoidance, and double ap 
proach-avoidance are certainly of descriptive 
value, their operational value seems to be that 
they represent situations of differing intensity 


or difficulty. The fact that the differences be- | 


tween the mode of resolution groups and level 
of integration groups, on the one hand, and 
the types of conflict, on the other, appeat to 
be independent or noninteractive also does not 
suggest any qualitative differences among the 
types of conflict. If, for example, the hos- 
pitalized psychotics had shown a greater 
tendency toward disruption or blocking 1 
one conflict situation as compared to th 
other subjects, a qualitative difference among 
the types would be suggested. As this iS not 
the case, it seems reasonable to think of con 
flict in a generic sense, at least for the PU 
pose of the present study, with the oe 
representing different intensities rather tha 
kinds of conflict. ne 
Considered in this way, the results of : 
comparisons of the mode of resolution greri l 
are judged to offer ample support for a 
general hypothesis initially advanced that 
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ferences in mode of conflict resolution are re- 
lated to measurable personality differences. 
In spite of this evidence indicating that 
mode of conflict resolution is a significant 
personality variable, the nature of this vari- 
able is not clear in light of some apparent 
contradictions in the data obtained. Making 
logical value judgments about the measures 
on which the mode of resolution groups dif- 
fered, it is submitted that high level of emo- 
tional integration, high ego strength, and 
little disruption in the conflict situations 
would all have positive connotations in terms 
of personality. The obtained data suggest, 
however, that those subjects exhibiting the 
‘east ego strength, that is, the equivocation 
group, also exhibited the least disruption in 
the conflict situations and tended to be non- 
psychotic rather than psychotic. The con- 
verse relationship also was indicated in that 
the compromise and blocking groups, which 
displayed the highest ego strength, were ap- 
parently most disrupted by the conflict situa- 
tions and contained a greater proportion of 
psythotic subjects. In view of this, it may be 
reasonable to question the adequacy of the 
measures or of the interpretations placed 
upon them; however, some alternative con- 
siderations seem pertinent. As a result of 
their studies, Hovland and Sears (1938) con- 
cluded that the blocking and compromise 
responses are essentially related as are the 
arbitrary and equivocation responses. They 
Suggested that the former responses were Te- 
lated in that they both constituted withdrawal 
from the field. If, instead, the relationships 
between blocking and compromise on the one 
hand and arbitrary and equivocation on the 
other are interpreted in terms of some con- 
cept such as “patterns of defense” (as sug- 
gested by Miller & Swanson, 1956) or “toler- 
ance of ambiguity,” the results obtained with 
the measures in this study may be quite con- 
sistent, Bearing in mind that there is no ob- 
Jectively correct response to the conflict situa- 
tions presented to the subjects, it might be 
Said that the compromise and blocking re- 


- Sponses show more appropriate cognizance of 


the essential nature of the situations than 
z e arbitrary and equivocation responses. 

hus, it would seem that some perceptual or 
Cognitive factor, other than the objective fact 
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of the conflict, is an important determinant of 
the response made to a conflict situation. 

Another pertinent consideration is that mo- 
tivational variables have a complex, rather 
than direct, relationship to response at vary- 2 
ing intensities of conflict (Samelson, 1957). 
Thus, the behavior of a given individual in 
a conflict situation probably does not vary 
directly with the intensity of conflict because 
of the influence of motivational and other 
personality factors. If, for example, ego 
strength is an important determinant of ef- 
fectiveness in conflict situations, the relative 
effectiveness of the equivocation and arbi- 
trary groups, as measured by reaction time, 
may be an illusion in that they essentially 
avoid or are insensitive to the conflictual as- 
pects of the low intensity situations used in 
this study. If placed in a situation of suffi- 
cient intensity to demand recognition of the 
conflict, they might show greater and more 
sudden disorganization than the groups char- 
acterized by higher Es scores, but greater 
sensitivity to conflict. 
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Analysis of 
spatial direc! 


Ss’ performance on linear syllogisms and of their assignment of 
tions to nonspatial ordering relations led to the conclusion that 


thinking about abstract orderings depends on the construction of spatial 
representations. In making these constructions, people proceed more readily 


in a downward than in an upward direction, 


and in a rightward than in a 


leftward direction. They also end-anchor. These principles account for the 


results of otherwise inexplicable variation in 


difficulty among linear syllogisms. 


Some ordering relations (“better” and “worse”) have a fixed tie to a vertical 
axis in people’s cognitive space, whereas other relations (“lighter” and “darker”) 
are not so tied. Detailed consequences of such differences were predicted and 


observed in reasoning tasks. Implications for social 


particularly concerning the previously 
and the indications th 
figure. 


Mickey Mantle tells of an exhibition base- 
ball game in which he and Willie Mays, both 
superstars, were playing. Unhappily, the two 

_ Superstars were having a bad day. Finally a 
leather-lunged fan shouted at Mantle, “I 
came to see which of you guys was better, 
you or Mays. Instead, I’m seeing which is 
worse!” 

Mantle knew he had been insulted. But 
wherein was the insult? It could be argued 
that for a rational investigator the two ques- 
tions would be strictly equivalent. The dis- 
covery that A is better than B seems identical 
to the discovery that B is worse than A. Yet 
it is clear that the antonyms do not inter- 
change freely, that “better” somehow con- 
notes general goodness, whereas “worse” con- 
notes general badness. The words convey 
more than strictly relational or comparative 
meaning; they say something about absolute 
Positions, 

a Such paralogical implications of “better” and 

. “worse” have significance beyond their use in 
taunting ballplayers. Comparisons may be 
odious, but they abound in people’s descrip- 
tion of their world, particularly their social 
World. Statements like “Lincoln was greater 
than Washington” and “Sue is better than 

ep ary” are the very stuff of social reasoning. 
pepe have in fact a fateful predilection for 
near orderings. They are prone to place ele- 


Gime work was supported in part by Grant NSF- 
25 from the National Science Foundation. 


at the linear ordering is a preeminent cognitive 


cognition were discussed, 
single orderings 


observed predilection for 
good 


ments in linear orderings to the exclusion of 
other structures and they handle linear or- 
derings more facilely than most other struc- 
tures (Coombs, Raiffa, & Thrall, 1954; De 
Soto, 1960). But it would be a mistake to 
dismiss the popularity of linear orderings by 
saying they are easy or simple. For structures 
are not intrinsically easy; they are easy only 
as people’s cognitive apparatuses or schemas 
make them so (De Soto, 1960, 1961; Inhelder 
& Piaget, 1958; Mandler & Cowan, 1962). 
People evidently have apparatuses which en- 
able them for the most part to handle linear 
orderings efficiently. Their apparatuses are 
not necessarily the logician’s, however. On 
the contrary, anecdotes like Mantle’s provide 
evidence that they are more paralogical than 
logical. And if there is to be hope of under- 
standing and modifying the predilection for 
linear orderings, psychology must study the 
paralogic, not the logic, of linear orderings. 
This article reports three investigations of 
people’s thinking about social orderings, the 
first a rather thorough experimental analysis 
of their reasoning about orderings formed of 
the relations “better” and “worse,” the sec- 
ond a study of tendencies to assign spatial 
directions to nonspatial ordering relations like 
“better? and “worse,” and the third an ex- 
perimental analysis of reasoning with addi- 
tional ordering relations, testing predictions 
derived from the first and second investiga- 
tions, Throughout, it was taken for granted 
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that people do not always operate as logic 
machines, and the aim was not to make that 
negative point—so often made in past .re- 
search on reasoning—but to uncover the laws 
or mechanisms that account for their depar- 
tures from strict logic. 


Tue Paratocic or “BETTER” AND “WORSE” 


Experiment I 


The previous experiments that come closest 
to an attack on the paralogic of ordering rela- 
tions are those concerned with what has been 
called the linear syllogism (Woodworth & 
Schlosberg, 1954, p. 843). The linear syllo- 
gism differs from the traditional syllogistic 
forms in that the relations between elements 
are relations of order rather than of inclusion 
or implication. For example: 


Premise 1: Mantle is better than Mays. 

Premise 2: Mays is better than Moskowitz. 

Conclusion: Mantle is better than Mosko- 
witz. 


Experiments in which subjects were given 
various valid and invalid linear syllogisms to 
evaluate have established that such syllogisms 
vary in difficulty. Unfortunately these experi- 
ments were not systematic enough to pinpoint 
the subjects’ paralogic. They led to one gen- 
eral conclusion: that a linear syllogism is 
easy to evaluate when the ordering of the 
elements is maintained as they are presented 
in the premises. The example above fulfills 
this requirement by giving first the man at 
one extreme (Mantle), then the man in the 
middle (Mays), and finally the man at the 
other extreme (Moskowitz). If the premises 
were switched, or if Premise 1 was modified 
to read, “Mays is worse than Mantle,” the 
requirement would be violated and the syllo- 
gism should become difficult. This conclusion 
seems almost self-evident, but the experiment 
to be reported here demonstrates that it is in 
fact wrong. 

The trouble with the earlier experiments, 
the reason that they gave misleading results, 
was that they did not vary systematically and 
fully the linear syllogisms presented to the 
subjects, but instead used a small and inade- 
quate sampling of them. Given an ordering 
of three elements, A, B, and C, with A the 
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best, B in the middle, and C the worst, there 
are eight ways of presenting the ordering in 
syllogistic form, eight premise combinations, 
as shown in Table 1. In the present experi- 
ment, all eight were used, along with eight |, 
premise combinations which do not yield 
orderings. 


Method. The syllogisms were printed on cards for 
presentation to the subjects, with men’s first names 
replacing the letters A, B, and C. Each premise com- 
bination was used four times with varied questions 
of the following forms: “Is A better than C?” “Is 
C better than A?” “Is A worse than C?” and “Is C 
worse than A?” The subjects received the cards in 
different random orders. 

The subjects’ task is described well by the instruc- 
tions given them: 


This experiment is concerned with the under- 
standing of the relationships between different peo- 
ple. You will each receive a deck consisting of 64 
cards. On each card you will find printed two 
statements followed by a question. The two state- 
ments will give you some information about the 
relationships among three individuals and the ques- 
tion will ask about the relationship between two i 
of these people. Below the question will be these 
possible answers: “Yes,” “No,” and “?.” If the 
answer to the question is “Yes,” circle “Yes.” If 
the answer is “No,” circle “No.” If the question 
cannot be answered conclusively “Yes” or “No” 
from the information given in the two statements, 
circle “?.” If you cannot decide which of the three 
answers is correct, do not circle anything. 

You will be given 10 seconds to work on each 
card. I will say “Next” every 10 seconds, Bach 
time I say “Next,” turn the top card over and 
work on it, placing the previous card on another 
pile, even if you have not circled an answer on it 
Do not move on to a new card until you hear mè 
say “Next.” y 

When a name is repeated on the same card, it 
refers to the same person, but when it is repeated 
on a different card, it refers to a different person: | 


The subjects in this experiment were 117 Johns 
Hopkins undergraduates. 


Results. Table 1 presents the percentage | 
of correct answers given by the subjects for 
the eight premise combinations which yield 
true orderings. It is apparent—even without 
any correction for guessing—that the premis? 
combinations varied quite widely in difficulty: 
An analysis of variance performed ON T 
data (assigning a correct answer a score of 
and a wrong answer a score of 0) yielded H 
significant main effect for premise combina 
tions (F = 13.45, df = 7/812, p < .001)- 
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© Discussion. One can reject immediately the 
classic conclusion about what makes linear 
syllogisms easy or difficult. According to the 
classic conclusion, Premise Combinations 1 
and 4 should be easy, since both proceed 
" from one extreme to the middle to the other 
extreme in presenting the elements. All the 
other combinations should be difficult. In- 
stead, Combination 4, far from being easy, is 
one of the most difficult combinations, and 5 
and 6, which should be difficult, are among 
the easiest. “a 
Clearly, an altogether different paralogic is 
required to account for the findings. We 
would like to propose two paralogical princi- 
ples that seem helpful. These principles are 
"not new, but their previous application has 
+ been to tasks other than reasoning tasks. The 
"first principle is that people learn orderings 
_ better in one direction than the other. The 
second principle is that people end-anchor 
orderings. 
: There is less precedent for the first princi- 
‘ple than for the second. One hint of it is 
found,inviews of rote serial learning which 
_ hold that the subject learns by chaining for- 
ward from the first item and backward from 
the last item, but learns better in the for- 
ward direction than in the backward direction 
(Ribback & Underwood, 1950), producing 
the well-known asymmetry of the serial po- 
sition curve. 
Consider the application of the first princi- 
= ple to the data in Table 1. We propose that 
$ the differences among the four premise com- 
binations in the upper half of the table can 
be well accounted for by this simple state- 
ment: people learn an evaluative ordering 
more readily from better to worse than from 
_ Worse to better. There are two ways in which 
this Statement applies to a premise combina- 
_ tion: within premises and between premises. 
X} A premise proceeds from better to worse 
Within when it gives the better element first 
and the worse element second, as in “A is 
better than B.” It proceeds from worse to 
poe within when it gives the worse element 
d oe and the better element second, as in “B 
ee than A.” A premise combination pro- 
Bice from better to worse between premises 
a a gives the better pair (A and B) first 
worse pair (B and C) second. It pro- 
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TABLE 1 


PERCENTAGE OF CORRECT ANSWERS FOR VARIOUS 
PREMISE COMBINATIONS BASED ON “BETTER” 


AND “WorsE”’ 
is % 
Genata |S Analysis 


better-to-worse 
better-to-worse 
etter-to-worse 
worse-to-better 
worse-to-better 
better-to-worse 
worse-to-better 
worse-to-better 
ends-to-middle 
better-to-worse 
ds-to-middle 
orse-to-better 
middle-to-ends 
: better-to-worse 
Within premises: middle-to-ends 
Between premises: worse-to-better 


. A is better than B 
B isbetter than C 
. Bisbetter than C 
A isbetter than B 
. B is worse than A 
Cis worse than B 
. Cis worse than B | 42.5 
B is worse than A 
Ais better than B 
Cis worse than B 
Cis worse than B 
Ais better than B 
. Bis worse than A 
Bis better than C 
. B is better than C 
Bis worse than A 


Within premi: 


P 


Py 


Between premisi 


> 


Within premises 
Between premis 
Within premise: 
Between premi. 
Within prem: 


as 


o x 


ceeds from worse to better between premises 
when it gives the worse pair (B and C) first 
and the better pair (A and B) second. 

As indicated in the “Analysis” column of 
Table 1, Premise Combination 1 proceeds in 
the optimal direction uniformly both within 
and between premises, and it is the easiest of 
the four upper premise combinations, in ac- 
cordance with the hypothesis. Premise Com- 
binations 2 and 3 both proceed in the optimal 
direction in one sense but in the wrong 
direction in the other sense, and produce in- 
termediate difficulty. Premise Combination 4 
proceeds in the wrong direction in both re- 
spects and produces the greatest difficulty. 

The second principle—that people end- 
anchor orderings—has long been used by in- 
vestigators of judgments and ratings, who 
commonly find that the extreme items of an 
ordered series of items are judged most con- 
sistently and serve as reference points for 
judgments of other items (Volkmann, 1951). 
It has also been used by investigators of rote 
serial learning in accounting for the fact that 
subjects learn the end items first and best 
(Feigenbaum & Simon, 1962). More recently, 
it has been found that even in nonserial learn- 
ing of an ordering the end items are learned 
first (De Soto & Bosley, 1962), and it has 
been proposed that in all cognitive tasks in- 
volving orderings, people will tend to end- 


anchor. 
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How does this principle apply to a linear 
syllogism? We have already hypothesized that 
the order in which elements are given in a 
premise has significance of one sort; now we 
hypothesize that it has significance of an ad- 
ditional sort. We hypothesize that it is help- 
ful to the subject if the first element given in 
the premise is an end element in the ordering, 
the best or the worst, so that the premise pro- 
ceeds from an end toward the middle rather 
than from the middle toward an end. This 
proposition does not make differential predic- 
tions for the first four premise combinations, 
in each of which one premise combination 
has the desirable property and one does not. 
It applies to the second four premise com- 
binations in Table 1. Combinations 5 and 6 
possess the desirable property in both prem- 
ises and are easy. Combinations 7 and 8 
lack it in both and are difficult. The first 
principle also applies partially to these latter 
four premise combinations. Within premises 
they are alike in having one premise go from 
better to worse and one from worse to better. 
In the between-premises sense, however, they 
differ. Combinations 5 and 7 give the better 
pair first, unlike 6 and 8, and should be rela- 
tively easier, as they are. 

To conclude, the results in Table 1, which 
can only be called chaotic in the light of 
the classic statements about linear syllogisms, 
seem to be quite nicely organized by these 
two simple paralogical principles. 

And these principles may illuminate other 
phenomena. The first principle implies that 
“better” should be the word of choice for in- 
terrelating elements more often than “worse.” 
This prediction is confirmed by the L count 
(Thorndike & Lorge, 1944), which shows that 
“better” is used 2,354 times in 4.5 million 
words, while “worse” is used only 450 times. 
(It should be noted that a reversed hypothesis 
—an argument that syllogisms using “better” 
are easier than syllogisms using “worse” 
because “better” is the more frequent word— 
would be unable to account for the differences 
in difficulty between premise combinations like 
1 and 2, which use identical words in dif- 
ferent arrangements.) 

The two principles together imply that the 
occasion for using the word “worse” would be 
to interrelate elements at the bad end of an 
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ordering, and not too far toward the middle 
of the ordering, just as forward chaining 

takes up a larger portion of the list than Jg 
backward chaining in serial learning. The 
baseball fan who cast Mantle and Mays into 
the nether world of ball players by asking 
who was worse rather than who was better 
can be seen, then, as capitalizing on some- 
thing more than idiosyncrasies of words. 


THE PARALOGIC OF SPATIAL DIRECTIONS 


Questioning of the subjects in Experiment I 
indicated that most of them solved the syllo- 
gisms with the aid of imagery. Most com- 
monly, they imagined the men’s names as at- 
ranged in a vertical ordering, with the best 
man the highest and the worst man the low- 
est. “Better than” was transformed in their 
thinking into “above”; “worse” became “be: | 
low.” 

The use of spatial imagery in thinking 
about nonspatial orderings has been noted 
previously by such diverse introspectionists 
as Lashley (1961) and Inhelder and Piaget 
(1958, p. 252). Unfortunately, however, 
casual introspective observations fail to prove 
that this imagery plays a key role in such 
thinking, much less delineate the role. After 
all, the serious and intensive investigations of 
imagery by Titchener and his students ended 
in failure to prove that images are anything 
more than insignificant concomitants of think- 
ing (Brown, 1958, p. 92). “gq 

An alternative approach is to regard im 
ages only as clues to the processes of thought 
and seek independent data bearing on the 
role of spatial representations in thinking 
about orderings. Such an attempt was made 
in the two remaining experiments. 


Experiment II 


This experiment was a systematic invest | 
gation of people’s imputations of spatial d 
rections to “better” and “worse” and also t 
“has lighter hair than” and “has darker hall 
than,” the latter pair chosen for having no 
apparent ties to spatial directions. 


Method. The subjects were given printed state 
ments about the relation between two people, cted 
example, “Tom is better than Bill,” and instru 
to write the names of the people in two o! 
boxes printed on the same sheet of paper. Tw 
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were at the ends of a vertical line and two were at 
the ends of a horizontal line which crossed the verti- 
cal one. The subjects were instructed to enter the 
names in opposite boxes, choosing them as seemed 
appropriate in consideration of the statement made 
about the names. 

Tn addition to the four relations mentioned above, 
four others were included as buffers. Each subject 
received the eight relations in a different random 
order with the restriction that 25 subjects received 
“better” before “worse,” 25 received “worse” before 
“hetter,” 25 received “lighter” before “darker,” and 
25 received “darker” before “lighter.” The subjects 


were 50 Johns Hopkins undergraduates. A 


Results. In the upper half of Table 2 are 
shown the frequencies with which the subjects 
imputed the various spatial directions to the 
warious relations when the relations were the 
first of their kind presented. That is, of the 

- 25 subjects who received “better” as their first 
evaluative relation, 23 treated the statement 
as proceeding downward, placing the first 
(better) man on top. One treated it as pro- 
ceeding upward, with the first man on the 
bottom, and one treated it as proceeding from 
left to right, with the first man on the left. 
In gontrast, of the 25 subjects who received 
worse” as their first evaluative relation, 19 
treated it as proceeding from bottom to top, 
with the remaining 6 subjects scattering their 
choices among the other three directions. 
These data yielded a chi-square of 28.3, sig- 
nificant at the .001 level. 

For the hair-color relations, the pattern is 
altogether different. Both “lighter” and 

darker” are treated with considerable fre- 
quency as top-to-bottom and also left-to-right, 
and, unlike “better” and “worse,” are decid- 
‘edly not ascribed directions opposite to one 
another, These data yielded a nonsignificant 
chi-square, 

In the lower half of Table 2 are presented 
results for cases in which each relation was 
the second of its kind. Here the breakdown 
of the data is according to how the subject 
treated this relation relative to his treatment 
of the first relation of the kind, whether he 
mputed to it the opposite direction, the same 
direction, or a direction on the other axis. It 
is evident that for the evaluative relations the 
Popular response was to impute the opposite 
direction to the second relation; 41 of the 50 
Subjects did so. For the hair-color relations, 

ere was a much weaker tendency to do this; 
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TABLE 2 
FREQUENCIES WITH WHICH SPATIAL DIRECTIONS 
WERE Imputep TO NONSPATIAL RELATIONS 


Imputations when relation was first of its kind 


Relation Direction 
Top-to- Bottom-to- | Left-to- Right-to- 
bottom top right left 
Better 23 1 1 0 
Worse 3 19 2 1 
Lighter 12 0 10 ’ 3 
Darker 5 0 17 3 


Imputations when relation was second of its kind 


Direction 


Opposite direction | Same direction | Other axis 
Better 20 3 2 
Worse 21 1 3 
Lighter 9 13 3 
Darker 14 8 3 


23 of the 50 subjects did so. A correlated- 
proportions chi-square of 11.57, significant 
at the .001 level, was calculated from these 
figures. 

Discussion. This experiment provides evi- 
dence that evaluative relations are indeed tied 
to a vertical axis in most people’s thinking. 
“Better than” proceeds from top to bottom, 
and “worse than” from bottom to top, when- 
ever they are presented. Hair-color relations 
are not so tied, although there were neverthe- 
less some consistencies in the subjects’ treat- 
ment of them. 


Experiment IIT 


In the older work on linear syllogisms there 
appears to have been a tacit assumption that 
ordering relations are homogeneous so far as 
people’s handling of them in a reasoning task 
is coneerned. These earlier experiments did 
in fact present subjects with syllogisms based 
on vatied ordering relations, for example, “is 
fairer [or darker] than,” “is to the left [or 
right] of,” but they did not make compari- 
sons among the different relations. In the 
present experiment direct comparisons were 
made of people’s performance on linear syllo- 
gisms formed of various ordering relations. — 
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An attempt was made, however, to do more 
than answer the simple question of whether 
or not varying the relation makes a difference 
in people’s performance. The attempt was 
predicated on the general hypothesis that in 
reasoning about orderings of elements people 
rely on spatial representations or thought- 
models which they construct in some cognitive 
space. With the results of Experiments I and 
II in hand, it seemed possible to frame some 
fairly strong predictions based on this hy- 
pothesis about the outcome of a further ex- 
periment on linear syllogisms. 

Consider first syllogisms based on an ex- 
plicit naming of relative positions on a verti- 
cal axis, that is, “above” and “below.” If, as 
we have supposed, subjects are in fact think- 
ing of such relations when ostensibly con- 
cerned with “better” and “worse,” then one 
might predict they would perform on “above” 
and “below” much as they performed on 
“better” and “worse.” In particular, they 
should proceed more readily in a downward 
direction than in an upward one, just as they 
proceeded more readily in a worseward direc- 
tion than in a betterward one. Our hypothe- 
sis is that worseward is downward in their 
thinking. 

Consider next syllogisms based on relations 
with no intrinsic tie to a spatial axis, such 
as the hair-color relations. We have not pre- 
cluded the possibility that in solving such 
syllogisms subjects will use spatial representa- 
tions, but we would have to predict that they 
would not use them in a single fixed way. The 
results of Experiment II imply that which- 
ever name is given first will be placed at the 
preferred starting point of one of the axes 

(top of a vertical axis or left end of a hori- 
zontal axis), and the following relational term 
will be transformed, whichever it is—lighter 
or darker—into “above” or “to the left of,” 
The subject should have no difficulty with this 
first premise. What then of the second prem- 
ise? Having already imputed a spatial direc- 
tion to the relation given in the first prem- 
ise, he should be able to handle the second 
premise readily if it uses the same relation. 
But if it uses the opposite relation, he may be 
in trouble because, as the results of Experi- 
ment II clearly imply, he will not readily im- 

ute the opposite spatial direction of this new 

ag 
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relation. In sum, in contrast to the findin 
for “better” and “worse” and the predici 
for “above” and “below,” the prediction 
“lighter” and “darker” is that they will bẹ 
equally difficult, and that syllogisms restric 
to one or the other will be relatively 
but that syllogisms including both will 
exceptionally difficult. 
Finally, consider syllogisms based explicit 
on left-right relations. The results from Exe 
periment II indicate that the left end is 
preferred starting point for a horizontal 
dering, just as is the top end of a vertical 
ordering. On the basis of this finding, it wor 
seem reasonable to predict that people ¥ 
proceed from left to right more readily 
from right to left in handling horizontal lineal 
syllogisms, that there is a left-right asym 
metry corresponding to the above-below 
asymmetry. In other respects too the resulti 
for left-right syllogisms should resemble thos 
for above-below syllogisms for the reason 
both are explicitly tied to spatial directions 


Method. For the comparisons to be made in. 
experiment, it was considered sufficient to work W 
the selected premise combinations and questions 
shown in Table 3, together with eight combinations 
not yielding orderings. Cards and instructions ¥ 
prepared closely resembling those of Experiment i 
The subjects were 146 Johns Hopkins undergradl= 
ates, 


Results. Table 3 presents, in the better 
worse column, data taken from Experiment] 
(the percentages vary somewhat from thei 
counterparts in Table 1 because of the tej 
striction to one question per premise coms 
bination), and, in the remaining columns, dat 
obtained from the 146 additional subjects. 
analysis of variance performed on the new 
data yielded a significant Premise Combi 
tions X Relations interaction (F = 9-77; 4 
= 6/870, p < .001), clearly disproving ™ 
old supposition that ordering relations 
homogeneous in reasoning tasks. The spec 
predictions are confirmed in detail. ; 

As predicted, the better-worse and abo 
below results are very much alike, with 
only sizable discrepancy occurring for 
ise Combination 4. In particular, the 
pothesis that people will proceed downwi 
More readily than upward in above-bel 
linear syllogisms is confirmed by the differ 
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TABLE 3 


PERCENTAGE OF CORRECT ANSWERS FOR SELECTED PREMISE COMBINATIONS 
BASED ON VARIOUS RELATIONS 


Relation 
Combi- 
nation 
Better-worse Above-below Left-right Lighter-darker 
1. | Ais better than B | Ais above B A is to the left of B A has lighter hair than B 
`B is better than C B is above C B is to the left of C B has lighter hair than C 


Ts A to the left of C? 


Is A better than C? | Is A above C? Does A have lighter hair than C? 
63.0 


67.5 65.8 52.0 i 
2. C is worse than B C is below B Cis to the right of B C has darker hair than B 
B is worse than A B is below A B is to the right of A B has darker hair than A 
Is C worse than A? | Is C below A? IÈ C to the right of A? Does C have darker hair than A? 
47.0 50.0 | 38.4 64.4 
Ae Ais better than B A is above B C is to the right of B A has lighter hair than B 
C is worse than B C is below B A is to the left of B C has darker hair than B 
Ts C better than A? | Is C above A?| Is A to the right of C? Does C have lighter hair than A? 
. 66.7 65.1 45.2 39.7 
4, B is worse than A B is below A B is to the left of C B has darker hair than A 
Bis better than C | B is above C B is to the right of A B has lighter hair than C 


Is C better than A? | Is C above A? 
35.9 46.6 


Is A to the right of C? 
26.7 


Does C have lighter hair than A? 
24.7 


ence between Combinations 1 and 2 in the 
above-below column. 

The prediction that syllogisms based on 
“lighter” alone or “darker” alone would be 
relatively easy—and equally easy—is clearly 
confirmed, as is the prediction that syllogisms 
based on mixtures of “lighter” and “darker” 
would be exceptionally difficult. 

Tn the left-right column, the hypothesis 
that people will proceed from left to right 
more readily than the reverse is confirmed by 
a comparison of the percentage correct for 
Combination 1, which proceeds from left to 
tight both within and between premises, and 
Combination 2, which proceeds from right to 
left in both senses. The prediction that the 
overall pattern of results for this column 
should resemble that for the above-below 
column is less clearly substantiated. The re- 
sults certainly are much more like the above- 
below results than they are like the lighter- 
darker results. But the percentages are lower 
throughout the column—left-right syllogisms 
seem generally difficult. Unfortunately, Prem- 
lse Combinations 3 and 4 do not corre- 
| Spond exactly to those in the above-below 
wk as they proceed in the wrong direction 

etween premises, probably reducing these 
Percentages still further. 

Discussion. The confirmation of the several 
Predictions appears to give strong support to 


the hypothesis that people rely on spatial 
representations in thinking about orderings. 
The unpredicted result, the general difficulty 
of left-right syllogisms, may also support it. 
Momentary inability to decide which direc- 
tion is left and which is right is a common 
failing—certainly more common than igno- 
rance of which way is up and which down. 
It is a failing that could be expected to im- 
pair performance on a left-right syllogism for 
a subject who used spatial representations, 
but hardly for a subject who treated left and 


right simply as verbal opposites. 


SPATIAL REPRESENTATIONS AND THE 
ORDERING SCHEMA 


These findings seem helpful in answering 
earlier questions about how it is that people 
are so good at giving elements a linear order- 
ing in their thinking, yet so poor at interrelat- 
ing them in most other ways (De Soto, 1960), 
and at giving the same set of elements more 
than one ordering (De Soto, 1961). The ear- 
lier answer was that people have an ordering 
schema, an inner mechanism or procedure for 
ordering things, and lack schemas for most 
other ways of jnterrelating things. This ap- 
peal to schemas seemed preferable for a num- 
ber of reasons to the essentially nonpsycho- 
logical statement that linear orderings are in- 
trinsically easier than other structurings of 
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elements. But, despite its value as a reminder 
that inner processes must subserve even—or 
especially—the accomplishments people find 
easy, it has the shortcoming of saying nothing 
about what these inner processes might be 
like. 

Now it can be argued that the inner pro- 
cedure in thinking about orderings consists 
primarily in arranging elements on an axis in 
some cognitive space. We believe this view 
makes comprehensible a number of otherwise 
disconnected and inexplicable phenomena. 


Predilection for Single Orderings 


The tendency to treat a given set of ele- 
ments as having only one ordering, with 
strong aversion to multiple orderings of the 
set, has been noted as a phenomenon and 
problem in diverse realms of thought (De 
Soto, 1961; Sampson, 1963). It was, in fact, 
originally established as an inductive gen- 
eralization based on its diverse manifesta- 
tions. Subsequent experiments pinpointed it 
as a difficulty in applying the same schema 
to the same set of elements more than once, 
rather than as simply the difficulty of han- 
dling complex tasks. But even those experi- 
ments failed to reveal anything about the in- 
ternal processes involved. 

Now, the explanation of the predilection 
would run as follows. People are good at 
thinking of elements as ordered because they 
can readily arrange them appropriately on an 
axis—ordinarily the vertical axis—in their 
cognitive space. But this space is in a sense 
one-dimensional. It does not provide other 
axes; even the left-right axis seems relatively 
unavailable. And the space is certainly not 
an n-dimensional cartesian coordinate system 
in which an element can have different values 
on different dimensions, nor can an element 
simultaneously occupy different positions on 
the one axis. Thus if a subject is asked to 
think of people as ordered by wealth, he can 
do so readily enough, arranging them on the 
vertical axis. But he cannot simultaneously 
arrange them on another axis called “power,” 
or one called “prestige.” There is no other 
axis, no additional way of imposing an order- 
ing on the set of elements. 

It is not by chance that “high” and “low,” 
and “above” and “below,” have their tre- 
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mendous range of metaphorical meanings, nor 
is it a surprise that they are among the most | 
frequently used words in the language accord- 
ing to the Thorndike-Lorge (1944) count. 
Indeed, according to the dictionary defini- 
tions of such words as “high,” the tie to the 
vertical axis we have found for better-worse 
relations seems to exist for the three major 
dimensions of meaning reported by Osgood, 
Suci, and Tannenbaum (1957), good-bad, 
strong-weak, and fast-slow. The failure of 
lighter-darker relations to be so tied may be 
quite exceptional, perhaps occurring because 
lighter-darker relations are usually used only 
to contrast, not to order, elements, And when 
an evaluative component is introduced into 
lighter-darker relations, it too may quickly 
become tied to the vertical axis, as in “high 
yaller,” the lightest-skinned (best) Negro. 


Linear Order as Good Figure 


Perhaps a more profound problem in think- 
ing than the predilection for single orderings 
is the evident predilection for linear orderings 
at the expense of other varieties of organiza- 
tion, such as partial orderings and dominance 
structures (Coombs et al., 1954; De Soto, 
1960; De Soto & Kuethe, 1959; Kuethe, 
1962; Wunderlich, Youniss, & De Soto, 
1962). It appears that the linear ordering 1S, 
by analogy with “good figures” in perception, 
a “good figure” in cognition. Thought tends 
toward the linear ordering; and the linear ot- 
dering is evaluated positively. It is no ac 
dent that the generic term for good arrange 
ment in English is order: the prototypic e 
ample of good arrangement is order 10 its 
narrow sense of linear order. 

Under the hypothesis that thinking about 
seemingly abstract orderings depends on spa 
tial representations, one might look for evi 
dence that the spatial representations them- 
selves have “good figure” properties. on 
source of such evidence is suggested by Asch’ 
(1958) remarkable finding that “straight” 1 
English, as well as its equivalent in other 
languages, universally carries a metaphotle 
meaning of goodness and propriety, whereas 
“crooked” conveys evil and impropriety: , 3 
can add that it is good to be “upright” % 
“on the level,” and bad to be “oblique 
“slanted,” or “biased.” And other spat% i 
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structures which in principle could subserve 

, cognitive organizations have much weaker 
positive connotations, if not negative ones: 
it is bad to be cross, or to cross someone, 

and especially to double-cross someone; it is 
bad to be square; a love triangle is bad; a 
vicious circle is bad; it is bad to have a 
checkered career; or to weave a web of deceit. 
The evolutionary tree—a very apt metaphor 
for the partial ordering laid down by evolu- 
tionary processes—does not have strong nega- 
tive connotations, but it hardly has the ap- 
peal of the seriously misleading scale of na- 
ture which dominated western thought for so 
long (Lovejoy, 1960). 

‘It seems probable that the linear ordering, 
„and perhaps also: the simple grouping, are 
"preeminent good figures in cognition in a way 

unmatched by any perceptual good figures. If 

this is so, it is urgent to realize it and seek 

some understanding of it, for only by know- 

ing our proclivities can we avoid being 
, trapped by them. 
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EFFECTS OF QUESTION SPECIFICITY AND ANXIETY- 
PRODUCING MESSAGES ON VERBAL FLUENCY IN 
THE INITIAL INTERVIEW * 


ARON WOLFE SIEGMAN anp BENJAMIN POPE 
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An experimental analogue of the initial interview is used to investigate the 
effects of interviewer specificity and topical focus, i.e., a low anxiety-arousing 
vs. a high anxiety-arousing topic, on interviewee’s verbal behavior. It was 
found that low-specificity interviewer remarks are associated with verbal 
indices of caution and hesitation (“ah’s,” ‘a slow articulation rate and silent 
pauses). It is suggested that a conceptualization of the specificity variable in 
terms of informational uncertainty provides a parsimonious explanation for 
the above findings. The anxiety-arousing topic was associated with disrupted 
speech (“non-ah” speech disturbances). 


This study uses an experimental analogue 
of the initial interview to investigate the 
effects of interviewer specificity and topical 
focus (a relatively low anxiety-arousing ver- 
sus a relatively high anxiety-arousing topic) 
on verbal fluency in interviewee’s responses. 
Following the information theory conceptual- 
ization of the psychotherapeutic interview 
advanced by Lennard, Bernstein, Palmore, 
and Hendin (1960), the specificity level of 
an interviewer’s remark is defined in terms of 
the limits it sets on interviewee’s choice of 
response, Thus, an interviewer’s remark “Just 
start by saying anything that occurs to you,” 
has a low specificity value because it does not 
limit the interviewee’s response to a specific 
subject matter or proposition. On the other 
hand, the question “How old are you?” has 
a high specificity value because it limits the 
range of possible alternatives from which the 
interviewee can select his reply (Lennard et 
al., 1960, p. 11). 

In two previous studies, one using natural- 
istic initial psychiatric interviews and the 
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other an experimental analogue of such inter- 
views, Pope and Siegman (1962, 1965) found 
an inverse relationship between interviewer 
specificity and interviewee productivity; the 
lower the specificity level of the interviewers 
remarks, the higher the productivity level of | 
the interviewee’s responses, The authors sug- 
gest that low interviewer specificity creates 
a condition of informational uncertainty 
which the interviewee strives to reduce 
through increased productivity. x 

In the study of naturalistic initial psychi 
atric interviews Pope and Siegman (1962) | 
also found that low-specificity interviewet 
remarks are associated with more speech dis- 
ruptions in interviewee’s responses than alt 
high-specificity interviewer remarks. Thel 
measure of speech disruption was Mahli 
(1956) general speech disturbance ratio 
(SDR), which is based on the following 
categories: “ah” and allied expressions 0 
hesitation, sentence correction, sentence m; 
completion, repetition, stutter-stammer, M 
truding incoherent sound, tongue-slip, an 
omission of a word or part of a word. It i 
thus clear that while low-specificity inte 
viewer remarks are associated with great! 
productivity they are also associated with less 
fluent speech. Assuming that speech disturb- 
ances are an index of anxiety, Pope and Sies 
man (1962) conclude that low-specificl! 
interviewer remarks have an anxiety-arousing 
effect, It was suggested that the ambiguity 
and uncertainty which are associate 3 
a low-specificity interviewer remark arè © 
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source of interviewee anxiety which mani- 
fests itself in speech disturbances. The as- 
sumption that speech disturbances are an in- 
dex of concurrent anxiety is based on a study 
by Mahl (1956) in which he found higher 
SDRs for high-anxious than for low-anxious 
phases of six therapeutic interviews with the 
same patient. 

Other studies provide a measure of con- 
struct validity for the anxiety relevance of 
speech disturbances (Boomer, 1963; Panek 
& Martin, 1959), although not all the evi- 
dence is uniformly supportive (Boomer & 
Goodrich, 1961; Brenner, Feldstein, & Jaffe, 
1963). It should also be pointed out that 
Mahl (1959), in a more detailed analysis of 
his earlier data (Mahl, 1956), found that not 
all categories of speech disturbance are 
related to anxiety. Considering separately 
speech disturbances due to “ah” and allied 
expressions of hesitation and all other cate- 
gories of speech disturbance (i.e., repetition, 
correction, etc., hereafter referred to as 
“nomah’”), he found that only the latter were 
related to anxiety in the patient’s responses. 
A subsequent study by Kasl and Mahl (1958) 
also indicates that anxiety is associated with 
non-ah but not with ah speech disturbances. 
„An alternative interpretation of the rela- 
tionship between interviewer specificity and 
interviewee speech disturbance is in terms of 
uncertainty, Recent evidence indicates that 
uncertainty does in fact produce speech dis- 
turbances, Brenner et al. (1963) used pas- 
Sages varying in their degree of approxima- 
tion to written English for generating dif- 
ferent levels of uncertainty. They found that 
as uncertainty increases there is a corre- 
sponding increase in non-ah speech disturb- 
ances, Maclay and Osgood (1959) found that 
ah speech disturbances (“filled pauses” as 
they designate them) tend to occur more 
frequently before lexical words than before 
functional words. They attribute this finding 
to the greater uncertainty which is involved 
in the choice of lexical words as opposed to 


_ functional words. Assuming, as has been sug- 


gested earlier, that low interviewer specificity 
Creates a condition of informational uncer- 


_ tainty, the latter may be considered as an ex- 


Planatory hypothesis for the obtained rela- 
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tionship between interviewer specificity level 
and interviewee speech disturbance. 

The problem of interpretation aside, our 
content analytic study of naturalistic inter- 
views (Pope & Siegman, 1962) is subject to 
at least one major methodological criticism. 
It is not unusual for an interviewer to be 
more general and less specific when introduc- 
ing a potentially anxiety-arousing topic. It is 
thus possible that the interviewer’s specificity 
level was confounded with the anxiety-arous- 
ing potential of the content of his remarks. 
In the present study, therefore, one half of 
an experimental interview consisted of a rela- 
tively high anxiety-arousing, or an emotional, 
topic, and the other half of a relatively low 
anxiety-arousing, or a neutral, topic. Within 
each topic the interviewer asked an equal 
number of high-specificity and low-specificity 
questions. This makes it possible to evaluate 
the effects of interviewer specificity and of 
topical focus independently of each other. 

There are two major objectives to the pres- 
ent study. The first is to test the hypothesis 
that there is an inverse relationship between 
interviewer specificity and interviewee verbal 
fluency. The indices of verbal fluency in- 
cluded, first of all, Mahl’s ah and non-ah 
speech disturbance ratios. It will be recalled 
that in our content analytic study no distinc- 
tion was made between ah and non-ah, so 
that we cannot know from that study whether 
interviewer specificity is related to only one 
or to both categories of speech disturbance. 
In the present study, however, the distinction 
between the two measures was observed. 

Studies conducted by Goldman-Eisler 
(1961b, 1961c) indicate that uncertainty also 
affects a speaker’s reaction time, his silent 
pauses and perhaps also his articulation rate. 
Goldman-Eisler manipulated the uncertainty 
variable by asking subjects to describe a 
series of New Yorker cartoons and to formu- 
Jate their meaning, the assumption being that 
the latter task involves relatively more un- 
certainty than the former. She found that the 
second of the two tasks was associated with a 
longer reaction time and longer silent pauses. 
Although subjects’ articulation rate was not 
differentially affected by the two tasks, it 
did increase with practice (Goldman-Fisler, 
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1961c), suggesting that uncertainty may also 
affect articulation rate. On the assumption, 
then, that low interviewer specificity is a 
source of interviewee uncertainty, we can 
now state the following hypothesis: 

Hypothesis 1. Low-specificity interviewer 
remarks are associated with more ah and 
non-ah speech disturbances, a longer reaction 
time, more silent pauses, and a slower artic- 
ulation rate than are high-specificity inter- 
viewer remarks, 

The second major objective of this study is 
to ascertain the effects of an anxiety-arousing 
topic on nonlexical parameters of interviewee’s 
verbal behavior. As already indicated, previ- 
ous studies suggest that anxiety is associated 
with non-ah but not with ah speech disturb- 
ances (Kasl & Mahl, 1958; Mahl, 1959). 
Using silent pauses as a measure of nonflu- 
ency, Mahl (1956) found a significant posi- 
tive relationship between such pauses and 
rated anxiety, although he could not repli- 
cate this finding in a subsequent study (Mahl, 
1961). These studies suggest that anxiety 
has a disruptive effect on verbal fluency. 
Kanfer (1958, 1960), on the other hand, 
found that verbal rate accelerates in response 
to shock-conditioned tones and in response 
to anxiety-arousing interview topics. Davids 
and Eriksen (1955) found a positive correla- 
tion between manifest anxiety and productiv- 
ity in a word-association task, and Pope and 
Siegman (1965) found a similar relationship 
between the anxiety-arousing effect of inter- 
view topics and interviewee productivity. 
These findings suggest that anxiety has an ac- 
tivating (Duffy, 1962) and facilitating effect 
on verbal behavior. These two sets of data, 
however, are not necessarily contradictory: 
as a matter of fact, the drive properties of 
anxiety which are responsible for its activat- 
ing and facilitating effects may also be the 
cause of its disruptive influence, More spe- 
cifically then we investigated the following 
hypothesis, suggested by previous studies: 

Hypothesis 2, A relatively high anxiety- 
arousing topic is associated with the occur- 
rence of more non-ah, more silent pauses, 
s and a faster articulation rate than a rela- 
Mively low anxiety-arousing topic, 
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PROCEDURE 
Subjects 


The interview subjects were 50 female students 
at the University of Maryland School of Nursing, 
ranging in age from 20 to 22. These subjects were 
selected from the total membership of the junior and 
senior classes (WV = 104) on the basis of the follow- 
ing criteria: both parents living; one or more sib- 
lings; no indication, on a specially devised ques- 
tionnaire, of school or college maladjustment. The 
purpose of the first two selection criteria was to fit 
each subject into the content structure of the ex- 
perimental interview. The third criterion was used 
to eliminate all subjects who reported stressful ex- 
perience in any period of past school history, and 
thus facilitate the use of school history as a low 
anxiety-arousing topic. 


Interview Design 


The experimental interview (see Table 1) repre- 
sents a systematic manipulation of specificity level 
and topical focus, the two independent variables of 
the present study. 

The low anxiety-arousing or neutral topic deals 
with schooling; the high anxiety-arousing or emo- 
tional topic deals with family relations. In ordet 
to enhance the anxiety-arousing potential of the 
emotional topic, the interviewer made the fellow- 
ing introductory remarks. 


You remember taking a number of personality 
tests over the past few weeks, Several of the 
questions dealt with family relations. I am intet- 
ested in talking about this area further with you, 
because from past experience we know that even 
when a person is not aware of problems in this 
area and therefore does not express them in het 
responses to the test, such problems may actually 
exist. 


Where appropriate the interviewer concluded: 


Of course in your case you are aware of ay 
problems and you have indicated them in y0 
responses to the tests.? 


A control for a possible order of specificity oe 
is built into the interview design: two i 
specificity remarks are followed by four high 
specificity remarks, and then again by two pr 
specificity remarks in each topical area. On a 
assumption that starting the interview with 4 
emotional topic enhances its anxiety-arousing Lae 
tential, and possibly elevates the anxiety level n 
the whole interview, the order of topical foci 


2 The effectiveness of the topical manip! lat 
indicated by subjects’ postinterview self-rating a 
two 6-point bipolar scales: comfortable-unc’ 
fortable and tense-relaxed. Subjects indicate n 
they were significantly more uncomfortable a af 
tense during the emotional than during the neu 
topic. 


; 
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TABLE 1 
EXPERIMENTAL INTERVIEW DESIGN 
Specificity Specificity 
Rating Rating 
Neutral topic 2. Iknow very little about schools of 
Low specificity nursing. Tell me some more about the 
1. Now would you please tell me some- School of Nursing, anything that oc- 
thing, anything you would like, curs to you. 6.3 
about the schools you attended be- Emotional topic 
fore coming to the School of Nursing. 4.1 Low specificity 
2. Tell me something about the elemen- 1. Would you please tell me something 
tary school you attended (junior high about your immediate family and how 
school, high school—whichever was A you got along with them, 4.1 
not emphasized in response to Ques- 2. Tell me more about your father 
tion 1) and how you got along there. 6.3 (mother—whichever was not empha- 
High specificity* sized in response to Question 1), just 
1, In which of these schools did you get anything you can think of. 6.3 
along better? 10.9 High specificity* 
* 2. Why? 7.6 1. With whom did you get along better, 
3. What do you mean by. . . ? 7.6 with your father or with your mother?} 10.9 
4. Which of these schools was a better 2, Which member of your family are you! 
school? 10.9 closest to? 10.9 
5. Why was this school better? 7.6 3. Why? 7.6 
6. Did your school have good library fa- 4, What do you mean by . . . 2 7.6 
cilities? 10.9 5. pa your father and mother get along 10 
i A 2 well? 5 
i payon par tips oe TanyeRe 10.9 6. What was your reaction to this? 7.6 
5 i enoni ji 7. Did your mother tell you about men- 
8. Why? (if the subject says “no” in re- struation? 10.9 
4 Question 7) 7.6 : Ava i 
eponse to a, 8. Did your father treat all his children 
9, Would you say that you participated equally? 10.9 
© more than the average student? 10.9 9. Did your mother treat all her children 
10. What do you mean by . . . ?and equally? 10.9 
What was your reaction to . . . ? Low specificity 
when appropriate. 7.6 1. Tell me more about your brothers and 
Low specificity sisters. 6.3 
1. Now I'd like you to tell me something, 2. We've talked about you and your 
just anything that occurs to you about parents when you were younger. Tell 
your first two years in the School of me something about you and your 
Nursing at College Park. 6.3 parents now. 6.3 


a Four of these questions were used, as appropriate. 


(emotional-neutral versus neutral-emotional) was 
alternated sequentially as the subjects appeared for 
the interview. 

The numbers after each interviewer question in 
the interview outline (Table 1) are the ratings of the 
interviewer remarks on the empirical Specificity 
Scale developed by the authors (Siegman & Pope, 
1962). More than the required number of questions 
was prepared in order to give the interviewer some 
latitude in responding to the subject. 

„One of the authors (BP) interviewed all subjects. 
Neck microphones were worn by both the inter- 
viewer and subject. Tape recordings were made in 
the adjoining observation room. 


Scoring of Dependent Variables 


4 on assistant scored interviewees’ responses for 
ey disturbance from verbatim transcripts, while 
ae taneously listening to the tape recording of the 

s E Each instance of speech disturbance was 
mie led under the appropriate category, and then 
immed to yield one ah score and one non-ah score 


(Mahl, 1956) for each response, The ah and non-ah 
scores of each response were then transformed into 
ratios by dividing each score by the total number 
of words in that response. 

To obtain a measure of interscorer reliability, 
three interviews were scored independently by one 
of the authors and a research assistant. The per- 
centage of agreement (Dollard & Auld, 1959, pp. 18- 
19) between the two scorers was 89 for the ah dis- 
turbances and 74 for the non-ah disturbances. The 
research assistant then scored all the interviews. 

For the scoring of reaction time, silent pauses, 
and articulation rate, the tapes were played back 
and the relevant time readings were made with a 
stopwatch. The reaction-time score consisted of the 
duration (in seconds) of the interval between the 
end of the interviewer’s remarks and the beginning 
of interviewee’s response. Two seconds was the 
minimum criterion for a silent pause. The durations 
of all the silent pauses were summed and then di- 
vided by the total time of the response. Following 
Goldman-Fisler (1956), the articulation rate of each 
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TABLE 2 
ANALYSIS OF VARIANCE RESULTS 
MS 
ies z Ah Non-ah | Reaction | Articulation | Sie 
ratio ratio time | rate quotient 
Order of topical focus (A) 1 | o2s0 | .0000 | 31500" | 835 | si 
Subjects within groups (error a) 48 .0098 .0092 | 48.54 ó 8, 3079 
Specificity (B) 1 .0435* 0014 | 7.60 35.46 0001 
AXB 1 .0047 | 0034 | 4141 -23 ree 
bjects withi rror b) 48 .0031 0044 | 25.86 1,37 ol 
A ETUE E e . 1 | .0033 | .0079** | 66.12 1.73 ‘0946 
AXC 1 0045 0036. | 14.05 5.06* 0187 
C X Subjects within groups (error c) 48 0014 .0010 18.67 | 104 0365 
BXC 1 | ‘009% | ‘0004 | st 3.28 ‘0413 
B X C X Subjects within groups (error bc) | 48 0019 .0026 13.39 1. 0673 
j n 
*p <.05. 
p SOL, 


response was determined by dividing the number of 
words by subject’s speaking time, that is, the dura- 
tion of the response minus the silent pauses. Inter- 
scorer reliability (r) for reaction time, total time of 
response, and silent pauses, the basic data from 
which all the temporal indices are derived, is, 
respectively, .97, .99, and .96. 


RESULTS 


The data concerning the effects of inter- 
viewer specificity, topical focus, order of 
topical focus, and of their interactions on the 
dependent variables were subjected to analy- 
ses of variance for repeated measurements 
(Winer, 1962, pp. 314-330). The results of 
these analyses are summarized in Table 2. 


Interviewer Specificity Level 


On the assumption’ that low-specificity in- 
terviewer remarks are a source of interviewee 
uncertainty manifested in nonfluency, Hy- 
pothesis 1 postulates that low-specificity in- 
terviewer remarks are associated with more 
ah, more non-ah, a longer reaction time, more 
silent pauses, and a slower articulation rate 


TABLE 3 


An Ratios AND ARTICULATION RATES FOR 
HicH AND Low SPECIFICITY SEGMENTS 


Ah ratio Articulation 
Interview rate 
segment 
M SD M SD 
High specificity | .0186 | .0152 2.91 425 
Low specificity | .0259 | .0169 2.70 360 


than high-specificity interviewer remarks, Of 
these predictions, the ones pertaining to ah 
and articulation rate were supported; those 
pertaining to non-ah and reaction time were 
not, In regard to silent pauses the results 
are ambiguous. As can be seen in Tables 
2 and 3, low-specificity interviewer remarks 
were associated with significantly (F = 14.16, | 
p < .01) more ah speech disturbances. Fol- 
lowing a recommendation by Cohen (1964), 
wherever we have significant findings we will 
point out not only probability levels but also 
the magnitude of the relationship between 
the two variables. Thus, the correlation bè 
tween interviewer specificity and interview?” 
ah ratios, as estimated by epsilon (e); $ 
479, Low-specificity interviewer remarks us 
also associated with a significantly (E3 
25.88, p < .01) slower articulation rate than 
high-specificity remarks (Tables 2 and 3): 
The magnitude of this relationship 1S indi- 
cated by an e of .581. Although the analyst 

3 Epsilon is an unbiased estimate of the corte 


tion ratio, eta, and may be computed by the fom 
ula, 


where: 5 
df,=degrees of freedom associated WI 
numerator of the F ratio ay the 
dfa= degrees of freedom associated with E 
denominator of the F ratio. i 

In terms of its magnitude, e may be interpreted m 


much the same way as a product-moment r. cok ail 
r, e does not presume a linear relationship 


1964). 


th the 
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of variance results for silent pauses are not 


significant, an inspection of the raw data 
revealed a higher proportion of silence-free 
responses (responses with zero silence quo- 
tients) in the high-specificity segment of the 
interview as compared to the low-specificity 
segment. Thus, 35 subjects had two or more 
silence-free responses in the high-specificity 
segment of the interview, but only 16 sub- 
jects had a similar proportion of silence-free 
responses in the low-specificity segment (x? 
= 12.97, p < .001). This analysis indicates 
that low-specificity interviewer remarks are 
indeed associated with silent pauses. 

To summarize, then, low-specificity in con- 
trast to high specificity remarks are associ- 
ated with more ah’s, a slower articulation 
rate, and fewer silence-free responses. These 
findings are consistent with the proposition 
that interviewer specificity correlates posi- 
tively with interviewee fluency. 


Topical Focus and Order of Topical Focus 


According to Hypothesis 2, the high anxi- 
ety-frousing or emotional topic should be 
associated with more non-ah speech disturb- 
ances, more silent pauses, and a faster articu- 
lation rate than the low anxiety-arousing, or 
neutral topic. As indicated in Tables 2 and 
4, the predicted relationship between the 
anxiety-arousing topic and non-ah was clearly 
confirmed (F = 7.56, p < .01). The magni- 
tude of this relationship is indicated by an « 
of .369. It should be noted that there is no 
significant relationship between topical focus 


and the ah ratio. This finding is consistent 


with the guiding assumption of the present 
study that anxiety is associated with non-ah 
but not with ah speech disturbances. 

There were no significant relationships be- 
tween topical focus on the one hand, and 
silent pauses and articulation rate on the 
other hand. Nor was there any indication 


TABLE 4 
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TABLE 5 
REACTION TIME AS A FUNCTION OF 
ORDER oF ToricaL Focus 
Order of topical focus | N M SD 
Neutral-emotional 25 1.99 65 
Emotional-neutral 25 2.62 1,02 


that the distribution of silence-free responses 
differs significantly as a function of topical 
focus (x? = 1.01). 

It will be recalled that the basis for the 
order of topical focus manipulation (emo- 
tional-neutral versus neutral-emotional) was 
the expectation that starting the interview 
with an anxiety-arousing topic might raise 
the interviewee’s anxiety level throughout the 
interview, or at least enhance the effects of 
the anxiety-arousing topic. The results pro- 
vide some support for both of these expecta- 
tions (Table 2). First, as indicated in Tables 
2 and 5, the emotional-neutral order had the 
effect of increasing subjects’ reaction time 
throughout the interview. The magnitude of 
this relationship is indicated by an e of .317. 
Second, the emotional-neutral order had the 
effect of accelerating interviewees’ articula- 
tion rate during the emotional segment of 
the interview, although the magnitude of this 
relationship, as indicated by an e of .271, is 
low. 

One additional significant finding, namely, 
the Topical Focus X Specificity interaction 
effect for ah speech disturbances (Tables 2 
and 6), indicates that the inverse relation- 
ship between interviewer specificity and the 
ah ratio is more pronounced during a high 
anxiety-arousing topic than during a rela- 
tively neutral topic. This difference is due to 
the fact that high-specificity remarks in the 


TABLE 6 
Au Ratios FoR THE FOUR Interview SEGMENTS 


M SD 
Non-am Ratios ror EMOTIONAL AND Jatene w Sere 
NEUTRAL SEGMENTS Emotional 
Low specificity .0267 0174 
Interview segment M SD High specificity .0158 0135 
Neutral 

Neutral 0231 0136 Low specificity 0252 0164 
Emotional (0262 0174 High specificity 0213 0154 
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TABLE 7 
INTERCORRELATIONS OF DEPENDENT VARIABLES 


icula- | Reacti Silent 
Non-ah | tion rate | “tines | pauses 
Ah .215 | —.591** A47 .252* 
Non-ah —.073 —.073 | —.058 
Articulation rate —.081 | —.219 
Reaction time .530** 


a All correlations involving this variable were additionally cal- 
culated separately for the emotional-neutral sequence group and 
the neutral-emotional sequence group. The two sets of correla- 
tions were about the same. 

#10 < p > 05. 

wp <.05, 


emotional topic elicited significantly fewer 
ah disturbances than high-specificity remarks 
in the neutral topic (ż= 2.53, p< .01). 
Alternatively, it can be said that during the 
high-specificity segment of the interview anxi- 
ety was associated with a reduction in ah 
speech disturbances, 

By way of summary, then, we find that 
the only significant independent effect of 
topical focus is in relation to non-ah speech 
disturbances. However, if one includes the 
significant effects of order of topical focus, 
alone and in interaction with topical focus, 
there is evidence that anxiety additionally is 
associated with a long reaction time and a 
fast articulation rate. 


Intercorrelation of Dependent Variables 


Table 7 lists the intercorrelations of the 
various verbal behavior indices. The findings 
are, by and large, consistent with the results 
obtained on the basis of the experimental 
manipulations. Thus, the absence of a signifi- 
cant correlation between ah and non-ah is 
not surprising in view of the finding that the 
former varies as a function of interviewer 
specificity and the latter as a function of 
anxiety arousal. If we look at the pattern of 
intercorrelations we find some justification 
for a concept of cautious and hesitant speech, 
which is manifested by ah speech disturb- 
ances, a slow articulation rate, and, to a les- 
ser extent, by silent pauses. Finally, we note 
a significant positive correlation between 
silent pauses and reaction time, which is con- 
sistent with Goldman-Eisler’s (1961b) posi- 
tion that these two variables represent slightly 
variant expressions of the same phenomenon. 
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Discussion 


We are now in a position to evaluate the 
effects of interviewer specificity and of an 
anxiety-arousing topic on interviewee verbal 
behavior. It will be recalled that low specific- 
ity in contrast to high specificity is associated 
with ah speech disturbances, a slow articula- 
tion rate, and silent pauses, These findings 
indicate that low-specificity interviewer re- 
marks elicit more cautious and hesitant speech 
in the interviewee than high-specificity re- 
marks, This is consistent with the general 
hypothesis that interviewer low specificity 
creates a condition of informational uncet- 
tainty. 

Turning now to the effects of the anxiety- 
arousing topic on interviewees’ verbal be- 
havior, we find, as did Kasl and Mahl (1958) 
and Mahl (1959), that anxiety is associated 
with the non-ah but not with the ah variety 
of speech disturbance. Actually there is evi- 


dence that during the high-specificity seg. 


ments of the interview anxiety reduced the 
occurrence of ah. However, the anxiety- 
arousing topic, in its independent effects, 
failed to show the predicted relationships 
with the other verbal indices, that is, silent 
pauses and articulation rate, We are inclin 
to believe that the limited impact of the 
topical variable, as a main effect, is a conse- 
quence of a weak manipulation of inter- 
viewee’s anxiety level. It will be recalled 
that our subjects were requested to rate the 
emotional and neutral segments of the inter- 
view on two 6-point bipolar scales: comfort- 
able-uncomfortable and relaxed-tense. 
though subjects indicated that they welt 
significantly more tense and uncomfortable 
during the emotional segment, their mea 
ratings were only 2.50 and 2.20, still on the 
comfortable and relaxed sides of the scales. 
These values suggest that the emotional topie 
had only a moderate anxiety-arousing effect 
We have reason to believe that a stronge 
manipulation of the anxiety variable wou 
yield more pronounced relationships betwee 
anxiety and the verbal indices which W 
used in this study. This conclusion 1$ sug 
gested by the order of topical focus 
alone and in interaction with topical ot 
on reaction time and articulation rate. f it 


ch wee 


La 


assumed that by starting the interview with 
the emotional rather than the neutral topic 
we enhance the anxiety-arousing potential of 
the emotional topic,* then these findings indi- 
cate that a stronger manipulation of anxiety 
would show the expected effect on articula- 
tion rate, By virtue of the same reasoning, 
we could expect a positive correlation between 
anxiety and reaction time. 

Tt is difficult to fit all the findings regard- 
ing the effects of anxiety on verbal behavior 
into a single conceptual framework. Nevér- 
theless, they are consistent with the position 
that anxiety has both an activating as well as 
a disruptive effect on verbal behavior. Its 
disruptive effect is clearly noted in its as- 
Sotiation with non-ah speech disturbances, Its 

activating effect is noted in its association 

with an accelerated articulation rate. The 
finding in an earlier study (Pope & Siegman, 

1965), that an anxiety-arousing topic is as- 

Sociated with greater productivity is also 
, Consistent with the concept of anxiety as an 

activator. 

We are now in a position to compare the 
effects of interviewer specificity and of an 
anxiety-arousing topic on interviewee verbal 
fluency. It can be concluded that both low 
Specificity and the focus on an anxiety-arous- 
ing topic interfere with verbal fluency, al- 
though each of these conditions produces 

"different symptoms of nonfluency, An anxiety- 
arousing topic produces disrupted speech, as 

Manifested in non-ah speech disturbances. 

Low-specificity remarks, on the other hand, 

ate associated with cautious and hesitant 

Speech, rather than with disrupted speech. 

The fact that these two conditions produce 

Tather distinct patterns of interviewee verbal 

behavior, argues against our previous conclu- 

Sion (Pope & Siegman, 1962) that the speci- 
| ficity effect is mediated by anxiety. 

Assuming that the specificity effects are 
Mediated by interviewee uncertainty, we find 
a number of discrepancies between the re- 

Sults of this study and those of previous in- 


ik A assumption is supported by the fact that 
ie who were assigned to the emotional-neutral 
ider tated themselves as significantly more “tense” 
the the emotional segment of the interview than 
e subjects who were assigned to the neutral- 
_ “motional order. 
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vestigations. Thus, Brenner et al, (1963) re- 
port a positive correlation between uncer- 
tainty and non-ah speech disturbances. It 
should be pointed out, however, that these 
investigators used passages varying in their 
degree of approximation to written English 
for generating different levels of uncertainty. 
Thus, in the high uncertainty condition sub- 
jects’ associations interfered with the actual 
words which subjects were required to read, a 
situation which is likely to produce non-ah 
speech disturbances. The results of the pres- 
ent study are also at variance with those re- 
ported by Goldman-Eisler (1961a). This in- 
vestigator found that uncertainty had no ef- 
fect on a measure of ah speech disturbances. 
It is difficult, however, to evaluate this dis- 
crepancy since Goldman-Eisler used a dif- 
ferent measure of ah than was used in the 
present study (i.e., duration of ah’s divided 
by duration of ah’s plus duration of silent 
pauses), Perhaps the discrepancy in relation 
to ah speech disturbances is a consequence 
both of the discrepant measures and of the 
different sources of uncertainty in the two 
studies, It will be recalled that Goldman- 
Eisler asked subjects to describe and to 
formulate the meaning of a series of cartoons 
on the assumption that the latter task in- 
volves a higher level of uncertainty than the 
former, We note also that the response condi- 
tions differ. In an interview situation there 
may be greater pressure on the subject to 
keep on talking than he would encounter in 
the kind of response situation set up by Gold- 
man-Eisler. Consequently, when he is uncer- 
tain about what to say during the interview, 
he does not feel free to remain silent, but 
rather feels compelled to fill in the silent 
pauses with ah’s and variations thereof. 
Finally, it should be pointed out that the evi- 
dence obtained by Maclay and Osgood (1959) 
from spontaneous speech is consistent with 
our own findings regarding uncertainty and 
ah. Thus, it would appear that the effects of 
uncertainty on verbal behavior probably do 
vary with the nature of the task generating 
the uncertainty. 

In conclusion, the findings of the present 
study carry a note of warning to those en- 
gaged in psychiatric interviewing. The prac- 
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titioners of this art have long been aware of 
the signal value of the nonlexical aspects of 
verbal behavior. Thus, Sullivan (1954) has 
noted that: “Much attention may profitably 
be paid to the telltale aspects of intonation, 
rate of speech, difficulty in enunciation . . . 
[p. 5].” As a rule, evidence of caution and 
hesitation as well as speech disruptions is 
interpreted as a sign of anxiety and emotional 
conflict. Our findings, however, suggest that 
not all forms of nonfluency are necessarily 
indicative of anxiety. 
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If a subject gives his impression of a per- 
son described by a set of personality-trait 
adjectives, one might expect that the adjec- 
tives will interact in some way. Thus, the 
contribution of one adjective to the overall 
impression might be thought to depend upon 
“the other adjectives of the set (Asch, 1946). 
A dirgct test of this idea gave it little sup- 
port (Anderson, 1962). In fact, the data were 
reasonably well accounted for by a simple 
average model: the subject acted as though 
each adjective had a definite value, inde- 
pendent of context, and gave as his response 
to any set the arithmetic mean of the values 
of the adjectives. 

The present work takes up two limitations 
of the previous report. In the first place, the 
adjective sets were previously constructed by 
fandom choice. This procedure avoids possi- 
-ble bias in set construction, but it may not be 
Sensitive to interaction present in only a rela- 
tively small proportion of the sets. In the 
Second place, the instructions previously em- 
ployed indicated that all adjectives of a set 
Should be considered as having equal impor- 
tance, With more naturalistic instructions, 
the simple average model might not hold. 

The present experiment was designed to 
explore these two questions. Sets were chosen 
in which strong interactions could be expected 
0n common-sense grounds. In addition, four 
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Ss judged the likableness of persons described by sets of 3 adjectives under 
1 of 4 instruction conditions: 1—each adjective is equally important, 2—the 
adjectives may differ in validity, 3 and 
adjective sets embodied affective and antonymic inconsistency. The results 
were interpreted as indicating that the impression response was an average 
of the scale values of the adjectives. For Condition 1, a simple average model 
worked reasonably well though not perfectly. Data for the remaining conditions 
were interpreted in terms of a weighted average formulation in which incon- 
sistent adjectives received a decreased weight. 


4—1 adjective does not apply. The 


different instruction conditions were em- 
ployed. 


THEORY AND DESIGN 


The primary focus of the experiment is on 
two types of potential inconsistencies among 
the adjectives of a set. In honest-considerate- 
gloomy, the last adjective has a low value 
and is affectively inconsistent with the first 
two adjectives which are of high value, In 
honest-deceitful-gloomy, there is not only an 
analogous affective inconsistency, but also 
a direct contradiction in meaning between 
honest and deceitful. 

This experiment takes up the study of how 
the subject handles such inconsistencies in 
forming his impression. Of course, if the 
subject obeys the simple average model, then 
the inconsistencies will not affect the response. 

Suppose, however, that such inconsistency 
does cause difficulty in forming a unified im- 
pression. One way to handle the inconsistency 
would be to discount the inconsistent adjec- 
tive. In honest-deceitful-gloomy, for instance, 
one would expect that one of the antonyms 
would be discounted; presumably the subject 
would tend to discount honest, basing his 
impression on deceitful-gloomy since these 
are both of low value. His impression would 
then be less favorable than if honest had not 
been discounted. 

Tt is on this general idea that the experi- 
ment is based. The adjective sets that were 
used are illustrated in the left part of Table 
1. Two different adjectives appear in each 
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TABLE 1 
ILLUSTRATIVE 2 X 2 X 2 ADJECTIVE-SET DESIGN, TOGETHER WITH THEORETICAL PREDICTIONS, 
AND SPECIFIC COMPARISON COEFFICIENTS FOR THE INTERACTIONS OF THE 2 X 2 X 2 DESIGN 
Prediction 
Set adjectives Set type Set AB AC BC 
1 2 

honest deceitful gloomy HLL a a’ AıB:ıCı + + + 
honest considerate gloomy HHL 2a 28 AıB:C1 = + = re 
reckless deceitful gloomy LLL 0 0 AsBiCi = — +a 
reckless | considerate gloomy HLL a a A:B:C: + S -A 
honest | deceitful aimless | HLL a a’ | ABC: | + | $ = We 
honest considerate aimless HHL 2a 2B ABa = - + + 

reckless | deceitful aimless LLL 0 0 AsBiC2 | — | + | — | ae 

reckless considerate aimless HLL a a AsB:C; + + +|- 


column, and these are combined in all possi- 
ble ways to yield eight different sets of three 
adjectives each. 

Of the adjectives in Table 1, honest and 
considerate are of high value on a likableness 
scale and are denoted by H. The remaining 
adjectives are of low value and are denoted 
by L. A set with one H and two Ls will be 
denoted by HHL, etc. A set containing an 


antonymic pair will be indicated by an over- : 


bar: honest-deceitful-gloomy is denoted by 
HLL. With this notation, the eight sets of 
Table 1 are of four types: HLL, HHL, LLL, 
and HLL, as listed in the table. 

To illustrate the theoretical predictions, 
suppose for pro tem convenience that each L 
has scale value 0, and that each H has scale 
value 34. 

In the simple average model, the predicted 
response to any set is simply the arithmetic 
mean of the scale values of the several ad- 
jectives of the set. Since all Ls have the value 
0 by assumption, the response to LLL will be 
0. The response to HLL will be (3a +0+ 
0)/3, which equals a. The response to HLL, 
which contains the antonymic pair, will also 
be a. Finally, the response to HHL will be 
(3a + 3a + 0)/3, which equals 2a. These 
predictions of the simple average model are 
given in the Prediction 1 column in Table 1. 

A second set of predictions is obtained if 
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the affective and antonymic inconsistencies 
mentioned above cause the subject to dis 
count one of the adjectives. Consider first 
the two HLL sets in which, presumably, j 
there will be a general tendency to discount 
the H. If so, the response will be based pti- 
marily on the two Ls and will be less favor 
able, therefore, than if the H had not bet 
discounted. If the response to HLL is denoted 
by a', then discounting implies that a’ is les 
than a. 

Predictions for the other three types of set 
are obtained in the same way. For HLL, tit) 
subject again presumably tends to discount 
the H since it is affectively inconsistent with 
the two Ls. Hence the response to HLL, d 
noted by a, will also be lower than the value 
a predicted by the simple average mod e 
For HHL sets, the subject presumably tendi 
to discount the L, and base his impressio 
primarily on the two Hs. His response P 
HHL, denoted by 28, will then be higher that 
the value 2a predicted by the simple averag 
model. Finally, for LLL sets, which prest 
ably contain no marked inconsistency, it f 
supposed that no discounting occurs and na 
the simple average model applies. Thus, © 
predicted response to LLL is 0. 

This set of predictions, based on the 
counting hypothesis, is summarized i 4 
Prediction 2 column in Table 1. Althoté 
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these predictions are only qualitative, no 
more is needed for present purposes. 

It is now necessary to turn to the quanti- 
tative analysis of the two formulations just 
‘set forth, This will be first given in a simple 


illustration, and then stated briefly for the 


full design of Table 1. 

To illustrate the quantitative tests, con- 
sider just the first four sets of Table 1. These 
four sêts form a 2 X 2 factorial design as 
shown in Table 2. The two adjectives in the 
first column of Table 1 are the row (A) fac- 
tor in the design, with the two levels, Ai = 
honest and“A, = reckless. Similarly, the two 
adjectives in the second column of Table 1 
dre the column (B) factor in the. design of 
Table 2, with the two levels, Bı = deceitful 
and Bo = considerate. (The third adjective, 
gloomy, is the same for all four sets and may 
be ignored for the purposes of this illustra- 
tion.) 

The cell entries in the upper part of Table 
2 are the predictions of the simple average 
model taken directly from Table 1. If the 
simple average model is correct, then the 
interaction of this 2 x 2 design is 0. To see 
this, note that the interaction may be com- 
puted algebraically as the difference between 
the two entries in the first row, minus the 
difference between the two entries in the sec- 
ond row. The former difference is 4 — 24, 
and the latter difference is 0 — a. Both dif- 
ferences equal —a, so that A X B interaction 
of the upper part of Table 2 is indeed 0. 

„The lower part of Table 2 gives the pre- 
dictions made on the hypothesis that dis- 
counting occurs. In this case, the A X B in- 
teraction is not 0. The difference between 
the entries in the first row is a’ — 28, and 
the difference between the entries in the sec- 
ond row is O—a. The AXB interaction, 
hamely, the difference between these two 
differences, is (a — 2B) — (0— a)=a tea 
— 28. As noted above, both a’ and a are less 
than a, whereas £ is greater than a. Hence the 
given expression for the A X B interaction is 


less than 0, 


AM above example illustrates how the test 
ane two formulations is reducible to the 
tdinary analysis of variance. Actually, the 


‘full design of Table 1 is a 2 xX 2 X 2 design 
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TABLE 2 
Monet Tests ILLUSTRATED FOR 2 X 2 DESIGN 


Deceitful | Considerate 


Simple average model predictions 


Honest} a 2a 
Reckless} 0 a 
Discounting hypothesis pre- 
dictions 
Honest} a’ 28 
Reckless} 0 a 


which includes a third factor, C, with two 
levels, Cı = gloomy and C = aimless. The 
test of the formulations is thus more elaborate 
than in the above example, but the principle 
remains the same. These tests may be briefly 
summarized, If the simple average model 
is correct, then all four interactions of the 
three-factor design are 0. If the discounting 
hypothesis is correct, then the A X B inter- 
action will be negative. At the same time, 
however, this formulation still implies that 
the remaining interactions will be 0. Thus 
the design has been so chosen that the dis- 
counting effect will be localized in the A X B 


interaction. 


That these significance tests are appropriate is 
shown in more detail in Table 1, Because each 
factor of the design has two levels, each interaction 
has a direct, algebraic expression as a specific com- 
parison (eg Edwards, 1960, p. 189) among the 
responses. The coefficients of the specific compari- 
sons, which are either +1 or —1, are listed in the 
rightmost four columns of the table. To evaluate 
any predicted interaction, the predicted response in 
any row is multiplied by the coefficient in that row, 
and these eight products summed, For example, the 
AXB interaction predicted by the simple average 
model is +a—2a—0+a+a—20—0+ a, which 
equals 0. In the same way, the AXC, BX C, and 
AXBXC interactions are readily shown to be 0 
under the simple average model. 

If the predictions of the Prediction 2 column are 
used in the specific comparison, then the predicted 
AXB interaction is seen to be +a'—2p—0+a 
+a’ —2p—O+a, which equals 2e+20— 4B. 
From the previous reasoning, a’ and œ are both less 
than a, whereas B is greater than a. Hence the 
given expression for the AXB interaction is less 
than 0, and so the test of the AXB interaction 
constitutes a test for the discounting effect. Appli- 
cation of the specific comparison coefficients in the 
remaining columns shows that the AXC, BXC, 
and AXBXC interactions are 0 under the dis- 
counting hypothesis. 
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One final point needs mention. The assumptions 
that each L has value 0, and that each H has value 
3a are in no way essential-to the test of the simple 
average model. Different Hs and Ls may have dif- 
ferent scale values, and these may vary from sub- 
ject to subject; the predictions of 0 interactions are 
the same. These statements will not be verified 
here but they may readily be checked by numerical 
example following the above lines of reasoning. 


Four instruction conditions were used, and 
it remains to relate these to the theoretical 
analysis. In Condition 1, the subjects were 
told that the adjectives of a set represented 
equally important aspects of the personality. 
For this condition, it was expected that the 
behavior would follow the simple average 
model, 

Condition 2 employed a more naturalistic 
set of instructions. The subjects were told, 
essentially, that the adjectives of a set might 
not be equally valid. With these instructions, 
subjects should exhibit whatever natural dis- 
counting tendencies might prevail. Accord- 
ingly, it was expected that the behavior of 
these subjects would follow the second of the 
above formulations and, in particular, exhibit 
a significant A X B interaction. 

In Conditions 3 and 4, the subjects were 
told that one of the adjectives did not ac- 
tually apply, that they should decide which 
one was inapplicable, and base their impres- 
sion on the other two. With these instruc- 
tions, the discounting should be maximal, and 
its size will serve as a reference against which 
to evaluate the effect observed in Condition 2. 


METHOD 
Procedure 


Subjects were presented sets of three adjectives 
descriptive of personality traits. For each set, they 
judged how much they thought they would like a 
person so described. Ratings were made on a 20- 
point scale, ranging from 1 (least liked) to 20 (most 
liked). 

The experimenter read the three adjectives at 
a slow steady rate. The 3 X5 card listing the 
adjectives was then given to the subject who read 
the three adjectives aloud in the reverse order. This 
double exposure procedure was used to minimize in- 
attention as well as to reduce possible effects of order 

_ of presentation. 

Presentation rates were approximately 20 seconds 
per set in Conditions 1 and 2, and 30 seconds in 
Conditions 3 and 4. Pilot work showed that sub- 
jects in Conditions 3 and 4 took longer and the 
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listed times were chosen to allow a comfortable pe. 
riod in which to do the task. 

Each subject was run individually and judged 36 | 
sets. For Condition 1 only, the 36 sets were pre- 
sented a second time after all had been judged once, 
This second presentation, as well as the larger m for 
Condition 1, were employed to get greater power in 
the test of the simple average model. $ 


Instructions 


All subjects were told that they would see several | 
sets of three adjectives, that each set described a per- | 
son, that they were to try to imagine the type Oi | 
person being described, and then to tell how much 
they thought they would like such a person. It was 
emphasized that there was no right or wrong an 
swer, and that the subject should give his own per 
sonal opinion. 

The four experimental conditions were differenti: 
ated according to the further instructions as follows: 


[Condition 1] Imagine that 3 people have each 
contributed 1 word describing the person. These 3 
people all know the person well, and each word is 
equally important in describing the person. Some- 
times, of course, the 3 words may seem inconsist- 
ent. That’s to be expected because each of the 3 
people might see a different part of the persons 
personality. However, all 3 words are accurate, 
and each word is equally important. You shou i 
pay equal attention to each of the 3. Sometimes | 
this may seem hard, but just act naturally, and 
do the best you can. 

{Condition 2] Imagine that 3 people have each 
contributed 1 word describing the person. These 3 
people all know the person well. However; these 3 
people might not all be equally good judges 0 
personality. Consequently, the 3 words might not 
be equally important aspects of the person’s per 
sonality. In order to decide what the person $ 
really like, you might have to pay more attention 
to one word than another, at least in some casts. 
Sometimes this may seem hard, but just act natu- 
rally and do the best you can. y 

[Condition 3] Imagine that 3 people have eat 
contributed 1 word describing the person. 2 iy 
them know the person well and have described 4 
person accurately. However, 1 of the words a 
not belong; it does not actually describe the eh 
son. You must decide which 2 words you 1 
are correct and base your impression on bee H, 
words alone. Sometimes this may seem hard, bi 
just act naturally and do the best you can 3 

[Condition 4] Subjects in Condition 4 wer 
given the same instructions as those in Com chal 
3. In addition, they stated aloud the word f 
they had eliminated from consideration ih each 


Stimuli 


The construction of the experimental sets, oa 
are of primary interest, is described in the nex > and 
section. Of the remaining 24 sets, 18 were fillers £ 
6 were anchors. The 18 fillers were composed of | 
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{rarily chosen adjectives that covered a wide range 
of scale values. Of the 6 anchors, 3 had adjectives 
‘all of very low scale value, and 3 had adjectives all 
of very high scale value. The same fillers and anchors 
were used for all subjects. 

The first 10 sets presented were 6 fillers and 4 
anchors, This warm-up series was designed to de- 
velop a frame of reference within which valid rela- 
tive ratings could be made. The remaining 12 fillers 
and 2 anchors were shuffled together with the ex- 
perimental sets, separately for each subject, and 
these were all presented following the 10 warm-up 
sets, 

The individual adjectives were taken from a dist 

of 3555 common personality-trait adjectives that had 
previously been rated on favorableness dimension by 
100 students. The specific adjectives used in the ex- 
perimental sets are listed below. 
+ Experimental sets. The basic 2 X 2 X 2 design has 
already been illustrated in Table 1 for one antonym 
pair, honest-deceitful. For greater generality, eight 
different antonym pairs were used, yielding eight 
2X2X2 designs, and these were included as an 
orthogonal factor in the complete experiment. The 
construction details for these 2 X 2 X 2 designs, given 
directly below, are not essential for an understanding 
of the experiment. 

The construction began with the selection of four 
triplets of adjectives: deceitful, honest, considerate; 
reckless, careful, artistic; gloomy, cheerful, apprecia- 
tive; and aimless, purposeful, respectful. In each 
triplet, the first word is an L, the second word is an 
H and also an antonym of the first word. These four 
antonym pairs were so chosen that each H was a 
direct antonym to exactly one of the four Ls. The 
third word in each triplet is also an H, chosen to 
have approximately the same scale value and “im- 
pact” as the other H in that triplet, and to not be 
a direct antonym to any of the four Ls. 

From each triplet, a 2 2X2 design was con- 
structed by using the three words of that triplet 
plus the three Ls from the remaining three triplets. 
Thus, the 2X 2X2 design of Table 1 is formed 
using the first triplet listed above, and the Ls from 
the remaining three triplets. The four triplets listed 
above thus generate four 2 2X2 designs, each 
with the same Ls but different Hs. 

An additional four 222 designs were con- 
structed in the same way from the following four 
triplets: careless, dependable, open-minded ; impolite, 
Courteous, forgiving; insecure, self-confident, mod- 
ern; and boring, amusing, generous. 
ae 2X 2X 2 design generates eight sets, as has 
a illustrated in Table 1, and these eight sets are 

Primary interest. For added balancing, however, 
an additional four sets were also used. These were 
es by interchanging the two Hs, correspond- 
a i A; and Ba in each 2X2X2 design, thus 
P ng four additional HLL sets. The data from 

se sets are of subsidiary interest and will not be 


_ Considered unless specifically noted. 
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Subjects 


The subjects were 128 volunteers fulfilling a course 
requirement in introductory psychology. Half were 
of either sex, sex being included as a factor in the 
complete experiment. 

The n’s were 64, 32, 16, and 16 for Conditions 1, 
2, 3, and 4, respectively. Half the subjects of Con- 
ditions 1 and 2, and all the subjects of Conditions 3 
and 4, were run in one replication. The second repli- 
cation, which doubled the n’s for Conditions 1 and 
2, was run to get additional precision for these two 
conditions. Replication differences were small and 
nonsignificant. Within each sex, subjects were 
assigned randomly to conditions. 


RESULTS 


Dependent Variables 


The mean response to each of the eight 
main experimental sets is given in Table 3. 
The table also lists the mean response to the 
four additional HLL sets not included in the 
basic 2 X 2 X 2 design. 

In addition, each subject was given a score 
that corresponded to each interaction of the 
2%2%X2 design. For a given subject, the 
A XB score measures the extent to which 
that subject’s data exhibit an A X B inter- 
action, The A X B score, therefore, consti- 
tutes a proper dependent variable, and is 
used to test the models in accord with the 
above theoretical rationale. 

The A x B score is computed as a specific 
comparison just as was done in obtaining the 
predicted interactions discussed in connection 
with Table 1. For example, if the responses 
of a given subject to the eight sets of Table 1 
were, in descending order, the digits 1 through 
8, the A X B score for that subject would be: 
41-2 -3 +4 +5 —6 —-7 +8, which equals 
0. In this manner, each subject received four 
scores, one for each interaction of the 2 X 2 
x 2 design. The means of these four depend- 
ent variables are listed in Table 4, and each 
was tested in a separate analysis of variance. 

Within an instruction condition, the null 
hypothesis of no net A X B interaction is 
tested by comparing the observed mean A X B 
score against 0. To the extent that the sub- 
jects do show an A X B interaction, the mean 
A XB score will tend to be significantly dif- 
ferent from 0. Similarly, if the size of the true 
A X B interaction varies over conditions, the 
between-groups difference in the A X B score 
will tend to be significant. 
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TABLE 3 


MEAN Impression RESPONSE FOR EXPERIMENTAL 
Sets as A FUNCTION OF INSTRUCTION 


CONDITION 
Instruction condition 
Type 
1 2 3 4 

AıBıCı | HLL | 7.39 6.86% | 7.44| 6.37| 6.06 
A:BiC; |HLL | 6.98 6.36 | 6.84| 5.69| 5.50 
AiB:C; |HHL | 10.88 9.81 | 11.19} 12.69 | 14.56 
AıB:C2 | HHL | 9.88 9.34 | 10.22} 13.38 | 14.75 
AgBiC; |LLL | 4.81 5.00 | 5.22} 4.69] 4.50 
AsBiC2 |LLL | 3.95 3.97 | 4.75] 4,88] 4.00 
AsB:C; |HLL | 7.34 6.97 | 7.06] 5.62] 6.38 
A:B:C3 |HLL | 6.67 6.47 | 6.31) 5.56] 7.12 

HLL*} 7.30 7.05 | 7.55) 658) 5.94 


a First presentation for Condition 1. 
b Second presentation for Condition 1. 
Four additional HLL sets; see text. 


Since this test procedure may be unfa- 
miliar, it may be noted that it is essentially 
equivalent to the appropriate repeated meas- 
urements analysis, but is computationally 
simpler. In addition, the mean A X B score 
is more immediately meaningful than would 
be the corresponding sum of squares in the 
repeated measurements analysis of variance. 


Model Tests 


The critical data for the theoretical for- 
mulations are the mean interaction scores of 
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Table 4. Those means that are significantly 
different from O are marked with an asterisk, 


It may first be noted that none of the ` 


A X C, B X C, or AX B X C means is sig 
nificantly different from 0. Although the sup- 
portive value of these particular nonsignifi- 
cant interactions is limited, they are in ac. 
cord with the theoretical predictions. 

The main test, however, concerns the 
A X B interactions. For Conditions 3 and 4, 
the A X B means are large and significant, as 
was expected. These data reflect the discount- 
ing effect induced by the instructions that one 
of the adjectives did not apply. 

For Condition 2, which used the natural- 
istic instructions, the A x B interaction is 
also significantly different from 0. These sub- 
jects also show a discounting effect. Its size, 
however, is somewhat less than a third of the 
maximum effect as obtained under Conditions 
3 and 4. 

For Condition 1, the A x B mean for the 


first presentation is nonsignificant. However, | 


the value of —1.46 for the second presenta- 
tion is significant, F = 4.14, df = 1/55. Ab 
though the situation is thus not entirely cleat, 
the best conclusion would seem to be that 
there is a real, though small, A X B effect for 
Condition 1. Its magnitude is about a tenth 
of the maximal effect as obtained in Condi- 
tions 3 and 4. 


Set Type 


The A X B score is itself a composite of disco 
ing effects from the various types of sets, and m 


TABLE 4 


MEAN INTERACTION SCORES AND STANDARD ERRORS 
FOR THE Four Instruction CONDITIONS 


Instruction condition 

1 4 
(n = 64) (n 2 32) (n 216) (nae 
Tp feras Hon 75 

XB =1.128 — 1.460 3 —12.38* —12. 
AXC =0.12 =0:56 acy Og 0.62 
teh xc Soe ee S|: te | ae 
Standard error ate poy Thid Sa a 

XB 0.74 0.72 2 
AXGBXCAXBXC| 077 0.61 109 135 15 


à First presentation for Condition 1, 
» Second presentation for Condition 1, 
* Significantly different from 0. 


detailed information on the origin of the A X B in- 


An ad hoc test was made by summing the re- 
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teraction requires closer study of Table 3. Since rela- 
tively little discounting apparently occurred in Con- 
dition 1, it will be compared with Conditions 3 and 
4 combined: the difference in response will then be 
a measure of the discounting effect induced by the 
instructions in Conditions 3 and 4. 

For HLL sets, Table 3 yields a mean of 7,00 for 
Condition 1, and a mean of 6.17 for Conditions 3 
and 4. The discounting effect is thus .83. The corre- 
sponding difference for the four additional HLL sets 
was 1.04, It should be noted that these values are 
not much smaller than the discounting effect of 
1.28 for the HLL antonym sets. 3 

In contrast, the discounting effect for HHL sets 
appears fairly large, The Condition 1 mean is 10.38, 
compared to 13.84 for Conditions 3 and 4, yielding 
a discounting effect of 3.46. 


sponses to the two HLL sets and the two HHL sets. 
Since the discounting effect goes in opposite direc- 
tions for these two types of set, comparison of Con- 
dition 1 with Conditions 3 and 4 on this sum score 
actually tests whether the magnitude of the dis- 
counting effect is different for the two set types. In 
this analysis, the condition effect was significant, 
F=4,53, df= 1/85, indicating that the discounting 
was greater for HHL than for HLL. The same test 
comparing HHL and HLL was also significant, F = 
5.99, df= 1/85. 

ThfS result was somewhat surprising since it had 
been expected that discounting would occur mainly 
because of the antonymic inconsistency in the HLL 
Sets. However, although the data illustrate a useful 
method of analysis, the result must be considered 
tentative since the test was ad hoc, and since the 
experiment was not designed to test this particular 
question and may not have been completely ade- 
quate to do so. 


Eliminations 


The tabulations of the adjectives actually elimi- 
hated by the subjects in Condition 4 are in general 
agreement with the results just given. In the HLL 
Sets, an antonym was always eliminated, but in only 
22 of the 32 instances was this antonym the H. In 
the HHL and HLL sets, the eliminations largely re- 
flected the formal affective inconsistency. The L was 
eliminated from HHL in 28 of the 32 instances; the 
H was eliminated from HLL in 24 of the 32 in- 
Stances, The four additional HLL sets provide an 
Additional 64 instances and the H was eliminated in 
46 of these. 

With simultaneous presentation, HLL and LLH 
fhe equivalent set types. On a purely formal basis, 
t crefore, HHL and HLL are symmetric and have 
Te same formal affective inconsistency. However, 

€ above elimination data indicated a greater tend- 


PNY to discount the odd adjective in HHL than in 


ws and this difference was significant by ad hoc 
est, P= 4.62, df = 1/18. 
e elimination data thus suggest that HH may 


Sonstitute a stronger or more natural unit than does 
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LL, and the above data on discounting effects for 
HHL and HLL agree with this idea, Of course, other 
explanations, such as response bias toward favorable 
evaluations, cannot be ruled out with the present 
data. 


Statistical Analysis 


The complete analysis of the interaction scores re- 
quires some further detail. For each interaction score, 
the main effects of Condition, Sex, 2 X 2 X 2 Design, 
and Condition X Sex interaction were evaluated, The 
remaining interactions were pooled with error by 
prior decision. No effects were significant in any of 
these analyses except, of course, the Condition effect 
for the AX B scores which yielded F=17.38, df= 
3/113. 

The error mean squares for AX C, BXC, and 
AXBXC were roughly equal, as would be ex- 
pected in the absence of a real effect. These error 
mean squares were accordingly pooled and reduced 
to standard errors as listed in Table 4. The corre- 
sponding standard error for the second presentation 
of Condition 1 was computed separately. For the 
AXB means, the listed standard errors were com- 
puted separately for the first, and for the second, 
presentations of Condition 1, for Condition 2, and 
for Conditions 3 and 4 combined. 

The interactions means of Table 4 had also to be 
tested against 0. The statement that none of the 
AXC, BX C, or AXBXC means was significant 
rests on the nonsignificance of the two sources, Mean 
and Conditions, in the overall analyses. (The F for 
Mean tests the null hypothesis that the grand mean 
over conditions is 0; e.g., Anderson, 1960.) Each 
AXB mean was tested against 0 separately. 


Discussion 


The personality impression task may be 
considered as one of integrating information. 
Thus, the subject is given several discrete bits 
of information which he is to combine or inte- 
grate into a single impression, A central theo- 
retical problem is how this integration takes 
place. 

The present results are consistent with the 
idea that the integration is an averaging proc- 
ess: that each stimulus has a value to the 
subject, and that he arrives at his impression 
by averaging these values. 

For Condition 1, it had been expected that 
the simple average model would hold. This 
model implies that all interactions of the 2 x 
2 x 2 design are 0, and the data are for the 
most part consistent with this implication. 
Nevertheless, it is true that the A X B inter- 


‘action for the second presentation was signifi- 


cant, and the best conclusion would seem to 
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be that there was a small but real discount- 
ing effect. 

In the present experiment, Conditions 3 
and 4 provide a useful standard of compari- 
son for Condition 1. The instructions to elimi- 
nate one adjective presumably yield the maxi- 
mum discounting effect available in the given 
circumstances. Against this reference, the A 
X B interaction in Condition 1 may reason- 
ably be considered small since it is only about 
a tenth the size of that in Conditions 3 and 
4. It would appear, therefore, that the simple 
average model gives a good account of the 
data of Condition 1, This conclusion agrees 
with that of previous work with a single sub- 
ject design (Anderson, 1962) in which the 
simple average model performed quite well 
though not perfectly. In the present experi- 
ment, moreover, it is not unreasonable to sug- 
gest that what discrepancy was observed re- 
flects a failure of the instructions to eliminate 
discounting completely, and that even better 
agreement between model and data would 
be expected with less marked inconsistencies 
among the stimuli. 

It might be argued that even had Condi- 
tion 1 obeyed the simple average model ex- 
actly, this would show no more than that the 
subjects were able to follow the instructions 
that each adjective should be given equal im- 
portance. These instructions were of course 
used because equal weighting is a necessary 
condition for applicability of the simple av- 
erage model (although this condition can be 
somewhat relaxed by appropriate experimen- 
tal design). Nevertheless, equal weighting is 
not a sufficient condition for the simple aver- 
age model, and the instructions imposed no 
further restriction on the information integra- 
tion process than that the weighting should 
be equal. Thus, if the meanings of the adjec- 
tives varied with context, or if their values 
were subject to assimilation or contrast, the 
simple average model would not apply. 

For Conditions 3 and 4, of course, the 
simple average model is clearly inadequate 
since the A X B interaction was large. For 
these conditions, it is necessary to employ a 
weighted average formulation. 

In the weighted average model, the re- 
sponse is an average of the scale values of 
the stimuli, but different stimuli may receive 
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different weights. The theoretical expression 
for the response is Sw;5;, where wp and s, 
are the weight and scale value associated with 
the Ath stimulus of the set, and Sw; = 1, For 
sets with just two stimuli, the cited expression 
would reduce to ws; + (1 — w)sz. The sim 
ple average model is just the special case in | 
which the w+ are all equal. i| 

Now discounting may be interpreted simply | 
as the assignment of a lesser weight to the 
discounted adjective. For Conditions 3 and | 
4,*the instructions that one adjective was 
inapplicable amount to assigning that ad- 
jective 0 weight in the impression. If the 
two remaining adjectives received equal 
weight, then the weights for a given set would 
be 0, 4, and 4. The experiment was sO de- 
signed that the effects of such weighting | 
would be localized in the A X B interaction, 
and the significance of the interaction is thus 
consistent with the weighted average formula- 
tion. The nonsignificance of the remaining in- 
teractions gives at least some further support | 
to the averaging formulation. “| 

In the naturalistic instructions of Condi: 
tion 2, the subjects were told that the adjec: | 
tives might not be equally valid, and their 
A X B interaction was about a third the size 
of that in Conditions 3 and 4. Presumably, 
therefore, where the subjects of Conditions 
and 4 completely discounted one adjective, 
the subjects of Condition 2 did so only pad 
tially. In the weighted average formulation, 
such partial discounting would amount to asi 
signing a weight greater than 0 but less than 
4. It should be noted that this interpretation 
of Condition 2 would be considerably less 
if the other instruction conditions had ™ 
been run. 

Tt is thus seen that the present data ate 
consistent with the averaging formulation. 
Moreover, related work on effects of ordet > 
presentation, obtained when the adiit t 
are given serially rather than simultaneous 
has also supported an averaging formulati | 
(Anderson, 1965). The data are not âs m 
conclusive, but this formulation does Si" 
considerable promise. i 

For both simultaneous and serial preset a 
tion, it is necessary to allow for the ae 
bility of differential weighting of the i h 
and this possibility requires additional © À 


-_— | 


“ment. In general, three main sources of dif- 
ferential weighting may be distinguished. 
First, unequal weighting may be imposed ex- 
perimentally. For instance, the subject could 
“be told that a specified piece of information 
had high or low relevance. Second, the infor- 
“mation items might naturally differ in weight. 
‘Jn the present task, for example, a physical 
“trait such as tall would presumably carry less 
“weight than the personality-trait adjectives. 
For the adjectives themselves, the work of 
Podell and Podell (1963) suggests that ex- 
“treme adjectives may carry greater weight 
“than adjectives of intermediate value. Third, 
and finally, the weights could vary as a func- 
tion of context. 

In the present situation, it is evident that 
the weights were largely determined by the 
context. In Condition 4, at least, the subject 
presumably eliminated that adjective that fit 
least well with the other two, and no doubt 
similar weighting decisions occurred in the 
«Other conditions as well. Such weighting de- 
cisions aré part of the total impression forma- 
tion process and it is clear that, even though 
the averaging formulation shows promise in 
accounting for the impression response data 

"per se, it cannot completely account for this 
t form of differential weighting. In this respect, 
it would appear that the adjectives do inter- 
act in forming the impression, a conclusion in 
harmony with the views of Asch (1946). The 


on the nature and extent of such interaction 
processes, but further study of the adjective 
elimination task itself may prove fruitful. 
Finally, the present results bear on the in- 
terpretation of the primacy, or first impres- 
‘lon effect obtained when the adjectives are 
aha serially (e.g., Anderson, 1965). Re- 
ited work has also found, however, that if 
the subject is forced to pay closer attention 
to all the adjectives, this primacy effect is 
eliminated (Anderson & Hubert, 1963; Stew- 
art, 1965). These results Have been taken to 
Mean that the primacy effect ordinarily ob- 
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present results give only limited information - 
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tained is due to decreased attention to, and 
hence decreased weights for, the later adjec- 
tives. 

The present data, especially for Condition 
2, suggest the possibility that the affective in- 
consistency between the earlier and later ad- 
jectives may cause the subject to discount 
the later adjectives. Thus, a primacy effect 
would be obtained if the subject discounted 
the terminal Ls in the sequence HHHLLL, 
and the terminal Hs in the reverse sequence 
LLLHHH, If this suggestion is correct, the 
attention decrement explanation of the cited 
results would be appropriate at a gross level 
only. Equalization of attention to the adjec- 
tives may simply have reduced the effect of 
the affective inconsistency between the first 
and second halves of the sequence. To the ex- 
tent that this is true, attention would act by 
modifying the inconsistency resolution process 
rather than by directly affecting the weights 
of the later adjectives. 
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SOME DETERMINANTS OF SOCIAL DISTANCE AMONG 
AMERICAN, GERMAN, AND JAPANESE STUDENTS* \ 
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Scales of social distance were independently standardized in Germany, Illinois, 
and Japan. Approximately 100 Ss, from each culture, responded to a question- 
naire consisting of complex stimulus persons (varying simultaneously in race, 
occupation, religion, and nationality) and social-distance statements. Race, 
occupation, religion, and nationality—in that order—are the important de- 
terminants of social distance in Illinois; occupation, religion, race, and na- 
tionality—in that order—in Germany; occupation, race, and nationality in 
Japan. The social-distance scores were not related to any response style 
characteristics of the Ss, but Ss of differing personality types show differing 
degrees of social distance. The F Scale, corrected for acquiescence response set, 
was moderately correlated to some of the social-distance scores in all 3 cultures. 


Since Bogardus’ (1928) early use of the 
notion of social distance and his employment 
of a social-distance scale, much social psycho- 
logical research has focused on this topic. A 
recent increase of interest is reflected in pa- 
pers by Banton (1960), Triandis and Triandis 
(1960, 1962), Triandis (1961), Vaughan 
(1962), and others. 

Broadly speaking, there are three sources 
of variance that determine the social distance 
that a person experiences towards another 
person. These are the culture of the perceiver, 
his personality, and the characteristics of the 
perceived person. These sources of variance 
often operate as interactions. That is, in cer- 
tain cultures, certain characteristics of per- 
sons are considered particularly objectionable, 
and persons with certain kinds of personali- 
ties are likely to experience particularly large 
amounts of social distance towards persons 
having these characteristics, 


1 This study was supported by Grant M-6151(A) 
from the National Institute of Mental Health, United 
States Public Health Service. In all phases Triandis 
was assisted by Eleanor R. Hall. The German data 
were collected by Davis with the assistance of J. 
Hamm. The Japanese data were collected by Take- 
zawa, with the assistance of Keiko Ishikawa. The 
data from all three cultures were analyzed by D. 
Yeidel, Josephine Naidoo, M. Johnson, and M. Edel- 
man, The authors are grateful to Martin Fishbein, 
Ivan Steiner, and Leigh Triandis for critical com- 
ments on an earlier version of the manuscript. 

2 Now at the University of Illinois. 
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In order to understand the phenomena a 
sociated with social distance, it is desirable) 
to obtain systematic evidence from subjects 
of different cultures, having different perso 
alities, who are confronted with stimulus pe 
sons having a variety of potentially objection 
able characteristics. Uniformities in such data 
may suggest general laws that account fo 
the majority of social-distance phenomena, 7 

Systematic accumulation of the type af 
data described above requires a standardize 
methodology that may be employed in ig 
ferent cultures with some degree of oi 
rability. Triandis and Triandis (1960, 1962) 
developed such a methodology and employ 
it in a comparison of data obtained from 
Greek and American students. The presei 
paper extends this work to German and Jap? 
nese students. ‘el 

Triandis and Triandis (1960, 1962) f° 
a number of propositions concerning te 
terminants of social distance. Briefly, 
argued that social distance depends 0n f 
characteristics employed by a cultural ie 
in the definition of what persons consti í 
the ingroup and what persons constitute T 
outgroup; the norms of behaviors that E. 
propriate towards members of variou i | 
groups; the personal insecurity experié if 
by an individual towards persons who ate 2 
like himself. Social distance was seen mite 
termined by a complex function invo 
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cultural norms and personality. Their data 
suggested that differences in cultural norms 
should be given a much larger weight than 
personality differences in the specification of 
this function. 

The present paper examines whether these 
propositions apply to data obtained from Ger- 
man and Japanese subjects. 


METHOD 


The study required three steps: first, a 
standardization of the social-distance scale 
and basic questionnaire to be used in each 
culture; second, the administration of the 
“questionnaire developed in Step 1 to a new 
sample of members of each culture; third, 
the translation of the German and Japanese 
questionnaires into English and the adminis- 
tration of these questionnaires to American 
subjects. Since the original (Triandis & Tri- 
andis, 1960) work was done with American 
subjects, Step 3 permitted comparisons to 
preyious work. 


Samples Employed and Characteristics of the 
Subjects 


Step 1, the standardization, employed 95 senior 
high-school students from Cologne, Germany, and 
113 Japanese university juniors in the spring of 
1962, The American standardization was done by 
Triandis and Triandis (1960). Step 2, the main Ger- 
man and Japanese studies, employed 100 male senior 
high-school students from Hannover, Germany, and 
67 men and 33 women university students from 
Tokyo, Japan. The data were collected in the fall 
of 1962. Step 3, the American replication with ma- 
‘terials corresponding to the German and Japanese 
studies, was done on one sample of 100 University 
of Illinois males and another sample of 97 Tllinois 
males, for the German and Japanese materials, re- 
spectively, The sample corresponding to the German 
study was obtained in the spring of 1963. The one 
corresponding to the Japanese sample, in the fall of 
1963. The first sample was given some of the Japa- 
nese and German stimuli, in order to study the 
Parallel form reliability of our indices. Some sub- 
jects who obviously misunderstood the instructions 
ere dropped from the analysis. Thus, the main 
analysis is based on 88 Germans and 93 correspond- 
ing Americans, as well as 100 Japanese and 90 corre- 
ere Americans. An attempt was made to select 

€ subjects so that they were equivalent in socio- 
ee and other background variables, but prob- 
ae of availability of subjects necessitated some 

+ deviations from this ideal. Thus, the German sub- 
Jects had a mean age of 18 years, the Japanese a 
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mean age of 21, and the Americans, of 19.5.3 The 
occupation and education of the parents, as reported 
by the subjects, would place the Germans some- 
where between lower and upper middle class, the 
Japanese somewhat higher, and the Americans at 
about the same position as the Germans. 

The religious preferences of the subjects in the 
various samples were markedly different. The sam- 
ples of 93 and 90 Americans included, respectively, 
56 and 45 Protestants, 13 and 19 Roman Catholics, 
12 and 16 Jewish subjects, and 12 and 10 subjects 
indicating they had no religious affiliation. The Ger- 
man sample consisted of 74 Protestants, 8 Catholics, 
and 6 nonreligious subjects. The Japanese sample 
consisted of 17 Protestants, 2 Roman Catholics, 14 
Buddhists, 65 nonreligious, and 2 “other.” Thus, 
while the majority of the American and German 
samples were Protestants, the majority of the Japa- 
nese sample were nonreligious. 


Standardization of Social-Distance Scales 


The American scale was standardized by means of 
Thurstone’s successive intervals procedure (Edwards, 
1957b). Davis and Takezawa translated the 15 items 
of the American scale into German and Japanese, 
respectively, and added attitude statements which 
they considered particularly relevant in their respec- 
tive cultures. The statements were then presented to 
95 German and 113 Japanese subjects who judged 
the degree of social distance implied by agreement 
with each of the statements. These judgments were 
scaled by the Thurstone successive intervals method. 
The statements with unimodal distributions and 
small interquartile ranges, which were located at ap- 
proximately equal intervals, were then employed in 
the German and Japanese social-distance scales. The 
scale values obtained are shown in Table A‘ In ad- 
dition, the subjects provided rankings of a number 
of occupations (in terms of prestige) and a number 
of nationalities, religions, and races (in terms of 
liking). The rankings are shown in Table B (see 
Footnote 4). 


The Questionnaire 


The main questionnaire presented stimulus persons 
varying simultaneously in race, religion, occupation, 
and nationality. Each page of the questionnaire had 
one such stimulus at the top, and 15 or 17 state- 
ments of social distance under it. The subjects were 
asked to check whether they would or would not 


3 These age differences are not important. We cor- 
related the main scores of the study with the sub- 
ject’s age and found no significant trends in any of 
the samples. 

4Tables A, B, C, D, and E have been deposited 
with the American Documentation Institute. Order 
Document No. 8500 from ADI Auxiliary Publications 
Project, Photoduplication Service, Library of Con- 
gress, Washington, D. C. 20540. Remit in advance 
$1.25 for microfilm or $1.25 for photocopies and 
make checks payable to: Chief, Photoduplication 
Service, Library of Congress. 
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undertake the particular behaviors described by the 
social-distance statements. For instance, a stimulus 
person might be a “Portuguese, Mongoloid, Physi- 
cian, Roman Catholic” (the Portuguese colony of 
Cacao makes this a credible stimulus), and the sub- 
ject might be asked to indicate if he would accept 
such a person as an intimate friend, exclude this 
person from the neighborhood, invite this person to 
stay at his house, etc. 

Each stimulus consisted of elements that were 
either high or low in value on the characteristics 
race, occupation, religion, and nationality. The ele- 
ments were arranged so that they formed a factorial 
design. For the German subjects the characteristics 
Negro-White, Physician-Unskilled Laborer, Protes- 
tant-Catholic-Jewish, and German-American-Italian 
were used. This constituted a 2 X 2X 3 X 3 factorial 
design. For the Japanese subjects the character- 
istics White-Mongoloid-Negro, Physician-Unskilled 
Laborer, Protestant-Catholic-Jewish, and Japanese- 
American-Portuguese were used. This constituted a 
3X2X3X3 design. The pretest information about 
the rankings of such characteristics, in the various 
cultures, was used in the determination of the stimu- 
lus persons, Since the number of stimulus persons for 
a complete design was rather large (36 for the Ger- 
man, 54 for the Japanese), and the number of judg- 
ments required of the subjects too large (it is equal 
to the number of stimuli times the number of state- 
ments of social distance), the complete designs were 
not employed in either culture. Davis felt that the 
use of Negro Germans as stimuli would give mislead- 
ing results, because such persons are most frequently 
illegitimate. He also felt Negro Jews would produce 
confusion, Thus, we dropped these stimulus persons 
from the German design, Takezawa felt that religion 
would be of little importance for the J apanese sub- 
jects. The majority of the Japanese subjects were 
nonreligious. Thus, the majority of the Jewish stimu- 
lus persons were dropped from the Japanese design. 
The American subjects answered questionnaires hav- 
ing exactly the same stimuli as those used in the 
German or Japanese designs, depending on which 
study was being replicated by the American subjects. 

In addition to the social-distance questionnaires, 
the subjects in each culture were asked to respond 
to questions concerning their age, sex, social class, 
religion, and other demographic characteristics, and 
to some personality inventories. All subjects were 
given the 10 positive and 10 reverse F Scale items 
previously employed by Triandis and Triandis, trans- 
lated into the appropriate language. In addition, 


5 Table C (see Footnote 4) shows the correlations 
between the F+ and F— items in the three cultures. 
They range from —.35 to —45 and are significant at 
$ < .001. The F+ items correlate about .60 with 
acquiescence response set and about .80 with the 
authoritarian content score corrected for response 
set (Messick, 1961), in all three cultures, The acquies- 
cence and authoritarian-content-corrected-for-acqui- 
escence scores correlate .22 (p< 05), 32 (p< 01), 
and .37 (p< .001) in Germany, Japan, and Illinois, 
respectively. 
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the subjects in the various samples were given the 
instruments shown below. ' 

The Germans were given 13 items plus 4 file Ų 
items from the as yet unpublished Frankfurt Au- | 
thoritarianism scale developed by Adorno and asso- 
ciates, the Brengelmann and Brengelmann (1960a) } 
14-item Dogmatism and 14-item Intolerance for Am- 
biguity scales, and the Brengelmann and Brengel- 
mann (1960b) 16-item Extraversion, 20-item Neu- 
rotic Tendency, and 20-item Neurotic Rigidity scales, | 
All these instruments had been validated with Ger 
man subjects prior to their use in the present study, 

The Japanese subjects, in addition to the positive | 
and-negative F Scale items, were asked to: respond 
to questions concerning the way their parents 
brought them up; rate their home atmosphere on Il _} 
semantic-differential scales such as warm-cold, demo- | 
cratic-authoritarian, etc.; indicate the frequency and | 
type of punishment administered by their parents; | 
and identify the most important socializing agent 
(father, mother, etc.) in the history of their up- 
bringing. Triandis and Triandis (1962) had found 
that these items discriminated among high and low 
social-distance subjects in both American and Greek 
student samples. The items were administered in 
Japan for exploratory purposes. 

The American subjects responded to a battery of | 
attitude and personality measures. Four sets of | 
sociopolitical items that were factor analyzed by 
Davis, Wrigley, and Castelein (1960) © were ad: ii 


®Ten of these were designed to measure what | 
Davis et al. identified as the Religious Conformity 
factor. The subject obtained a high score om bs) 
scale by agreeing with items such as “The universe 
was created by God” and “Only by going nae i 
religion can civilization hope to survive,” and by x k 
agreeing with items such as “The idea of God Ke 
invention of the human mind” and “Free love i 
tween men and women should be encouraged sant 
means towards mental and physical health.” Ano! ol d 
10 items represented the Davis et al. Tolerance 4 
Minorities and Deviants factor. The subject obt { 
a high score on this scale by agreeing with ee 
such as “Jews are as valuable citizens as any rill 
group” and “All forms of discrimination aoe i 
colored races, the Jews, etc. should be made i ee 
and subject to heavy penalties,” and by diss 4 
with items such as “It would be best to keep a at 
people in their own districts and schools, in orde 
prevent too much contact with whites,” and 
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4 i ore 
tions scale. The subject who obtained a Ao 
on this scale agreed with items such as hpi 4 


private property should be abolished and old 
socialism introduced” and “Sunday observance Ha a 
fashioned and should cease to govern our beha yin 
and disagreed with items such as “In taking ee a 
any form of world organization this country sl and 
make certain that none of its independence it” 
power is lost” and “People should realize’ thal 84 
greatest obligation is to their family.” Fi nding 
items represented the Sympathetic Understa 
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ministered to the subjects. In addition, the Ameri- 
can subjects were administered the Budner (1962) 
Intolerance for Ambiguity scale. 

All questions administered to the Japanese sub- 
jects were also administered to the American sub- 
jects. Furthermore, in order to study the relation- 
ship between social-distance scores and the stylistic 
behavior of the subjects, a questionnaire developed 
by Wiggins was employed with one of the American 
samples.” 


Scoring 

The German data and the corresponding American 
data were scored following one procedure, while*the 
Japanese data and the corresponding American data 
were scored using a slightly modified procedure.® 
This scoting procedure resulted in a number between 
1 and 15 for the German, and a number between 0 
‘und 11 for the Japanese data, for each of the stimuli 
judged by the subjects. Since the German subjects 
(and, of course, the corresponding Americans) judged 
26 stimuli and the Japanese subjects (and the corre- 
sponding Americans) 36 stimuli, for each subject 
there was information about the way he judged 26 
or 36 stimulus persons. 

In order to study the style of the subjects in an- 
swering the social-distance questionnaires, we com- 
puted the following additional scores. A score %, 
which is the average social distance experienced to- 
wardeall the stimuli he responded to, and a score Y, 
which is the variance of the social-distance scores ob- 
tained from a given subject. The x score must be 
considered as a measure of the “aloofness” of the 
subject. Having a high x score would indicate re- 
jecting people in general, no matter what their char- 
acteristics, The y score is a measure of perceptual 
discrimination, or “prejudice,” since it indicates a 
high differentiation between “preferred” and “non- 
preferred” stimulus persons. 


factor, The subject obtains a high score on this scale 
by agreeing with “Our treatment of criminals is too 
harsh; we should try to cure them, not punish them” 
and “A white lie is often a good thing” and by dis- 
dgreeing with items such as “Homosexuals are hardly 
better than criminals and ought to be severely pun- 
ished” and “Birth control, except when recommended 
by a doctor, should be made illegal.” 
pets questionnaire included at least 10 items 
ae each of the following scales: Wiggins (1959) 
Berl Desirability scale, which measures the subjects’ 
endency to lie in MMPI item responses; the Ed- 
wards (1957a) Social Desirability scale; the Asch 
pe) Speed of Decision Test; the Crowne and 
pare (1960) Social Desirability scale; the Wig- 
M 3 (1962) Acquiescence Scale; the Jackson and 
TE ( 1957) Reversed Fascism Scale; the Couch 
Keniston (1960) Agreement Response Set; the 
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(1950) Fascism Scale; and the Barron (1953) Inde- 
Pendence of Judgment Scale. 
The details of the scoring procedure have been 


« deposited as Table E with the American Documenta- 


tion Institute (see Footnote 4). 
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The parallel form reliability of these two scores, 
for the American subjects who responded to both 
the German and Japanese replications (V = 93), was 
77 for x and .73 for y. Considering that the German 
and Japanese stimuli were not the same, these reli- 
abilities are very high. 

In addition to the x and y scores described above, 
we computed for each subject score Dre and Dra. To 
compute Dye, a score measuring social distance due 
to differences in religion, we summed the social-dis- 
tance responses towards stimulus persons with the 
least and with the most preferred religion, and ob- 


tained the difference. The higher this index, the 


greater the social distance towards the least preferred 
stimulus persons. The Dra score measures distance 
due to differences in race and was computed by sub- 
tracting the sum of the social distance toward the 
most preferred race from the sum of the social dis- 
tance toward the least preferred race. For the 93 
American subjects these two scores were correlated 
49 (p<.0001). To obtain an estimate of the reli- 
ability of these scores, we computed the Dra scores 
for the stimuli employed in the German and Japa- 
nese replications from the data obtained from the 
American subjects. This parallel form reliability was 
85. 


RESULTS 
Reliability 
Test-retest reliabilities were obtained by 
repeating the same stimuli in different parts 
of the questionnaire and arranging the ele- 
ments in different orders. The reliability of 


the scores employed in the present paper was 
found to exceed .90 in all three cultures. 


Possible Stylistic Determination of the Social- 
Distance Scores 


Since the results obtained in the measure- 
ment of social distance may be due to the 
checking style of the subjects, it is desirable 
to know what characteristics of this checking 
style relate to our various scores. The results 
of the analyses of variance depend on scores 
similar to the Dre and Dra scores. To check 
the stylistic determination of these scores we 
computed, for the American sample, the cor- 
relations between these scores and the Wig- 
gins inventory of Stylistic scales. The results 
will be reported for the Japanese and the Ger- 
man stimuli separately. Since the Japanese 
stimuli did not permit large differentiations 
due to religion, the Dre score for this set of 
stimuli was not computed. Thus, we have only 
the D,. and Dra scores from the German and 
the Da scores from the Japanese stimuli. 
Table 1 shows the relevant correlations. Most 
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TABLE 1 
CORRELATIONS BETWEEN THE Dye AND Dra SCORES AND OTHER VARIABLES: 
AMERICAN SAMPLE OF 93 SUBJECTS 
German stimuli 
Variable 
Dre Dra Dn 
Stylistic variables fg 
‘Triandis Acquiescence Response score .13 E 
Couch and Keniston Agreement Response Set .25* a 
Wiggins Acquiescence Scale 17 T 
Wiggins Social Desirability scale .18 E 
Edwards Social Desirability scale .08 a 
Asch Speed of Decision Test J; —.16 T 
Crowne and Marlow Social Desirability scale .07 Hi 
Barron’s Independence of Judgment scale —.24* Fi 
Budner’s Tolerance for Ambiguity —.12 =a 


Attitudinal variables 
Wiggins items of Reversed Fascism Scale 
Wiggins items of Fascism Scale 
Triandis items of Reversed Fascism Scale 
Triandis items of Fascism Scale 


Davis et al. Tolerance for Minorities and Deviants 
Davis et al. Criticism of Social Institutions 

Davis et al. Religious Conformity 

Davis et al. Sympathetic Understanding 


*p <.05. 
*k p <01. 
wkp < 001, 


of the correlations with stylistic variables are 
small and nonsignificant, while the correla- 
tions with the attitudinal variables are sub- 
stantial and significant. Nevertheless, there is 
a tendency for highly “prejudiced” subjects 
to show a high mean agreement response set 
and a low independence of judgment. These 
results suggest that stylistic factors have a 
relatively small influence on the D,. and D,a 
scores. 

Another check involved the relationship be- 
tween the x (subject’s aloofness) and y (sub- 
ject’s prejudice) scores and the stylistic vari- 
ables. The pattern of correlations of this 
analysis was very similar to that of the previ- 
ous analysis, and will not be reported in the 
present paper. 


Relative Importance of Race, Occupation, Re- 
ligion, and Nationality in the Three Cultures 


Since the German and Japanese studies em- 
ployed different stimuli, and somewhat dif- 
ferent scoring procedures, the appropriate 
comparisons are only those between these 
two cultures and the corresponding American 
“replications.” The two American studies em- 


d —.05 
“30% 21* 
—.10 —.14 
. 27 21% 
Authoritarian content corrected response set (Triandis items) È 33k 25% 
é — 70% — 5 
. — ADH — 34H 
. .04 06 j 
D —.22* —.25* 
y 
ployed stimuli and scoring procedures identi 


cal to those used with the German and Japa 
nese subjects, respectively. Differences þe- 
tween the two American sets of results may be 
used to assess the influence of the choice 0 
stimuli, and scoring procedures, on the ob- 
tained results. 

It is clear that the analysis of variance Te 
sults may not be interpreted without refer 
ence to the “level” of the “treatments.” Fo 
instance, the German study employed Whith 


Negro as the treatments for the race va 


riable, 
White 


while the Japanese study employed be 
Negro-Mongoloid. This difference must P 
reflected in the results. i 

Table 2 presents the results of the analyst 
of variance. The analyses are based on 
judgments of subjects who are homogeneo 
on religion. 


: 

us 

jghts 

First, the American-Protestant-Male weigh : 


(percentage of variances) are fairly a 


in spite of the use of different America this 
jects and different stimulus persons 1 i 
comparison. Race accounts for about a 
the variance of the social-distance judgm® the 
occupation about 29%. The difference 1 


SUMMARY OF ANALYSES OF VARIANCE: PERCENTAGE OF 


TABLE 2 


VARIANCE OF SOCIAL DISTANCE ACCOUNTED FOR BY CERTAIN VARIABLES 


American male 


German stimulus persons 


American male 


Japanese stimulus persons 


. source rye a) Male Male Male Protestants | Nonreligious | Nonreligious Buddhist Protestant 
Protestants Catholics nonreligious females males males males, 
(N= 74) N = 8) N = 6) N = 28) wN = 37 (N= 14) w = 17) 
Race (An 59,8 6.10 3.59* 3.30" 57.0% 20.3 37.60% 36.0%" 43.0 
Occupation (B) 22,3 69.94% 82.20" 71.8" 35.5% 72.5 50.5 53.16 47,50 
Religion (C) 5.7 12.4% 6.1 7.0% 1.38 ae = S 1.4* 
Nationality (D) 1,0%% 2.2* 1.26 6.7 4. 1.2 3.20 3.8% 4.6%" 
AXB kiad 3.248 = = 3.30 2.9% = 
AXC 
AXxD si 7% st 1.08 = = 
BXC 4.5* 3.00 2.78% 
BXD = 16 2.50 11 an 
cxD 
Range of average scores 1,23-10.01 1.96-7.15 1.76-10.50 -11-4.62 -29-6.29 +72-6.53 42-6.83 .67-7,78 
Maximum possiblerange| _1.00-15.00 1,00-15.00 1,00-15.00 .00-11.00 0-11.00 .00-11.00 .00-11.00 00-11,00 
Preferred stimulus American, American or | German, 7 American, Mongoloid, Japanese, Japanese, Japanese, 
Physician, German, Roman Catholic, Protestant, Physician, Mongoloid, Mongoloid, | Mongoloid, 
Protestant, Physician, | Physician, $ White, Japanese Physician Physician | Physician 
White Protestant, White Physician, Physician 
White Protestant 
Nonpreferred stimulus | Italian, Jew, | Italian, Jew, | Italian, Jew, American, || Portuguese, Ro- | American, American, Portuguese, - | American, 
Be Unskilled, Unskilled, | Unskilled, Unskilled, || man Catholic, | Negro, Negro, Negro, Negro, 
Negro Negro Negro Jew, Negro. || Unskilled Laz | Unskilled Unskilled Unskilled | Unskilled, 
borer Roman Catholic 
*p < .05. 
+p < 01, 
wee D < .001. 
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amount accounted for by religion reflects the 
addition of the religion “Jewish” in the case 
of the German study. The greater emphasis 
on nationality in the Japanese study probably 
reflects the presence of “Japanese” as one of 
the stimulus elements, whereas the German 
study employed only German, American, and 
Ttalian, that is, all “Western” nations. 

Second, the German subjects gave to race 
a smaller weight than did the Americans, and 
to occupation a larger weight; they also 
tended to give more weight to the religion of 
the stimulus person than was the case with 
the Americans, The German nonreligious sub- 
jects gave a higher weight to nationality than 
the other German subjects, but the N was 
only 6, so that this result may not be reliable. 

The inclusion of the data obtained from the 
German Catholics and nonreligious subjects, 
in spite of their small W, is justified because 
it gives some sense about the homogeneity of 
the samples here employed and the sensitivity 
of our method. Here the results of the various 
German samples are strikingly similar, and 
differ considerably from the American results. 
The same may be said for the Japanese re- 
sults, which are strikingly similar for the 
Japanese males, regardless of religion, The 
significance of the interactions, across the 
various same-culture samples, is a particu- 
larly striking demonstration of the reliability 
of this method of analysis. The same may be 
said of the data of Table 3, but here the 
heterogeneity of American samples is par- 
ticularly striking, and will be further dis- 
cussed below. 
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All the analyses that were performed are 
not shown, But they may be summarized as 
follows. The American subsamples are not * 
particularly similar to each other. For in- 
stance, the American Roman Catholics tend | 
to give larger weights to race and religion 
and a smaller weight to occupation than do | 
the American Protestants. The American Jew- | 
ish subjects had weights of 13.7 for race, 67.0 
for occupation, 11.3 for religion, and 0.6 for 
nationality (preferring the Italians to the 
Germans), and 1.9 for the interaction Re | 
ligion X Nationality (preferring German 
Jews to German Protestants). This pattern of 7 
weights (not of preferences) is more similat 
to the German than to the American Protes: | 
tant weights. 

The German samples showed particular 
preference for Protestant Physicians and re 
jection of Unskilled Jews, thus showing a sig 
nificant Occupation X Religion interaction. 

Since the Negro problem is peculiarly” 
American, it may be desirable to compare the 
German and American responses without con 
sideration of race as a variable. Table 3 Sum- | 
marizes the relevant analyses of variance, M 
which race is not employed as a source af 
variance, by limiting the analysis to “while 
stimulus persons. Here we tested an American | 
mixed-sex sample, not previously mentioned. 
The nonhomogeneity of the American sam 
ples is clear from these results. Religion has 4 
a considerable weight for the Protestant | 
males, the Jewish females, and for all the 
Catholics, while it has less importance for the 
other American subsamples. : 


TABLE 3 
PERCENTAGE OF VARIANCE IN SOCIAL DISTANCE ACCOUNTED FOR BY RELIGION, OCCUPATION, 
AND NATIONALITY IN AMERICAN AND GERMAN SUBSAMPLES 
ee = bs Re — me 
32 Ameri i ican femal 
Source Poema | > peca |S American male | S Aree 
Religion 37.0 16.7 31.9 51.2 
Occupation 47.0 71.0 34.2 | gir 
Nationality 7.6 1.9 11.8 4.0 
eine 7 American male | 5 American female 74German | &, German 
ews Jews Protestants Catholics 
Religion 16.2 30.7 9.8 re 
Occupation 720 35.8 82.6 86:4 
Nationality 12 20 29 12 
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Socrar DISTANCE 547 
TABLE 4 
CORRELATES OF MEASURES OF INDIVIDUAL DIFFERENCES IN SOCIAL DISTANCE 
IN THREE CULTURES 
Samples: | A la G J A A | A í 
Variable Stimuli: | G | J G J G J € é € é i € 
Score š y on y <9 x x x Dre Dre | Dra Dra | Drs 
F Scale (+) mi | 26| 03| .22*| -27+ | .20* .24*| 16 2 | Co * 
F Scale (—) | —103 | —03 | (07| —.o1 | =:24* | —.15 | —24* | —13 | —.18 si at do 
_ Acquiescence Response 104 106 | —.03| -18 | 09 Hr] 103) | 04°) =.02 13 05 | 4 
Set (p +a) | Sa, 
ase} o | og] 18 | asme] asme] 25| 28" 24") 394K) 25%) 10 


(p—a) 
(corrected for 
response set) 


*p <05. 
** p <01. 


Authoritarian content 
mk p < 001, 


Some Correlates of Social Distance in Three 
Cultures 


There are individual differences in social 
distance. That is, some people indicate that 
they perceive a great difference between the 
nonpreferred and the preferred stimulus per- 
sons, and other people perceive little differ- 
ence, The measure which taps this character- 
istic is the y measure described above. In ad- 
dition to this measure, and for exploratory 
purposes, the indices x, Dye, and Dya (see 
above) were employed in some of the analy- 
ses, The correlations between these measures 
of individual differences in social distance and 
the F Scale and its derivative scores are re- 
ported in Table 4. 

The data from this table show that al- 
though the F Scale does not correlate highly 
with the various social-distance scores, it 
does correlate positively in all cases. The re- 
versed F Scale correlates consistently nega- 
tively with the various scores. The authori- 
farian content score, that is, the F Scale cor- 
tected for response set, does remarkably well 
with the American sample, and creditably 
with the German and Japanese scores. Thus, 
the American-developed F Scale items are 
Particularly useful in connection with the 
4 American sample as one might have expected. 
It is interesting, however, that the transla- 
tion of the scale may be employed in the pre- 
diction of high scores in social distance in the 
Other samples also. Of course, scales specially 
_ standardized in other cultures ought to be 
More useful in predicting high social-distance 
Scores in those other cultures and less useful 
3 if translated into English and administered 

to American subjects. In the case of the Ger- 


man sample, we did administer some scales 
that were recently standardized and validated 
with German subjects, and the results are 
shown in Table D (see Footnote 4). The 
Adorno et al. (1950) Authoritarianism and 
Brengelmann Intolerance of Ambiguity scales 
did show significant correlations with our 
measures of individual differences in social 
distance, 


DISCUSSION 


The present study, together with the previ- 
ous study (Triandis & Triandis, 1962), sug- 
gests the following generalizations. 

1. Individuals in different cultures employ 
different weights for the characteristics race, 
religion, occupation, and nationality in deter- 
mining the social distance that they experi- 
ence toward other persons. 

In Greece, the major emphasis was placed 
on the religion of the stimulus person, fol- 
lowed by his race and occupation, In Ger- 
many, the major weight was on the occupa- 
tion, followed with emphasis on religion, race, 
and nationality, in that order. In Japan, the 
emphasis was again on occupation, but it was 
much less than in Germany, and the next 
most important factor, race, was much more 
important than in Germany. Nationality was 
also important in Japan. 

In the five replications with American sub- 
jects, the most important factor was race, fol- 
lowed by occupation, religion, and nationality, 
in that order. There is evidence, however, that 
Americans are very heterogeneous. For in- 
stance, American Jewish subjects place more 
weight on occupation and much less weight 
on other characteristics. 
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It is remarkable that though the pattern of 
weights across cultures is never the same, 
at least for the four cultures studied so far, 
there are substantial within-culture agree- 
ments across subsamples differing in religion 
(e.g., see the German and the Japanese males 
in Table 2). 

2. The weights given to the above-men- 
tioned four factors are also determined by 
the demographic characteristics of the sub- 
ject. Taken as a whole, the three studies 
(Triandis & Triandis, 1960, 1962; and the 
present one) suggest that differences in so- 
cial class (upper class showing less distance), 
religion (Jewish, showing less distance than 
Christian), ethnic background (Southern and 
Eastern European background showing less 
distance than Northwestern European back- 
ground), and sex (see Table 2 where Japanese 
females show more emphasis on occupation 
than the Japanese males) have some effect on 
the social distance of the subject. 

3. The personality of the subjects deter- 
mines, in part, the degree of social distance 
experienced. A number of measures tend to 
suggest this. High social-distance subjects 
were found to be high in acquiescence re- 
sponse set, high on F Scale, on the authori- 
tarian content of the F Scale (corrected for 
response set), and low on independence of 
judgment and tolerance for ambiguity. When 
these results are considered together with 
those of Table 1, which shows that the high 
scorers are low on tolerance for minorities, 
low on criticism of social institutions, and low 
on sympathetic understanding, the picture 
that emerges is as follows: The high social- 
distance scorer is conforming and uncritical 
of the values imparted to him by his culture, 
conservative, and intolerant of ambiguity. 
These results are, of course, limited by the 
nature of the sampling. The subjects were 
more or less normal young adults. Other fac- 
tors of a psychopathic origin may also be 
operating in the case of a few of these sub- 
jects, but it is unlikely that a random sample 
of college sophomores will have many highly 
disturbed, high social-distance subjects. 

4. The norms of behavior that are appro- 


. priate toward members of different outgroups 


differ from culture to culture. Thus, in Amer- 
ica and Japan, exclusion from the neighbor- 
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hood appeared as a fairly prevalent norm! 
while in Greece and Germany this was not |. 
the case. Presumably, differences in residen- * 
tial patterns account for this fact. The most 
social distance indicated by the Germans and 
Greeks was a refusal to be more than an 
acquaintance of the nonpreferred stimulus 
person. In the other two cultures, greater 
social distance appeared. 

It should be noted, however, that the Ger- | 
man sample was particularly “liberal” in this 
study. These results may be accounted for, 
in part, by the fact that this Hannover high 
school was a part of the UNESCO Model 
School Program. Whether this school is typi- 
cal of Germany remains to be checked in fur- 
ther studies. Davis has collected similar data 
from other German schools, and when the 
data are analyzed such a check will be pos } 
sible. | 

In general, the manifest content of the 
items being used to tap the attitudes must be 
considered within the framework of the pat- 
ticular cultural setting in which they are be | 
ing used. In postwar Germany, there has 
been a widespread reaction against authori- 
tarian and chauvinistic ideals and against 
any overt expression of intergroup hostility. 
Thus, well-educated young Germans are not 
likely to agree with statements involving | 
“strict discipline,” “patriotism,” etc. Simi- 
larly, the small amount of variance accounted 
for by nationality in social distance 15, not Í 
surprising in the light of postwar enthusiast | 
for European unity and close ties to the 
United States. It may be that our results are 


9 The Japanese translation for “exclude” on 
neighborhood) implies “reluctance to admit wit bi 
action,” in other words, it does not involve the a i 
tion component which is connoted by the et, 
word. The Japanese pattern of neighborhood exci — 
sion is interpreted by Takezawa as follows: i 


Tt is too bad if he lives in my neighborhood: i 
would be better if he lived elsewhere. But, a 
way, there is nothing I can or should do about } 

If he comes to my home, I will have to ae 
in, and even offer tea to be nice to neigh our 
(After all, we do not invite people to show i 
friendship. But coming for a visit is 4 wo f 
showing one’s respect, How can I be rude on 
to shut up my doors to him even if I ees da 
But all the time I will keep thinking how DA 
luck we have struck, quietly hoping that he 
leave us alone as soon as possible. 
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an expression of truly liberal attitudes. On 
the other hand, in making cross-cultural com- 
parisons, one must take into account differ- 
ences between schools and in the mass media. 
In postwar Germany there has been a con- 
certed effort to impart democratic ideals to 
the young. Whether the impact of these pro- 
grams is shallow or deep may not be ascer- 
tained by the present study. 

The Japanese data are also surprisingly 
“liberal.” This result, however, is consistent 
with McGinnies’ (1963) findings. McGinnies 
tested approximately 300 Japanese and 300 
American students in 1962 and 1963. He ob- 
tained their responses, on a Likert-type 5- 
‘point scale, to 12 items reflecting issues of 
concern to the American Civil Liberties Un- 
ion. He found that on the majority of the 
items the Japanese responses were more lib- 
eral than the American. He comments: 


While American students appear to have lost sight 
of some of the liberties protected by the Bill of 
Rights, Japanese students give evidence of having 
adopted these same guarantees with an enthusiasm 
that seems inconsistent with their cultural and po- 
litical heritage. 


These findings, considered together with 
those of Remmers and Radler (1957), who 
found that nearly half of America’s teen- 
agers are not committed to freedom of the 
press, over half approve censorship of books, 
one-third believe that persons should be 
forced to testify against themselves, one-third 
do not believe in freedom of speech, and one- 
quarter believe that the police should be al- 
lowed to search a person or his home without 
a warrant, suggest that there is a distinct 
tendency for young people in previously to- 
talitarian countries to move toward the 
‘American ideal” and for American young 
people to move away from it. The first phe- 
nomenon may not be difficult to understand, 
in view of the recent military history. The 
victor’s views must have more prestige than 
those of the losers. The American shift away 
from the American ideal may require a more 
complex explanation. It is conceivable that 
McCarthy and the radical right have had an 
influence. 

The most comprehensive study that has 

» been made of national characteristics of the 
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Japanese, conducted by the Institute of Sta- 
tistical Mathematics (1961) of Tokyo, also 
reveals some liberal attitudes among selected 
samples of the total Japanese population. 
Among their findings are: 65% have no per- 
sonal religious faith, 68% would talk back to 
a policeman, 69% favor labor unions, and 
55% favor “democracy.” The more rational, 
progressive, and newer ideas were found to 
be more common among young males with 
higher educational background. It should be 
noted that the present Japanese sample was 
primarily drawn from this sector of the total 
population. 

The foregoing discussion of the German 
“liberalism” seems equally applicable to the 
Japanese data. The shift in emphasis in the 
postwar educational trend toward democratic, 
nonauthoritarian, and egalitarian ideals ap- 
pears to have left an undeniable impact upon 
the young students who participated in this 
study. A word of caution, however, may be 
added that this is not to imply either that 
education has been solely responsible for such 
liberal attitudes, or that such attitudes rep- 
resent only “imports” from other nations. 
Furthermore, it must be pointed out that such 
liberal attitudes may coexist with less liberal 
and even conflicting attitudes as found by 
the study mentioned in the preceding para- 
graph. For example, as the more important 
value to stress to primary school children, 
63% placed discipline over freedom, whereas 
only 16% favored the latter over the other 
(Institute of Statistical Mathematics, 1961). 
Coexistence of conflicting attitudes is particu- 
larly possible when they represent different 
depths in attitude or different attitudinal 


areas, 


Limitations of the Study 


The problem of sampling is an obvious 
limitation. The strategy that we adopted was 
to select three student populations, and keep 
them relatively homogeneous. Thus, by maxi- 
mizing within-culture homogeneity, we should 
be able to observe between-culture differences. 
However, Germany, Japan, and even more 
clearly, the United States, are culturally ex- 
tremely heterogeneous. Thus, our results, 
properly speaking, are a comparison of Ti- 
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nois students with students from a particular 
Tokyo university and a Hannover high school. 

The second major limitation is one inherent 
in all cross-cultural comparisons. Such com- 
parisons can only be made in terms of “com- 
mon elements.” To the extent that a culture 
has features that are unique, it cannot be 
compared with other cultures. For example, 
the Indian system of social distance utilizes 
concepts such as “will permit him to touch 
my earthenware,” “will permit him to touch 
my brassware,” etc. (Mahar, 1960). It is ob- 
vious that such items cannot be compared 
directly with American social-distance items 
since they may form an entirely different 
dimension of social behavior. The comparison 
must be limited to only those broad aspects 
of social acceptance on which there is some 
agreement. 

The multidimensionality of the social-dis- 
tance scale (Triandis, 1964) is a further limi- 
tation. Factor analyses of social-distance 
items, performed after the data of the present 
study were collected, revealed the presence of 
five independent factors: Factor I, called So- 
cial Acceptance with Subordination, had high 
loadings on items such as “would admire the 
ideas of,” “would invite to dinner,” “would 
obey,” etc. Factor II, called Marital Accept- 
ance, had loadings on “I would marry,” “I 
would date,” etc. Factor III, called Friend- 
ship Acceptance, loaded on “I would eat 
with,” “I would go fishing with,” etc. Factor 
IV was a “pure” Social Distance factor, with 
loadings on “accept in the neighborhood.” 
Finally, Factor V, called Subordinate, had 
loadings on “I would be commanded by,” “I 
would not treat as a subordinate,” etc. The 
present social-distance scale, because of its 
utilization of only one or two items from each 
of these factors, has some of the character- 
istics of a unidimensional scale but future re- 
search should probably employ the multi- 
dimensional scales, and more items. This re- 
quires more elaborate studies, but the domain 
under investigation is extremely complex and 
does require more complex instruments, 

In spite of the several limitations of the 
present study it did show that the method of 
investigation was sensitive to Many significant 
determinants of social distance, and added 
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some additional data to the existing knowl- 
edge of how subjects from different cultures 
employ these determinants in the specifica 
tion of their social distance. 
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RECOGNITION OF TABOO STIMULI AS A FUNCTION 
OF EXPOSURE TIME* f 


DONALD D. DORFMAN, JOHN M. 
San Diego 


In Exp. I, 2 groups of 50 women served as Ss. Each S in Group 1 was given 
a pair of words, 1 taboo and 1 neutral, and was instructed that 1 of the 


words would be shown on each trial, 
seen. Each word was shown at each 


repeated on 4 word pairs. Group 2 was tested under identical conditions 
except for a different taboo indicator. Aĉcuracy was significantly higher to 
neutral words at the shorter durations, but higher to taboo words at the 
longer durations in both groups. A control experiment was performed to deter- 
mine if similar results would be obtained with frequent and infrequent words. 
In contrast to the findings on taboo words, Ss were more accurate to frequent 


words at all durations. 


This study investigated the relation of ex- 
posure time to accuracy of reporting taboo 
and neutral words. Demonstrations of ele- 
vated thresholds for socially taboo words 
have been numerous, and most investigators 
have concluded that response bias is the criti- 
cal factor in determining these elevated 
thresholds (Goldiamond & Hawkins, 1958; 
Postman, Bronson, & Gropper, 1953; Za- 
jonc, 1962). Recently, some investigators 
have analyzed the role of response bias as 
the energy or exposure level of the stimulus 
is increased. For instance, Brown (1961) 
has presented an S-R analysis of the taboo- 
word experiments in which he was specifi- 
cally concerned with exposure time. He argued 
that response tendencies of taboo and neutral 
words change systematically with exposure 
time in accordance with the laws of stimulus 
generalization. 

It has also been suggested that the response 
strengths of words vary with increasing stim- 
ulus information (Zajonc & Nieuwenhuyse, 
1964), or with the particular cues identified 
by the subject (Eriksen, 1956; Kempler & 
Wiener, 1963; Spence, 1963). Thus, as ex- 
posure time increases, more cues become 


1 This research was supported in part by a grant 
from the San Diego State College Foundation. Ac- 
knowledgment is made to William A. Hillix for 
valuable suggestions and to Bruce Wilson, Jr. for 
his assistance under the support of an Undergradu- 
ate Research Grant G-21679, National Science 
Foundation. Experiment I served as part of a 
master’s thesis by L. Kroeker. 
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and she was to utter the word she had 
of 7 durations, and the experiment was 


available, thereby changing the relative 
strengths of possible responses to the stimu l 
lus. However, there is very little systemati 
data concerning the relation of exposure time 
to the identification of taboo words. This} 
study provides such empirical data. In ordet 
that the data be eventually accounted {ot 
theoretically, exposure-time data were ob 
tained in a very simple case. Two stimulis 
words (e.g., PENIS, MIXER) and two response 
were used (e.g., PENIS, MIXER). Each wor 
was presented at each of a series of durations 
by the method of constant stimuli, Subject 
in one group were instructed to utter tht 
word they saw, and subjects in a secon 
group were instructed to say the neutral 
word when the neutral word was seen, angi 
to say the neutral word preceded by the 
word “not”? when the taboo word was seei 
This manipulation was designed to obtall 
exposure-time data under two levels of 1% 
sponse bias. 


EXPERIMENT I 
Method 


Subjects. The subjects were 100 girls enrolled A 
introductory psychology who were given ex i 
points for participating in the experiment, bad 
upper age limit was set at 28, and the subjects 
a mean age of about 20. 

Apparatus. Stimuli were viewed in a O° four 
tachistoscope, the eyepiece of which contain 10% 
neutral density filters, two each of 20% and ind 
transmission. The experiment was conducted pane | 
sound-insulated room lighted by a ceiling P's 
containing one 60-watt bulb. | 
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Stimuli. The stimuli were presented on 9X 12 
inch white cards with the stimulus word placed in 
the center. The stimulus words were made with 
Letraset instant lettering (Sheet 190), with each 
letter 1 millimeter long. The fixation card was 
white with an “X” of the same size as the stimu- 
lus letters placed in the center. The preexposure and 
exposure fields were of approximately equal illumi- 
nation, and the visual angle for the seated subject 
was 0°23/06". 

Two words were used in each subexperiment, a 
socially taboo and neutral word. For the complete 
study, four pairs of words were used, The words of 
each pair were equated for word frequency accord- 
ing to the Thorndike-Lorge (1944) semantic count, 
although in the case of KOTEX and PENIS, the esti- 
mated frequency of Postman et al. (1953) was em- 
ployed. The word pairs were as follows: PENIS, 
MIXER; WHORE, TALLY; RAPED, PHOTO; KOTEX, 
ctrar. The taboo words were all members of the 
original list used by McGinnies (1949). 

Procedure. Subjects were randomly assigned to 
two groups of 50 each. Group 1 was instructed to 
say aloud the neutral word when the neutral word 
was seen, and to say aloud the taboo word when 
the taboo word was seen. Group 2 was instructed 
to say aloud the neutral word when the neutral 
word was seen, and to say aloud the neutral word 
preceded by the prefix “not” when the taboo word 
waseseen, Both groups were required to utter one 
of the two appropriate responses on each trial and 
if they were not sure, they were instructed to guess 
one of the two responses. There was no mention of 
the fact that one of the two words was socially 
taboo, and the other neutral. 

Each subject was brought into the experimental 
room, introduced to the two male experimenters, 
and seated in front of the tachistoscope. The fol- 
lowing instructions were then read: 


_ This is a study in perception. One word at a 
time will be presented to you in the machine at 
various intervals of time. Before we begin, you 
will be given a card with two words printed on 
it. One of these two words will be presented in 
the machine on each trial. You are to look into 
the eyepiece and tell us which one of the words 
was flashed. The words will be presented fairly 
rapidly so that at times it will be very difficult 
for you to see them and you will frequently have 
the feeling that you are guessing. Don’t be dis- 
turbed by this. When you can’t see the words, 
just guess. The important thing is that you must 
report a word after each trial. 

Now look into the eyepiece and let your eyes 
adjust to the darkness. We will now run four 
practice trials to acquaint you with the machine. 
In this practice, we will present five arbitrary 
digits on each trial. Fixate the “X”; that is 
where the five digits will appear. 


mee subject was given four practice trials on the 
S. 
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Group 1 subjects were then given a card with a 
pair of words on it (eg., PENIS, MIXER). Word or- 
der was randomized. The following instructions 
were then read: 


Now we will begin with the experiment, Fa-< 
miliarize yourself with the two words on this 
card. Now look into the eyepiece, and remember 
that you must report a word after each trial. If 
you are not sure which word was presented, 
guess. Are there any questions? 


Group 2 subjects were also given a card with a 
word pair printed on it. Below the word pair to 
the left was the phrase, “your response will be,” 
and beneath the neutral word was the neutral word, 
and beneath the taboo word was the neutral word 
with the “not” prefix. This group. was given similar 
instructions except for an additional note on the 
appropriate responses for each of the two words. 

During the experiment, each of the two words 
was presented four times at each of the sever. dura- 
tions: 0, 30, 60, 90, 120, 150, and 180 milliseconds. 
The O duration consisted of the presentation of a 
word with the timer set at O exposure level, and 
was simply a stimulus-absent trial. The word-dura- 
tion combinations were presented in a random 
fashion for 56 trials (two words, seven durations, 
at four times each), with an interstimulus interval 
of 12 seconds. Each subject was run on all four 
word pairs, with approximately a 4-minute rest 
period between word pairs. The order of stimulus 
presentation and the order of word pairs was ran- 
domized across subjects. 


Results 


Figure 1 shows the results for Group 1 and 
Group 2, pooled across word pairs. These data 
reveal a rather surprising interaction between 
word class and duration. Accuracy is higher 
to the neutral words at the very short dura- 
tions, which agrees with traditional findings 
in this area; however, at the longer durations 
accuracy is higher to the taboo words.” The 
effect appears quite systematic, and occurs 
in Group 1 in which the taboo word was ut- 
tered as well as in Group 2. Figure 2 sepa- 
rates the results for the four word: pairs, and 
averages over the two groups. As can be seen 
in Figure 2, three of the four word pairs 
show a reversal in the relative accuracy of 
the taboo versus neutral word as duration 
was increased. Analysis of variance performed 
on these data provided statistical support for 
these observations. For purposes of analysis, 


2 Accuracy at O duration was defined as agree- 
ment between the response that was uttered and 
the stimulus in the exposure field. 
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the data on the stimulus-absent condition 
(0 duration), and the data for durations 
greater than O were analyzed separately. In 
“both cases, the analysis was performed on an 
“arc-sine transformation of each percentage of 
‘correct responses. 
© Analysis of the data on the stimulus-absent 
condition showed a highly significant emo- 
tionality effect (F = 48.25, df= 1/98, $ < 
001) which resulted from a strong tendency 
“to oyercall the neutral word. This result oc- 
curred in all word pairs and even in Group 2 
where the taboo response was not directly 
‘uttered (see Figures 1 and 2). Also of in- 
terest was the reliable Emotionality x Groups 
“interaction (F = 9.75, df= 1/98, p< .01). 
_ Figure 1 shows this interaction rather clearly 
and indicates that the tendency to utter the 
taboo indicator was larger in Group 2 (e.g, 
C NOT-MIXER) than in Group 1 (e.g., PENIS). 
“It might be noted that there was no reliable 
| groups effect (F <1), showing that accu- 
-racy was equal in the two groups. The aver- 
age percentage of correct responses in the 
 Stimalus-absent condition was 49.6 in Group 
= Land 50.5 in Group 2. Equal accuracy is ex- 
" pected in this condition since the subjects 
~ teceived no stimuli, and the taboo and neu- 
tral words were presented an equal percent- 
age of times. 
The results for the case in which duration 
_ Was greater than O provide an interesting con- 
| trast to the stimulus-absent case. For pur- 
Poses of analysis, the durations were com- 
‘bined into three blocks, 30-60, 90-120, and 
C 150-180 milliseconds. Analysis of variance 
_ showed no significant emotionality effect, in- 
“dicating that accuracy to the taboo words 
"Was not lower than to the neutral words if 
‘One averages over durations. This conclusion 
“held for both Group 1 and Group 2 since 
‘there was no Emotionality x Groups interac- 
tion (F < 1). Of particular interest was the 
‘Significant Emotionality X Duration interac- 
tion (F = 19.46, df = 2/196, p < .001). This 
Provides statistical support for the crossover 
Of the functions observed in Figure 1. In 
Order to draw more detailed conclusions about 
this interaction, simple comparisons were 
hen performed on these data. For the dura- 
„tion block—30-60 milliseconds—accuracy to 
boo words was significantly lower than to 
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neutral words (F = 18.02, df= 1/98, p< 
.001) and agrees with the findings on the 
stimulus-absent condition. However, at 90- 
120 milliseconds, the reverse was obtained. 
Accuracy to taboo words was significantly 
higher than to neutral words (F = 8.03, df = 
1/98, p< .01). For 150-180 milliseconds, 
accuracy was higher to taboo words with the 
difference almost significant (F = 3.25, df = 
1/98, p < .10). In the simple comparisons, 
the error term of the F ratio was error (w),, 
Subjects x Emotionality interaction, pooled 
over the two groups (Winer, 1962). 

It might be argued that this higher level 
of accuracy observed at the longer durations 
is not unique to taboo words in this situa- 
tion, but may occur to any class of words 
for which there is a negative response bias on 
stimulus-absent trials. There is evidence of 
a response bias against reporting infrequent 
rather than frequent words on stimulus-absent 
trials (e.g, Goldiamond & Hawkins, 1958; 
Zajonc & Nieuwenhuyse, 1964), In evaluat- 
ing the above hypothesis, it would be useful 
to determine if the higher level of accuracy 
obtained with taboo words is also observed 
with infrequent words. If such is the case, 
then this phenomenon may be accounted for 
by some property which taboo and infre- 
quent words have in common, such as a com- 
mon negative response bias on stimulus-ab- 
sent trials. 

The following experiment compared re- 
sponses to frequent-infrequent word pairs as 
a function of duration. The procedure used 
on Group 1 in Experiment I was employed 
here. 

EXPERIMENT II 


The words were taken from the experiment 
by Howes and Solomon (1951) in which they 
showed 15 words and obtained a correlation 
of —.76 between log frequency based on 
the Thorndike-Lorge magazine and semantic 
count, and duration threshold. We selected 
their four most frequent and their four most 
infrequent words, choosing an infrequent and 
frequent word for each pair. The word pairs 
were as follows: MACHETE (.20), COUNTRY 
(674); FIGMENT (.50), PROMISE (150); TITU- 
LAR (1.1), EXAMPLE (123); IRKSOME (2), 
BALANCE (63), Where each number following 
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over the magazine and semantic count. 


Method 


Subjects. Thirty girls served as subjects and were 
selected from the same pool of subjects as those in 
Experiment I. 

Apparatus. The apparatus was the same as in Er 
periment I. 

Procedure. The procedure was identical to that 
used for Group 1 in Experiment I except for a 
change to the above four word pairs, 


Results 


Figure 3 shows the results on the infre 
quent-frequent pairs averaged over the four ( 
word pairs. As can be seen, there was & 
tendency to utter frequent rather than infre 
quent words at .00 duration (stimulus ab- 


a word is the average occurrence per J 
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sent), However, in contrast to the findings 


„on taboo and neutral words in Experiment I 


| 


l 


(Figure 1), frequent words had a higher 
level of accuracy than infrequent words at all 
durations. It is of interest that this higher 
level of accuracy was obtained in spite of the 
fact that the subject was given the two words 
on a card and no other response was per- 
mitted, Figure 4 presents the data separately 
for each word pair. In none of the four cases, 
was accuracy higher for the infrequent mem- 
ber of the pair at the longer durations, noting 
that in all cases, there was a bias against ut- 
tering the infrequent word at 0 duration. The 
analysis of variance performed on the stimu- 
lus-absent condition revealed a significant 
frequency effect (F = 5.88, df=1/29, p< 
.025), showing that the tendency to utter 
infrequent words was significantly lower at 0 
duration. No other reliable effects were ob- 
tained. 

The durations were also combined into 
three blocks, 30-60, 90-120, and 150-180 
milliseconds for the analysis of variance of 
the stimulus-present condition, The analysis 
of variance showed a significant frequency 
effect (F = 9.10, df = 1/29, p < .01). This 
result is in contrast to the statistical findings 
in Experiment I, where no significant emo- 
tionality effect was obtained (F < 1). Thus, 
accuracy to frequent words was greater on 
the average than accuracy to infrequent words 
for the positive durations, whereas accuracy 
to neutral words was no higher than to taboo 
words on the average. Another interesting 
result was the lack of a significant Frequency 
X Duration interaction (F < 1). In Experi- 
ment I, the Emotionality X Duration inter- 
action was significant at the .001 level. 


DISCUSSION 


A quite fruitful account of the data on the 
recognition of taboo words has been the ar- 
gument that there is a response bias favoring 
neutral rather than taboo words. In some 
Cases, data showed that a response bias may 
have resulted from the taboo words being less 

. frequent than neutral words (Howes & Solo- 
Mon, 1950), and in other cases, that subjects 
tay have witheld report of taboo words when 
they were instructed to utter them publicly 
(Postman et al., 1953; Zajonc, 1962). The 
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present data on the stimulus-absent condition 
clearly support this assumption of a response 
bias favoring neutral words. Our results on 
the positive durations are more difficult to 
explain. On the average, as duration increased, 
the subjects showed a more accurate report 


‘of taboo words. 


As noted earlier, a number of investigators 
have concluded that response bias may change 
as exposure level is varied. However, it has 
not been suggested that the bias may actually 
reverse itself and favor the taboo word when 
duration is sufficiently long. For instance, 
Brown (1961) assumed that the function 
relating the strength of a taboo-word response 
to exposure time was a gradient of stimulus 
generalization. Although he accounted for the 
traditional threshold differences between ta- 
boo and neutral words, he did not predict an 
average higher level of accuracy for the taboo 
words, His analysis is directly relevant to the 
present study since it was applied specifically 
to a two-response Case. 

Also, Zajonc and Nieuwenhuyse (1964) 
have suggested that response bias diminishes 
in importance as exposure level of the stimu- 
lus increases. Although their hypothesis 
would account for the convergence of the ta- 
boo and neutral functions as exposure time 
is lengthened, it would not appear to explain 
the higher level of accuracy we obtained for 
the taboo words. 

One possible explanation for these data is 
that with this procedure, the subjects are try- 
ing to utter each response an equal number 
of times, Since they have undercalled the 
taboo response at the shortest durations, they 
compensate by overcalling it at the longer 
durations. However, an inadequacy of this ex- 
planation is that it should also apply to the 
word-frequency study. In this study, no Fre- 
quency X Duration interaction was observed. 

While these hypotheses assume a change in 
response bias as duration is varied, at least 
one investigator has suggested that the proba- 
bility of using anxiety or taboo words in a 
stimulus-absent condition is predictive of their 
usage when stimuli are present (Goldstein, 
1964). These data clearly contradict this as- 
sumption in a simple case. 

A somewhat different approach which may 
be of value in interpreting these data, is the 
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application of a mathematical model which 
separates the response bias from the discrim- 
inability of the stimulus. For instance, such 
an approach has been employed in the area of 
signal detection by Tanner and Swets (1954) 
and Birdsall (1955) among others. Since 
their model has been applied to psycho- 
physical data obtained by the method of con- 
stant stimuli (Birdsall, 1955; Tanner & Swets, 
1954) with success, a model with similar 
properties is applied here. 


Model 


The observable events of the experiment 
consist of a set of word-duration combinations 
(SiT;) where S; is the stimulus word and T; 
the duration, and a set of responses Ry. The 
i= tn for taboo and neutral, respectively, 
j= 0, 30, 60, 90, 120, 150, 180 milliseconds, 
and = t,n. For example, S;Tso might be the 
stimulus PENIS at 60 milliseconds, and R; 
might be the response PENIS. 

The assumptions are as follows: 

Assumption 1, On each trial, the presenta- 
tion of an S;T; combination leads to an event 
Xi; located on a unidimensional continuum, 

Assumption 2, Over a large series of trials, 
each S:T; leads to a normal distribution of 
such X;;’s with variance oi? and mean M, ij: 

Assumption 3. There exists a fixed cutoff 
point X, on the continuum such that if Xy< 
Xea then the subject will respond R,, and if 
Xy >X, the subject will respond R„. The 
location of X, is some function of the values 
and costs associated with his decisions. For 
Present purposes, this function need not be 
specified, 

Assumption 4. At duration 0, all S;7 are 
associated with the same normal distribution 
with mean Mo and variance O°. 

Assumption 5. All o? are assumed equal 
and are designated as o,°. 

The normal distribution associated with 
SiTo will serve as a standard such that its 
mean Mo will be the point of origin and its 
standard deviation oo, the unit of measure- 

ment of the continuum. In the present case, 
this distribution serves a function analogous 
to the noise distribution in the signal-detec- 
tion model (Tanner & Swets, 1954). A meas- 
ure of word discriminability for a Particular 
duration can be taken as the distance of the 
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mean of the S,7; distribution fro 
of the standard distribution, Wi 
a measure Dw, and it is defined a 
— Mu = Mo i 


Du 
To 

From these assumptions, 0 
the following equation relating 
ity of responding the neutral 
given that the neutral stimulus wi 
at the jth duration, to X, and Du 
tion is as follows: - 


P(Ry|SnTj) = E(D 


where 


stimulus at the jth duration 
F =the normal cumulative distribution 


The constant z, can be shown to be 
the probability of uttering the ne 
sponse given the stimulus-absent ev 
following way: 


Similarly, we can derive the equal 
curacy to the taboo stimulus and if 
lows: 


P(R,|S:T;) = F(— Di #3 


Figure 5 shows three of the distri 
the attribute for illustrative purp : 
Equation 3 was used to estimate 
of 2, for each word pair within a gr 
standardized normal tables. The 
sponse percentages were employ 
mates of the probabilities. The 20's 
similar for all word pairs in each 
Group 1, the average z, was E 
range from —.43 to —.58, and ini 
the average ze was —.178 with a 
—.12 to —.23. The negative sign 


i 
ê The function used by Tanner and $i 
and Birdsall (1955) for data obtained by 
of constant stimuli with yes-no alterni 
follows (Birdsall, 1955, p. 399): 
P(Yes) =F(c+d'), where ¢ is 
determined by the false-alarm probal 
the normal cumulative distribution f 
function for false-alarm probability 


Px(A) =F(c). | 
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the cutoff was stringent with respect to the 
taboo response. 

The z,’s were substituted in Equations 2 
and 4 in order to estimate the Dys and D,’s. 
The group response percentages were again 
used as estimates of the probabilities. Figure 
6 presents the estimates of Dw separated 
for words and durations and averaged over 
Group 1 and Group 2. The overall results 
were similar for both groups. Since the at- 
tribute was given a direction, the D;’s were 
multiplied by — 1 for purpose of comparison, 
and will be called the absolute discriminabil- 
ity values. They are symbolized as D’». As 
can be seen, the results are quite consistent 
tn that the discriminability measure, D’'w, was 
larger for the taboo stimulus in all four word 
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Fic. 5. An example of three theoretical functions 
and the cutoff located on the dimension, (The shaded 
area represents the probability of a neutral response 
given 0 duration, In this example it was arbitrarily 
set at .84.) 
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pairs. This result occurred even in the case 
of the PENIS-MIXER pair where a higher level 
of accuracy to the taboo word was not ob- 
served. It is also of interest that the differ- 
ence in discriminability of the taboo and 
neutral stimuli, D's — D'a, increased rather 
substantially with duration. Since a D’,, was 
obtained for each word-duration combination 
within a group, an analysis of variance could 
be performed on these data where each word 
pair is analogous to a particular subject in a 
traditional analysis. This analysis of variance 
revealed a significant emotionality effect (F 
= 31.86, df = 1/3, p< .001) and an Emo- 
tionality X Duration effect (F = 14.22, df = 
5715, 2 < -0s 

These findings give us a possible explana- 
tion for the interaction of emotionality and 
duration observed in Figure 1. Equations 2 
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and 4 show that with D’,, equal, a negative 
cutoff, z, results in a higher level of acca 
to the neutral stimulus. Counteracting ths 
effect is the fact that the taboo stimulus i 
a larger discriminability, D'w. AS a 
becomes large, the difference, D'i — Dm i 
creases enough to cancel the negative ae 
leaving a remainder which results in hig a 
level of accuracy to the taboo stima 
the case of the PENIs-mrxeR pair, the E 
ference in D’,, may not have been larg 
enough to produce this result. 

There ST important difference bee 
the findings on Group 1 and Group LN ce 
should be noted. The analysis of eee 4 
showed a significant Emotionality X oe 
effect (F = 35.46, df = 1/3, p < -001): “iil 
discriminability measures on the taboo 4 
neutral words were more similar in GrouP 
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Fic. 8. D' as a function of exposure time averaged 
over word pairs. 


_ than in Group 1. To account for this finding, 


it might be tentatively argued that the taboo 
or emotional nature of a stimulus is reduced 
when the response associated with it is neu- 
tral, and therefore the taboo and neutral 
stimuli are more similar in Group 2, with re- 
Spect to their emotional nature. Although the 
two groups may differ, one should still expect 
the D'„ measure to be correlated in the two 
8toups. A Pearson correlation was run be- 
tween the D’,,’s obtained in Group 1 against 
those obtained in Group 2. A correlation was 
Tun separately for the taboo and neutral 
Words. In the case of the taboo words, the 
correlation was +.959 (df = 22, p <.001), 
for the neutral words, r was +.891 (df = 
2, $ < 001) with 24 pairs of observations. 
A ‘Although the D’,, measure was higher for 
Soo than neutral words, it may be higher 


for any class of words for which there is 


eae response bias on stimulus-absent 
. To test this hypothesis, the results of 
ee ent II on frequent-infrequent words 
ahaa to D'w’s and these are pre- 
ei Figure 7. The average 2, in this 
o T was —.15, with the range from 
findin © —.37. In contrast to taboo-neutral 
ee these results show no consistent 
acr One igure 8 presents the D’,,’s averaged 

S word pairs, and it is quite clear that 
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the D’,,’s do not change with word frequency. 

Analysis of variance on the D’,’s shows no 

reliable frequency effect (F <1) or Fre- 

quency X Word Pairs effect. This finding may , 
be consistent with the absence of an interac- 

tion between word frequency and duration. 

The negative cutoff lowers accuracy to the 

infrequent word, but the infrequent word 

does not have a greater discriminability to 

offset this bias. 

In summary, the data showed a substantial 
interaction between word emotionality and 
duration, This analysis suggests that emo- 
tional stimuli may actually be more discrim- 
inable than neutral stimuli, and the difference 
in discriminability may account for this inter- 
action, Taking a more conservative position, 
one may conclude that accuracy to taboo 
words is no lower than to neutral words be- 
yond 60 milliseconds even when the taboo 
word is publicly uttered. 
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CASE REPORT 


THE GREAT CABBAGE HOAX: 
A CASE STUDY? 


MAX HALL 


Harvard University Press 


During the price-control period of the Korean War, vast circulation was given 
to a false rumor that the Office of Price Stabilization had a 26,911-word 
cabbage order. The entertainment yalue was enhanced by comparing this with 
the brevity of the Gettysburg Address, Ten Commandments, and Declaration 


of Independence. In 1 version of the story, whose origin is precisely known, 
the cabbage order became a 12,962-word regulation on manually operated 
foghorns. The cabbage joke has persisted even into the 1960s, A tracing of the 

. cabbage and foghorn versions through the nation’s communication system 
produces over 50 examples of their use and provides illustrative material on 
how rumor travels despite wide publicity given to the true facts, 


On November 16, 1964, the National Ob- 
" Server, a weekly newspaper published by Dow, 
< Jones, and Company, printed this item: 


b, The Ten Commandments contain 297 words. The 

Bill of Rights is stated in 463 words. Lincoln’s 
Gettysburg Address contains 266 words. A recent 
Federaf directive to regulate the price of cabbage 
contains 26,911 words. 


The item was attributed to the Messenger-Ga- 
zette, Somerville, New Jersey. The editor of the 
Messenger-Gazette told me he did not remember 
Where he got it. Meanwhile the Concord (Massa- 
chusetts) Journal printed exactly the same para- 
a staph on November 12, and the editor told me 
it came from a magazine called Quote: The 
Weekly Digest, published in Richmond, Indiana. 
~ About this time I happened to see the item in 

Sixty Six Ninety-Nine, a leaflet of amusing quo- 

issued periodically by the Rapid Service 

fess, a Boston printer. The compiler of the 

Publication told me he took the story from 

Bote I wrote to Quote, and the editor, Lotte 
F cay promptly sent me a copy (September 
Bin 64) containing the cabbage item. She ex- 
Ties ed that she got it from the August issue of 
cAi a house organ of the Southern Railway Sys- 
E: pa that Ties got it from the Atlanta Jour- 
Feditor. re it had appeared as a letter to the 
ae be assumed that these seven printings 
bin came to my attention were only a frac- 

; perhaps a tiny fraction—of the total repe- 


Ary 
SEUN is a follow-up study of an earlier investiga- 
1954, ch appeared in The Reporter of March 16, 

” PP. 32-35, entitled “The Great Cabbage Hoax.” 


titions of this joke in 1964. Some people, enjoy- 
ing the laugh, may have assumed that the cab- 
bage directive, being “recent,” was a product of 
the Johnson or the Kennedy administration. Per- 
haps others, remembering that the country had 
not had price controls since the Korean War, 
blamed it on the old Office of Price Stabilization 
(OPS), as many Americans did while OPS was 
still alive. For example: 

On the morning of May 1, 1952, millions of 
housewives were listening to an NBC radio pro- 
gram called “Double or Nothing.” The master of 
ceremonies, Walter O’Keefe, came to the mo- 
ment for putting the “grand-slam” question, He 
announced that the contestant who came closest 
sto the correct answer would win $80. The ques- 
tion was approximately as follows: “Lincoln’s 
Gettysburg Address contains 266 words, The 
Ten Commandments contain 297 words, The 
Declaration of Independence contains 300 words, 
How many words are in the OPS order to reduce 
the price of cabbage?” The contestants made 
their guesses, and O’Keefe proclaimed that the 
correct answer was 26,911 words. He awarded 
the $80 to the person who had come closest. 
Laughter and applause filled the studio. 

The remarkable thing about this episode was 
that OPS had no cabbage order. During its en- 
tire existence, from its birth in January 1951 to 
its death in April 1953, OPS never placed con- 
trols on the price of cabbage. 

The first time I heard the claim that OPS had 
a 26,911-word cabbage order was on August 9, 
1951, 9 months before the “Double or Nothing” 
broadcast. Michael V. DiSalle, then Director of 
Price Stabilization, was in Memphis, Tennessee, 
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to address a convention of the Southern Gar- 
ment Manufacturers Association, and I was with 
him because I was the OPS Director of Public 
Information. The toastmaster, introducing Di- 
Salle, made a long, rambling speech in the course 
of which he recited wordage figures for the 
Gettysburg Address, Ten Commandments, and 
Declaration of Independence and then said the 
OPS cabbage order had 26,911. This surprised 
DiSalle as much as it did me, but he contented 
himself with beginning his address, “Took the 
toastmaster almost twenty-six thousand words to 
introduce me.” 

Though we did not realize it at the time, the 
cabbage joke was already gathering momentum. 
For example, on July 28, 1951, it had appeared 
in New England Homestead, a semimonthly farm 
journal published at Springfield, Massachusetts 
(the earliest printing I have seen). On August 10 
the New York Daily News, circulation 2,200,000, 
printed it in the form of a letter signed Paul D. 
Warsley, North Bergen, New Jersey. In the Octo- 
ber issue of The Inland Printer, a monthly trade 
paper published at Chicago, Wayne V. Harsha 
used the cabbage item in his column and credited 
it to New England Homestead. The typical word- 
age figures used in 1951, and indeed in nearly all 
the printings I collected in the 1950s, were: 
Gettysburg Address 266, Ten Commandments 
297, Declaration of Independence 300 (grossly 
inaccurate), and OPS order 26,911. It was not 
until the 1960s, so far as I know, that somebody 
substituted the Bill of Rights for the Declara- 
tion and “Federal directive” for “OPS order.” 

Someone sent me a 1-page circular, dated Oc- 
tober 24, 1951, mimeographed on the letterhead: 
of Glaser, Crandell Company, a Chicago pro- 
ducer of pickles and preserves. The cabbage 
anecdote was there, followed by a few statistics 
about the food industry. The cabbage item was 
headed “According to G.M.A.” I wrote to R. J. 
Glaser, president of the company. He replied on 
November 13 that the item came from a news- 
letter of Grocery Manufacturers of America, In- 
corporated. He ended his letter thus: 


So far as our Company is concerned, it is obvious 
that we were doing nothing much more than send 
out some interesting statistical excerpts and mingled 
them with what we thought was a rather funny re- 
mark, We had no idea, personally, as to whether 
there was a cabbage regulation or not. The chances 
are that G.M.A. didn’t either. 


I also wrote to Paul S. Willis, president of 
Grocery Manufacturers of America, Incorpo- 
rated. He replied on November 21, 
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At our annual meeting at the Waldorf last week, a 
number of persons had a copy of that item and 
asked if I had ever seen it. Either because of a | 
faulty memory, or perhaps because the article car- 
ried no interest for me, I had no recollection of 
ever having seen it. Upon investigation, I found that |} . 
in one of our circulars, the staff included the article 
because of its humorous slant. We did not create it 
and we have no interest in circulating it, 


A representative of Willis also telephoned me to 
explain apologetically that one of the publicity | 
men for the association had seen the cabbage f 
yarn in some paper, had thought it was funny, | 
and had put it in the association’s literature. i 

About this time OPS officials who had served f 
in the Office of Price Administration during 
World War II told me that the same story had 
been circulated in OPA days. It was a hoax thet, 
too, for OPA had no regulation devoted exclu | 
sively to cabbage. (OPA, however, unlike OPS, 
did have a regulation covering many fruits and | 
vegetables, cabbage included.) * n 


ATTACK AND COUNTERATTACK 


I did not discover who invented the cabbage | 
rumor in the 1940s, or who first applied it to | 
OPS in the 1950s, but I know many people and 
publications that kept it alive. Some of these, 
when told of the apocryphal nature of the joke, 
printed retractions. For example, when the She 
boygan, Wisconsin, Press produced an editorial 
on November 6, 1951, entitled “Too Much Cab: | 
bage in OPS Orders,” I wrote a polite letter 
to the editor, acknowledging that some of the 
agency’s major regulations were lengthy becaust 
of the legal necessity of covering all continge i 
cies in a complex situation, but pointing out that | 
the cabbage fiction, which was being systemat | 
cally and widely circulated, tended to cast 1 p 
cule on an agency that needed the cooperatioi 
and respect of business and the public. The vel 
tor printed my letter on November 23 une 
the heading: “Apparently We Fell for Politi 


21n this article I use the word “hoax” in bs 
sense of a device intended to deceive widely bye 
posing on people’s credulity. The intention 5 
to deceive utterly or only momentarily; the Mi 
may be pure amusement, mockery, 
more serious, or a mixture. To be sure, herens 
hoax is often associated with harmless humor Wigs 
deceptions for serious purposes usually draw k 


si the 
like “fraud,” “false propaganda,” or eie" a att 
distinction is not a precise one because mO a 


so hard to identify and their classification a oint 
less or harmful is so subjective. The important P i 
is that a deliberate deception, whether We “part 
hoax or fraud, can become rumor when it 


traveling. 
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Propaganda.” But meanwhile other papers—no 
one will ever know how many—were printing the 
item in letters to the editor. For example, a let- 
ter signed “N. S.” appeared in the Tulsa Tribune 
on November 13 under the heading: “Oops! OPS 
Js Too Wordy.” 

On November 23, 1951, the Associated Press 
carried a humorous story by Arthur Edson about 
our difficulties in combating the hoax. This story 
was widely printed. For example, the Tulsa Trib- 
une used it on the front page; the Washington 
Evening Star headlined it—“DiSalle Tries to 
Make Slaw of Cabbage Price Barrage”; the Chi- 
cago Daily News—“Wordy on Cabbage? Spin- 
ach! Says OPS”; and the New Vork Times 
(November 24)—“O.P.S. Cabbage Order a 26,911- 
word Myth.” Abraham H. Wolfish of Philadel- 
phia wrote hotly to DiSalle on November 24 
asking why cabbage wasn’t under control. 

But the Associated Press debunking did not 
halt the cabbage rumor. On December 5 the San 
Jose News repeated it in an editorial; the editor, 
Jack Wright, when told the facts, wrote to an 
OPS official : 


" Lregret exceedingly that we have been inaccurate in 


this way. It is bad for any newspaper to be inaccu- 
tate. AS I remember this item came to us from the 
Newspaper Enterprise Association. We quoted in 
good faith, believing it to be correct. I am very sorry. 


k Soon the cabbage regulation was immortalized 

in the newspaper column by Dr. George W. Crane 

called “Test Your Horse Sense,” distributed by 

the Chicago Tribune Syndicate. Problem 6 of 

Test Your Horse Sense” as it appeared through- 

fend country on December 14, 1951, began as 
ws: 


pss is often linked with simplicity as you may 

peeve when you try to solve this problem which I 

ay adapted from the “Inland Printer.” You de- 

items One point for each correct matching of the 

in ath the left hand column below, with their word 
gth, as indicated in the right hand column. 


The first column looked like this: 


(a) Lincoln’s Gettysburg Address. 

(b) Declaration of Independence. 

(c) The Lord’s Prayer. 

(Ù) The Ten Commandments. 

(©) OPS order to reduce the price of cabbage. 


And here was the second column: 


(v) 66 words 


(w) 197 words 
(x) 266 words 
(y) 1,348 words 


(z) 26,911 words 
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The answer, as supplied by Dr. Crane, brought 
e and z into a cozy alliance. 

Dr. Crane, besides putting the anecdote into 
the form of a quiz question, made two changes. 
One was the insertion of the 66-word Lord’s 
Prayer, But the most interesting change was in 
the wordage of the Declaration of Independence. 
By linking b with y, he gave the Declaration 
1,348 words, whereas his source, The Inland 
Printer, had said 300, (My own count is 1,322 
without signatures and 1,439 with signatures.) 
The significance of Dr. Crane’s alteration was 
that the alert skepticism that caused him or his 
staff to check the length of the Declaration ap- 
parently did not lead him to check the 26,911- 
word cabbage order. 

On December 21, the cabbage hoax crept into 
a business page of the New York Herald Trib- 
une, headed “Masters of Brevity Contrasted to 
O.P.S.” and beginning: “One of the Chicago 
commodity services notes that . . . .” The OPS 
regional information officer wrote to the news- 
paper, which printed a handsome apology on 
December 28, adding: 


The agency is attempting earnestly to do a neces- 
sary job, and ridicule can interfere with its func- 
tioning. If the regulations sometimes are lengthy, 
that is a matter of legal necessity. 


On December 28, the very day that the canard 
was crushed to earth in the Herald Tribune, it 
sprang up in the Redondo Beach, California, 
Daily Breeze. A front-page columnist in that pa- 
per, F. S. Haynes, used the item and attributed 
it to Frank J. Macomber, a Washington corre- 
spondent for Copley Press, Incorporated, which 
had a string of 14 papers in Illinois and Cali- 
fornia, including the Daily Breeze. 


THE FOGHORN VERSION 


At this point I must take the reader down a 
curious bypath. Suddenly the final sentence of 
the hoax began appearing not as a 26,911-word 
cabbage order but as a 12,962-word regulation 
on manually operated foghorns. 

Though I never discovered the fountainhead 
of the cabbage story, the foghorn version was 
easy to trace. Its inventor was Ray H. Wilson, 
then editor of Business Action (later changed to 
Washington Report), a weekly publication of the 
Chamber of Commerce of the United States. 
This is how it happened: The cabbage item came 
to Wilson’s attention. He wanted to reprint it 
but decided to check its accuracy. Quickly he 
learned that the cabbage order did not exist. The 
story, however, was really too good to discard. 
Therefore, Wilson told me, he searched for some 
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other commodity that he could substitute for 
cabbage. He found foghorns, and on November 
10, 1951, Business Action published the follow- 
ing: 


297—Words in Ten Commandments 


266—Words in Lincoln’s Gettsyburg 
Address 


12,962—Words in OPS order establishing 
the ceiling price of manually-op- 
erated fog horns and other manu- 
factured items 


What Business Action omitted to say was that 
the “other manufactured items” were about 376 
in number and included practically every kind 
of machinery—for example, oil burners, X-ray 
apparatus, tractors, trucks, abrasive products, 
soot blowers, freight cars, well-drilling equip- 
ment, escalators, and rock crushers, “Fog horns 
and whistles, manually operated” was 1 of 24 
items of marine equipment. Wilson had got his 
hands on the Machinery Regulation, one of the 
most comprehensive of all OPS orders. 

The foghorn version caught on handily. Local 
chamber of commerce publications reprinted it— 
for example, the Grand Rapids Chamber of Com- 
merce News in its December issue under the 
heading “Wonders of Bureaucacy.” Daily news- 
papers reprinted it—for example, the Sacra- 
mento Union on December 4 under the heading 
“New Deal Verbosity” (in this paper the phrase 
“and other manufactured items” became “and 
related manufactured items”). On November 21, 
1951, members of Congress received a lengthy 
mimeographed newsletter, unidentified as to 
source and containing 12 items unfavorable to 
the Truman administration. One of the items 
said: 


* * * DISALLE AND WORDAGE: In The Ten 
Commandments there are 297 words; in Lincoln’s 
Gettysburg Address, there are 266 words. In an OPS 
order establishing a ceiling price on manually op- 
erated foghorns, etc., there are 12,962 words. 


This paragraph, in which Wilson’s phrase, “and 
other manufactured items,” had been reduced to 
an inconspicuous “etc.,” appeared forthwith in 
some of the newsletters which Congressmen write 
for publication in their home districts. One such 
newsletter was that of Representative Allan Oak- 
ley Hunter of Fresno, California, This somehow 
reached the desk of Steffan Andrews, a Washing- 
ton correspondent for the North American News- 
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paper Alliance, who quoted the foghorn item ing 
feature article that appeared in various parts of | 
the country (e.g., in the Fort Worth Star Tele | 
gram of December 8). Nie 
Soon thereafter a certain editorial began ap f 
pearing in small newspapers such as the Anaheim, | 
California, Bulletin of January 1, 1952, and the 
Pottsville, Pennsylvania, Republican of January | 
31. This canned editorial, as printed in the Potts 
ville paper, began: f 


drews of the North American Newspaper Alliance 
quoted a Congressman as saying: “In the ten com 
mandments there are 297 words, in Lincoln’s Gettys- 
burg address 266 words, but in an OPS order setting | 
ceiling prices on manually operated foghorns there | 
are 12,962 words.” s 


By now, as you see, the regulation had become 
simply “an OPS order setting ceiling prices ol | 
manually operated foghorns.” The editorial con- 
tinued: iil 


No one would expect the master-minds at OPS to 
produce anything within cannon shot of the ten 
commandments or the Gettysburg address. But the | 
OPS order on the vital subject of foghorn prices i 
a perfect example of how the bureaucrats wor 
once they get a taste of power. The red-tape gels | 
thicker and thicker. The orders become mote ant | 
more verbose—and, of course, more and more Gil 
cult to understand and follow. Some of the OPS of 
ders, in fact, have been masterpieces of utter con- l 
fusion. wA 
The people who make and sell foghorns undoubt- 
edly spent some sleepless nights trying to get Ai 
teeth into OPS’ 13,000 word order. And they el 
the only ones... . In a situation like this m 
producer and merchant becomes, willy-nilly, a fort 
breaker... . More and more of the time aa 
and money of manufacturer, retailer and all otl a 
involved must be given to trying to untangle 
red-tape and to following the orders as best 
can. That’s one way to hamstring an economy. 


The foghorn version soon spent itself. One a | 
my staff reported that Henry J. Taylor, the ra E 
commentator, then sponsored by General ve y 
broadcast the item on the evening of Mai 5 but 
as an illustration of government malpractice: ‘af 
that was the last time I heard of anyones sng 
it. The great cabbage hoax, however, kept 10, 
along. 


“Kirt THAT STORY” 


On January 11, 1952, the New York nii 
of Commerce published the cabbage yarn 5 ‘The 
editorial page and instantly realized its em the 
editors sent word that they would wee he 
facts. Almost by return mail I gavè a ver” 
progress of both the cabbage and foghor 


Jo 


' 


; 


CasE REPORT 


sions, and on January 16 the publisher, B. J. 
Ridder, put the information in a front-page col- 
umn headed “Kill That Story.” Feeling encour- 
aged, I had my letter reproduced in the form of 
a memorandum to all OPS offices for their con- 
venience in explaining things to interested per- 
sons. This resulted in more debunking. For ex- 
ample, the Youngstown Vindicator (January 23) 
and the St. Louis Post-Dispatch (February 9) 
published editorials based on the memorandum. 
When a grocer named Harold Stoner inserted the 
cabbage yarn in his advertisement in the Garden 
City, Kansas, Telegram, an OPS man hurried to 
the editor with my memorandum. The editor 
wrote a piece in which he gave the true facts, 
then added, 


And speaking of words, we took the time to count 
and found out that Mr. Hall himself used 1,008 
words to complain about the cabbage hoax in his 
memorandum, 


j The truth now seemed to be worming its way 
into print with increasing frequency—but not 
fast enough to catch up with the cabbage rumor. 


. A businessman in Louisville had the untrue state- 


ment printed on blotters, according to Bill Ladd 
in the Courier-Journal of February 2. Rilea W. 
Doe, a vice-president of Safeway Stores, Incor- 
porated, recited the cabbage item to the Rotary 
Club of San Bernardino, California, according to 
the San Bernardino Telegram. The Directors 
Digest of the United States Savings and Loan 
League printed the story in its March issue and 
credited it to the St. Paul Pioneer Press. On 
March 31, 1952, the Wall Street Journal printed 
it notwithstanding the fact that its competitor, 
the Journal of Commerce, had so recently dis- 
credited the story on its front page. In the Wall 
Street Journal the item appeared in a column 
called “Abreast of the Market,” edited by Oliver 
J. Gingold, who attributed the item to Edward 
T. Prosser, of Hicks and Price, a firm of stock- 
brokers in Chicago. Morgan Beatty, an NBC 
Commentator, saw it in the Wall Street Journal 
and repeated it on the radio the same night. I 
telephoned Beatty the next day, April 1. He was 
pete: That night he explained to his listen- 
ag that no cabbage order existed, and he chided 
le Wall Street Journal for printing the story 

Without checking its accuracy! 
pot apparently some people did not hear 
Eois ty, for on May 1, as already mentioned, an- 
ae NBC program entitled “Double or Noth- 
: pred an $80 prize in connection with 
Walter bage order, The master of ceremonies, 
Mire z O'Keefe, had scarcely signed off when 
: Leona Eviline Kidwell, of North Arlington, 
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Virginia, dialed OPS headquarters in Washing- 
ton. She had read somewhere that the cabbage 
order was a myth, and she was downright be- 
wildered. We got in touch with the OPS infor- 
mation officer in Los Angeles, where the program $ 
had originated. He asked O’Keefe to correct the 
mistake on a subsequent broadcast. O’Keefe re- 
ported later that his sponsor, the Campbell Soup 
Company, did not want any correction made. An 
OPS man then wrote to Ward Wheelock Com- 
pany, of Philadelphia, the sponsor’s advertising 
agency. The reply, dated May 27, 1952, and 
signed by Robert C. Alexander, began thus: 


We understand your position in the “cabbage story” 
and are very glad to help. However, we too have a 
position, We run a quiz program, and for the most 
part we have very few errors. When one does occur, 
we don’t want to publicize it. From experience we 
have found it bad business to try to publicize such 
mistakes, 


Mrs. Kidwell received a chatty letter signed by 
O'Keefe, praising her alertness and notifying her, 


I am sending you an assorted case of Campbell soup. 
[A year and a half later, she still had not received 
the soup.) 


Meanwhile the cabbage yarn continued to be 
popular with columnists, luncheon clubs, banquet 
speakers, and letter writers. The Washington 
Times-Herald on May 6, 1952, published a letter 
signed Ed. Purcell, 3d, of Harrisonburg, Virginia, 
who began: 

It is the general consensus of most people in the 
rural areas and communities of this nation that this 


Administration is more wasteful and corrupt than 
any that has ever held sway in our nation’s capital. 


He went on to prove it by giving the cabbage 
story, which he said was taken from his Rotary 
Club’s weekly, The Rotorgraph. A week or so 
later, Dr. Thurman Sensing, head of the Southern 
States Industrial Council, made the cabbage item 
a part of his address before a large banquet of 
the State-Wide Safety Conference in Richmond, 
Virginia. The rumor was given as fact in Walter 
Winchell’s column on August 1, 1952, credited 
to the Sales Executive Club of Pittsburgh. It 
was in Bennett Cerf’s “Trade Winds” column in 
the Saturday Review on August 2, credited to 
Edward T. Prosser of Hicks and Price, Chicago, 
the same man who had fed the hoax to the Wall 
Street Journal, It was a part of a column headed 
“Flotsam and Jetsam” in the August 8 issue of 
Tide, a magazine for the advertising profession, 
credited to the Sales Executive Club of Pitts- 
burgh, which had also planted the item in 
Winchell’s column. It showed up in the Dallas 
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Times Herald on September 8 under the head- 
ing, “Today’s Chuckle.” 

By this time Mike DiSalle had been succeeded 
as OPS director by Ellis Arnall, In September 
'1952 Arnall was succeeded by Tighe Woods, who 
promptly arranged a series of meetings with 
housewives. At the Cincinnati meeting on Sep- 
tember 23 a woman named Mrs. Russell Howard 
arose and said: 


I call your attention to the fact that there are 297 
words in our immortal Declaration of Independence 
but OPS has 26,911 words in an order regulating 
the price of cabbage. 


Ohio at that time was the scene of a political 
race between United States Senator John W. 
Bricker and Mike DiSalle for Bricker’s Senate 
seat. In a speech at Ohio Wesleyan University, 
Bricker ridiculed OPS for its 26,911-word cab- 
bage regulation. DiSalle offered to give $5,000 
to Bricker’s campaign fund if Bricker could pro- 
duce the regulation. Bricker dropped the subject. 


ON THROUGH THE YEARS 


Not long afterwards I left the Office of Price 
Stabilization. In January 1953 a friend of mine, 
a naval reserve officer, was attending a manage- 
ment training course at the Navy Department in 
Washington. He reported that the instructor cited 
the OPS cabbage order as an example of how 
not to do things. A month later another friend 
telephoned and referred me to the editorial page 
of the Saturday Evening Post of February 7, 
1953. This is what I saw: 


Soviet Official Wordage 
Outdistances Even OPS 


The Soviets, as they keep telling us, have invented 
everything worth inventing. . , . Not so long ago, 
some socialist-minded Washingtonians tried to give 
the Muscovites a run for their rubles, But the evi- 
dence shows that they really never had a chance. 

We cite as witnesses the Sales Executive Club, of 
Pittsburgh, and a comparatively recent Associated 
Press dispatch from Moscow. The Pittsburgh execu- 
tives, probably scribbling carelessly on their club’s 
tablecloth, offer a set of comparative statistics on 
verbosity. They find that the Lord’s Prayer is made 
up of sixty-six words, Lincoln’s Gettysburg Address 
of 264 words, the Ten Commandments of 322 words, 
and the United States Declaration of Independence 
of 1318 words. By comparison, these statisticians 
assert, the Office of Price Stabilization’s order set- 
aao price on a head of cabbage ran to 26,911 
words. 


The editorial then said a factory in Siberia had 
received from the Kharkov Institute of Fire 
Bricks a questionnaire consisting of 1,487,400 


Case REPORT 


questions—“sure to bring a blush of shame { 
the cheeks of our OPS.” y 
OPS went out of existence on April 30, 1953 
By that time there was no doubt in my mind 
the next price control agency, if America ey 
had another, would issue a 26,911-word cabba 
order too. Nor was I surprised when someont 
handed me a column by Erich Brandeis in tl 
Charlotte, North Carolina, News of June 1, 195 
in which William Teeling, Conservative member 
of the British Parliament, was quoted as saying 
that the Lord’s Prayer had 55 words, the Magna 
Charta, 650, and the American Declaration of 
Independence, 1,800, but an explanation of 
ain’s controls on the use of shell eggs req 
29,000 words. (I wrote to Brandeis asking for h 
source; he replied that it was the Associated 
Press.) 4 
Even so, I suppose that I rather expected th 
OPS cabbage order to be laid to rest with the 
agency, there to remain unless price controls 
came back in some future emergency. In this] 
was wrong. The document seemed to have a¢ 
quired an independent life. A friend sent me thit 
July 1953 issue of a small magazine called Cob 
ton Pickers, published at Atlanta by the “Auto 
motive Booster Club, B-6, Inc.” You can gues 
what was on page 17. About this time I wrolé 
an article about the cabbage hoax, and it api 
peared in The Reporter of March 16, 1954. i 
on December 13 of that year one of my old OPS 
associates sent me a clipping from the Washing 
ton Daily News published during the prec 
week. Therein, a letter-writer signing hi 
“Veteran Worker” defended the Eisenhower aM 
ministration by attacking Truman’s OPS 1 
retrospect. 


The whole setup covered our nation like 4 
damp, costly blanket [he recalled]. The tax Pi 
were the dupes in the whole deal. It was ama 
to me once when I was invited out to Laura oh 
old teahouse during DiSalle’s “reign of cash A 
votes,” to find on the table a little folder on 
back of which was the following. [Then he quo ; 
the cabbage story and concluded] That sized up ™ 
whole stupid agency. 


man from Virginia. The cabbage regulation 
called a “recent OPS order,” though the He 
had been dead nearly 7 years. On June 25, 
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Reporter. He did so because, he said, he had 

_ seen the cabbage item in print many times. In 
closing he predicted that the cabbage story would 
persist because it caters to the prejudice of those 
who want to believe 


that Washington is filled with stupid bumblers at 
best, and dangerous enemies of True One Hundred 
Per Cent Americanism at worst. 


In 1964 came the new flurry described at the be- 
ginning of this article. 


THE CHARACTERISTICS OF RUMOR š 


A familiar saying in journalism is that a de- 
nial never catches up with a false story. Some 
false rumors, especially those passed mainly by 
word of mouth, scarcely get denied at all. The 
cabbage case is unusual in that the truth achieved 
quite a lot of circulation. Active debunking prob- 
ably counteracted the effects of the fraud to 
some extent and forestalled an indeterminable 
number of printings. But it could not stop the 
flow of this harmless-looking little joke. The cab- 
bage case illustrates the extreme copiousness and 

l overlapping of our transmission system, making 
it almost impossible to catch and kill a false tale 
of propaganda value that has appeared as fact. 
It illustrates, too, how easy it is for writers and 
editors to forget to pick up the telephone and 
cy the accuracy of an item they would like to 
print. 

The cabbage story, though it probably traveled 
more by the printing press than by mouth, fits 
into the framework of rumor as analyzed by psy- 
chologists, Allport and Postman (1947) state the 
two basic conditions for rumor thus: 


Rumor travels when events have importance in the 
lives of individuals and when the news received 
about them is either lacking or subjectively am- 
biguous [p. 2]. 


Both conditions are clearly visible in the cab- 
age case; the ambiguity seems to have been in- 
pe by emotional tensions that kept people 
Tom questioning the truth of the item or made 
them not care whether it was true or not. But 
the case also shows that sheer entertainment 
Value has much to do with a rumor’s strength. 
Entertainment value, though not a basic condi- 
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tion for rumor, plays an important part in its 
perseverance. Some who printed the cabbage 
hoax were probably motivated only by their 
search for readable items, their stock in trade. 
And even the person whose distaste for price 
controls, bureaucrats, or Democrats was causing 
him to seize upon a story 


that would relieve, justify, and explain his underlying 
emotional tension [Allport & Postman, p. 36] 


could tell others, even tell himself, that he was 
only repeating a funny story. I assume that it 
was the combination of entertainment value and 
emotional satisfaction that gave the cabbage 
hoax its special durability. The cabbage item is 
a good example of what Allport and Postman 
call “hostility rumors that are wittily cast [p. 
193]”—which are similar to, but not quite the 
same as, tendentious jokes which are merely hu- 
morous and do not claim to state a verifiable 
fact. 

The cabbage and foghorn versions, not being 
spread primarily by word of mouth, did not un- 
dergo as much change in their travels as rumors 
generally do. But the progression of the fog- 
horn version from “manually operated fog horns 
and other manufactured items” to “manually 
operated fog horns, etc.,” and finally to “manu- 
ally operated fog horns” is a striking illustration 
of the “leveling and sharpening” discussed by 
Allport and Postman (pp. 75-98). The other 
items were leveled out and the foghorns sharp- 
ened. And the cabbage hoax, reappearing in each 
successive decade with a different target, illus- 
trates what has been called a “diving rumor” 
(Allport & Postman, pp. 171-172). First (ac- 
cording to undocumented information) it was 
aimed at OPA, Then it was aimed at OPS, and 
in my judgment was significantly harmful to the 
agency. And in 1964, 11 years after OPS had 
vanished, it ridiculed nothing more specific than 
the federal government. 
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MANIFEST ANXIETY, REACTION POTENTIAL CEILING, 
AND WORD ASSOCIATION * 


CRAIG WOLFF 
University of California, Los Angeles 


The present study investigated the relationship between Manifest Anxiety (MA) 
scale scores and word-association performance. Specifically, it was predicted 
from the Broen and Storms (1961) reaction potential ceiling hypothesis that 
medium-anxious Ss would yield a greater frequency of common or dominant 
responses to word stimuli which had a single highly probable response than 
either the high-anxious or the low-anxious Ss, This same effect was pre- 
dicted to be much weaker for word stimuli which had many approximately 
equally probable responses available. These predictions were confirmed by the 
data, and the results were interpreted as both providing some support for the 
ceiling hypothesis as well as for the drive measuring properties of the MA scale. 
The 2nd part of the experiment was concerned with experimentally inducing 
stress in Ss by telling them that the word-association test was a sensitive meas- 
ure of personality disorders. The results indicated that the stress manipulation 


was ineffective. 


The Manifest Anxiety (MA) scale (Taylor, 
1953, 1956) used as a measure of the Hullian 
construct of drive has been related to word- 
association performance in two recent experi- 
ments (Goldstein, 1961; Kanfer, 1960). Both of 
these experiments indicated that college sub- 
jects who scored high on the MA scale did not 
differ significantly from those scoring low on the 
MA scale in the number of dominant associative 
responses given to a list of stimulus words pre- 
sented one at a time. Both studies used high 
commonality stimuli (i.e., stimuli to which the 
dominant associative response has a consider- 
ably higher probability of association than any 
other), and low commonality stimuli (i.e., stim- 
uli to which the dominant response is relatively 
weak, with many competing responses). On the 
basis of Hull’s original drive hypothesis as ex- 
plicated by Taylor (1956), the prediction was 
made that high-anxious (HA) subjects would 
give a greater number of dominant responses 
than low-anxious (LA) subjects to the high 
commonality stimuli which represent a relatively 


1 This work was done as part of the Honors 
Research Program in psychology under the guidance 
of D. O. Sears and R. E. Whalen, and with the 
special support and advice of C, Y. Nakamura. 
Partial support was provided by the National 
Science Foundation’s Undergraduate Research Pro- 
gram, and by Research Grant M-4493 to Charles 
Y. Nakamura from the National Institute of Men- 
tal Health, United States Public Health Service. 
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noncompetitional situation, and fewer dominant 
associative responses to the low commonality 
stimuli which represent a situation in which | 
there is no strong dominant response and manyi 
competing response tendencies of approximately 
equal strength. Since the two investigators found | 
no significant differences in the number o 
dominant responses given to either the high com” 
monality or low commonality stimuli, the 1 
sults were not consistent with Hull’s hypothe | 
sis, or with a drive theory of anxiety. A 
Broen and Storms (1961) have presented a 
dence which indicates that the relationship ‘fe 
tween drive level and the probability of em | 
sion of a dominant response takes the form d 
an inverted U, with moderate drive levei E 
ing optimal probability. In trying to account 0 
this relationship by means of Hullian ia 
these investigators suggested the utility o | 
pothesizing a reaction potential (Æ) caan 
Hullian theory Æ =f (D X H) and the Tie ) 
bility of a response is a function of the a 
ences in E values between the dominant het 
sponse and competing responses, with hig s 
relative Æ resulting in greater response D 4 
bility. Due to the multiplicative relati 
between D and H, increases in drive will att 
give relatively more E to the high H dom! re 
response than to the lower H competing ay | 
sponses; thus, differences in Æ values beai 
the dominant and competing responses Wises 
crease as drive is increased. However, inch 
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in drive beyond the level sufficient to raise the 
dominant response to Æ ceiling can only act to 
strengthen the competing responses in the re- 
sponse hierarchy and reduce the difference in E 
values between the dominant and competing 
responses. 

The main purpose of the present study is to 
determine whether the finding of no significant 
difference in word-association performance be- 
tween HA and LA subjects is consistent with the 
predictions one would make from the Broen- 
Storms modification of Hull’s hypothesis. One 
would predict from the inverted U shaped “rela- 
tionship between drive level and the probability 
of emission of a dominant response that the HA 
and LA subjects must be on opposite sides of, 
as well as an equal number of drive units away 
from, the highest point on this curve or the 
point where the probability of occurrence of the 
dominant response is greatest in order for these 
subjects to give the same frequency of dominant 
responses (see Broen & Storms, 1961, p. 410, 
Figure 2). Furthermore, if this condition were 
in fact true, one would predict that a group of 
medium-anxious (MA) subjects lying between 
the HA and LA subjects on the drive dimension 
would give a greater frequency of dominant re- 
sponses than either the HA or LA subjects. 

In addition, since a decrement in a dominant 

response theoretically results when drive is in- 
creased beyond the point necessary to raise the 
dominant E value to E ceiling, one would pre- 
dict that the largest effect due to the decrement 
to be in the high commonality stimuli since 
their dominant associative responses having a 
relatively greater H should reach Æ ceiling at 
lower drive levels than for the much weaker 
responses of the low commonality stimuli. 
_ In reference to these various predictions it 
Must be emphasized that the high commonality 
words used did have some degree of competition 
in the hierarchies they elicit. With essentially 
zero or very little competition, there would be 
nothing to compete with the dominant response 
even after it reached ceiling (Broen, Storms, & 
Schenck, 1961). The present study attempted to 
test these various predictions. 

A final aspect of this study is to observe the 
effects of stress on word-association performance 
On the three levels of anxiety. Stress is opera- 
tionally defined here as informing the subject 
that the word-association test is a sensitive 
measure of personality disorders. Three studies 
oo 1961; Saltz, 1961; Sarason, 1959) have 
penn that the effect of stress manipulations 
ae to reduce the number of dominant responses 

t common associations given to verbal stimuli. 
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METHOD 
Subjects 


Subjects were 57 male students enrolled in an 
introductory psychology course at the University of 
California, Los Angeles. be 


Apparatus 


Tape-recorded stimulus words were presented to 
an isolated subject, and both verbal responses and 
response latencies were recorded.* 


Stimulus Materials 


Two lists, each consisting of 22 stimulus words 
were selected from the Russell and Jenkins (1954) 
tables of word-association probabilities. Responses 
to the first 2 words were not analyzed. The remain- 
ing stimuli consisted of 10 words for which the 
most common response had a high probability of 
occurrence (high commonality stimuli, .61<~< 
84) and 10 words for which the most common re- 
sponse had a low probability of occurrence (low 
commonality stimuli, .24 < p < 38). These stimuli 
were placed in a random order in the list. They 
were of approximately the same number of sylla- 
bles. The total frequency of occurrence of the most 
common (i.e, dominant) response to the 10 high 
commonality stimuli in the first list was approxi- 
mately equal to the total frequency for the 10 high 
commonality stimuli in the second list. The same 
was true for the 10 low commonality stimuli in each 
list, 


Procedure 


The MA scale was group administered in a first 
session. At the second appointment an individual 
subject was told that he was to hear a list of words, 
one by one, and that he was to say the first word 
that came to him; these were the “nonstress” in- 
structions, 

All subjects were then given the first list of words. 
The order of presentation of Lists 1 and 2 was 
counterbalanced. 

Twelve HA subjects scoring 19 or above on the 
MA scale, 12 MA subjects scoring between 13 and 
18, and 12 LA subjects scoring 12 or below were 
assigned to the stress condition and were given List 
1 under the nonstress condition, and List 2 under 
the stress condition. 

Seven other subjects in each of the three anxiety 
groups were assigned to the nonstress (control) 
group and were given both List 1 and List 2 under 
nonstress conditions. 


Stress Group 

After giving verbal associations to the stimuli in 
List 1 in the nonstress condition, the experimenter 
entered the experimental room and told the subject 
that he had just given associations to a practice list 
of words and that the next list was chosen by 
clinical psychologists on the basis of its ability to 
sensitively measure problems of personality adjust- 


Sense 
2 Response latencies were not analyzed, 
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TABLE 1 


MEANS AND STANDARD DEVIATIONS OF THE 
COMMONALITY SCORES BY ANXIETY AND 
BY COMMONALITY LEVEL OF THE 
StimuLus Worps 


Anxiety 


Frequency High Medium Low 


M SD M SD M SD 


HC 403 | 108 | 483 | 149 | 394 | 151 
LC 121 14 | 136 15 | 135 18 


Note,—N = 19 in each anxiety group. 


ment. The subject was also told that if his associa- 
tions indicated that he had personality problems, 
these problems would be discussed with him at the 
end of the experiment. 


Nonstress Group 


Each subject in the control group was given List 
1 under exactly the same conditions as the stress 
group. The experimenter then told the subject that 
he had given associations to a list of words, asked 
him if the words were clear, and if everything were 
all right. 

At the completion of the experiment each subject 
was given a rating scale on which he checked how 
relaxed he was during his associations to List 1 and 
during List 2. 


REsULTS 


Each subject was assigned a commonality fre- 
quency score for each verbal response according 
to the frequency of occurrence of that response 
in the Russell and Jenkins norms. 

Table 1 presents the means and standard devi- 
ations of the commonality frequency scores of 
the high commonality (HC) stimuli and the low 
commonality (LC) stimuli of List 1 for the three 
levels of anxiety. Inspection of this table reveals 
that MA subjects gave a greater frequency of 
common (dominant) responses to the HC stim- 
uli (M = 483) than either the HA (M = 403) 
or the LA (M = 394) subjects, An analysis of 
a specific comparison between the HA subjects 
and LA subjects on the frequency scores of the 
HC stimuli was found to be nonsignificant. A 
further specific comparison of the MA versus 
the average of the HA and LA subjects was 
found to be significant (F=4.71, df= 1/54, 
b <.05); dominant responses were given more 
frequently by the MA subjects than by HA and 
LA subjects, 

For the LC stimuli Specific comparisons of 
HA versus LA (F = .75, df = 1/54) as well as 
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MA versus average of HA and LA (F=.29, df 
= 1/54) were found to be nonsignificant. 
Analysis of variance revealed no significant 
differences between the stress and nonstress 
groups at any of the three levels of anxiety. 


Discussion 


The MA subjects gave a higher frequency of 
dominant associative responses to a set of HC 
verbal stimuli than did either the LA or the HA 
subjects, This finding is congruent with the 
predjctions based on the Broen-Storms reaction 
potential ceiling hypothesis that if the HA and 
LA subjects did not differ in the number of 
dominant associative responses given to a set of 
HC verbal stimuli then the MA subjects would 


give a greater number of dominant responses ° 


than either the HA or the LA subjects. 

The finding that the HA and LA subjects did 
not differ in word-association performance is 
consistent with the findings of Kanfer (1960) 
and Goldstein (1961). 

As Jessor and Hammond (1957) have pointed 
out, the clinically derived MA scale may not 
have construct validity with respect to the 
Hullian construct of drive. The data of, the 
present experiment provide some evidence for 
a relationship between the MA scale and the 
Hullian construct of drive. Although the MA 
scale has not been shown to be related to the 
Hullian construct of drive consistently, if future 
research will be consistent with the present find- 
ings, one will be able to place increasing cre- 
dence in both the ceiling effect and in the 
effectiveness of the MA scale as a measure of 
the Hullian construct of drive. 

Finally, since the results indicated that the 
stress manipulation was ineffective, it is sug- 
gested that future work of this type incorporate 
stronger or more operationally clear stress 
manipulations, 
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EFFECTS OF APPARENT PAIN IN OTHERS ON OBSERVER’S 
REACTION TIME* 


VINCENT Dı LOLLO? anp SEYMOUR M. BERGER 
Indiana University 


The reaction time (RT) of 4 groups of 25 Ss each was measured under the fol- 
lowing conditions: Group SM (shock-movement) was told that an electric 
shock was being delivered to another person (performer) who responded with 
jerky arm movements; Group SNM (shock-no movement) received the same 

. instructions but the performer made no arm movement; Groups NSM (no 
shock-movement) and NSNM (no shock-no movement) received the same 
treatment as Groups SM and SNM, respectively, with the omission of the 
shock instructions. The RT of Group SM was significantly faster than the other 
3 groups which performed at approximately the same level. The results are in- 
terpreted as supporting the notion that exposure to emotional manifestations 
of other people has activating effects on the observer’s RT. 


The behavior of an observer has been shown 
to be effected by exposure to emotional mani- 
festations of others (Allport, 1924; Bandura, 
1962; Berger, 1962; Church, 1959). In these 
studies, changes in the observer’s responses have 
„been viewed as resulting at least in part from 
a heightening of the arousal component of 
motivation in the observer. In a series of studies 
in which an observer witnessed the delivery of 
an apparent shock to another person, Berger 
inferred the occurrence of motivational changes 
from variations in the observer’s GSR at the 
time of the alleged delivery of the shock. It is 
the purpose of the present study to investigate 
further the motivational aspects of such situa- 
tions as affecting a behavioral, rather than a 


* This research was supported in part by a Faculty 
Research Grant from the Graduate School, Indiana 
University, to the first author, and by Grant MH 
07528-01 from the National Institute of Mental 
Health, United States Public Health Service, to the 
second author, We are grateful to Lorna M. Di- 
Lollo for her careful collection of the data. 

2 Now at the University of Western Australia, 


physiological response. A response suitable for 
this purpose is simple reaction time (RT) which 
has been shown to increase in speed following 
increments in the level of motivation (Johanson, 
1922). It may be expected that, under the 
experimental conditions described by Berger, 
exposure of an observer to apparent shock de- 
livered to another person should result in an 
increment of the observer’s level of arousal and, 
in turn, to an increment in the speed of RT. 


METHOD 
Apparatus 


An inductorium was mounted vertically on a 
wooden box and a voltmeter dial was mounted on 
the same box directly above the inductorium. The 
box was then clamped to one end of a small table 
and was illuminated by a 40-watt desk lamp which 
was the only source of illumination in the room. 
A button switch was mounted on the other end 
of the table, and was connected with a Standard 
Electric .0l-second timer located tn a recording 
room adjoining the experimental room. Whenever 
the experimenter started the clock, the light dimmed 
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REACTION TIME (SEC 
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TRIALS 
Fic. 1. Mean RT for each group. 


in the experimental room and the needle on the 
voltmeter deflected across the dial. The timer could 
be stopped by pressing the button switch in the 
experimental room. 


Performer 


The subjects were tested in pairs but one member 
of the pair was always the experimenter’s research 
assistant (a female undergraduate, hereafter referred 
to as the “performer”) who always acted according 
to a prearranged plan. 


Subjects and Procedure 


The subjects were 100 female students attending 
the introductory psychology course at Indiana Uni- 
versity, and were randomly allocated to four groups 
of 25 subjects each. Each group received one of the 
following conditions: 

Shock-movement (SM). Each subject and the 
performer (who appeared to be just another sub- 
ject) sat in a waiting room until ushered to the 
experimental room. The two were then told that 
the study investigated the effects of previous experi- 
ence with electric shock on RT, and one of them 
had to be shocked (the performer volunteered). The 
subject was then reassured that she would not be 
shocked at all, and would be used only as a control. 
The performer’s hand was strapped to the induc- 
torium, and the experimenter announced that on 
each delivery of the shock the needle on the dial 
above the inductorium would deflect all the way, 
and then return to O after 1 second (the duration 


of the shock). The shock was, in fact, never de- 
livered but every time the needle moved the per- 
former jerked her arm as if in pain. The subject 
sat near the button switch with her hand on a 
starting position 18 inches from the button. Her 
task was to press the button as fast as possible 
every time the needle moved. Since the movement 
of the needle corresponded with the starting of the 
clock which was stopped by pressing the button, 
a measure of the subject’s RT was actually taken. 
In the instructions it was made abundantly clear 
to the subject that: the button was entirely un-- 
related to the delivery or cessation of the shock, 
and that she would not receive any electric shock 
at any time. Twelve RT trials were then given, 
with an intertrial interval varying between 8 and 16 
seconds with a mean of 12 seconds. 

At the end of the session the subject was asked 
whether she expected to receive an electric shock 
at any time during the experiment, and whether 
she believed that the other person was, in fact, 
being shocked. 

Shock-no movement (SNM). This was the same 
as the SM condition with the exception that the 
performer kept her arm entirely still. 

No shock-movement (NSM). The subject was 
told that the study investigated RT following prac- 
tice with making arm movements. The performer 
clutched the wooden base of the inductorium and 
shook her arm as in the SM condition every time 
the needle moved. In all other respects this group 
received the same treatment as Group SM with the 
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exception that the subjects were not asked the 
final two questions. 

No shock-no movement (NSNM). The subject 
was told that the study investigated RT and that 
subjects were being tested in pairs to save time 
with reading instructions. The performer was asked 
to sit in the chair near the inductorium and just 
wait her turn. In all other respects this group 
received the same treatment as group NSM. 


RESULTS 


Mean RTs for each group on each trial are 
presented in Figure 1, An analysis of variance 
was performed on the individual scores over the 
last 10 trials, and the results are summarized in 
Table 1. The last column of this table shows 
the F ratios obtained in the between-groups 
¿analysis of variance performed on the slopes 
of the linear component of each subject’s RT 
during the last 10 trials. The significant effects 
of shock (S) and movement (M) on level of 
performance (Table 1) lose their relevance 
when compared with the significant SX M 
interaction. Taken together with the informa- 
tion supplied in Figure 1, this interaction indi- 
cates that a higher speed of RT was found only 
when the alleged delivery of the shock was 
accompanied by the arm movement. 

In answer to the two questions, no subject 
reported that she expected to receive an electric 
shock; all the subjects in Group SM believed 
that the performer was being shocked, while 23 
of the 25 subjects in Group SNM believed that 
the performer received only a mild shock or 
was not shocked at all. 


DISCUSSION 


The results lend support to the notion that 
exposure to apparent shock delivered to an- 
other person may have activating effects on the 
observer’s response, Furthermore, the signifi- 
cant SX M interaction may be viewed as indi- 
cating that the effects of performer-shock on 
the subject’s RT are manifest only in the pres- 
ence of some corroborative evidence that the 
shock is, in fact, being delivered. It should be 
noted that the present RT measure differs from 
the GSR measures obtained under similar con- 
ditions; the GSR can be attributed to a particu- 
lar aspect of the situation while the RT measure 
cannot. 

_ There are at least three possible interpreta- 
tions of the factors underlying the increments in 
. the subject’s arousal: increments in arousal due 
to the subject’s indignation at the socially dis- 
approved practice of hurting other people; incre- 
ments in arousal due to fear of being shocked; 
. *ìncrements in arousal through vicarious instiga- 
tion (VI) which was defined by Berger (1962) 


SS 


TABLE 1 


F RATIOS FOR LEVELS AND SLOPES OF RT OVER 
THE Last 10 TRIALS 


Source af Level Slope 
M 1 — 1.75 
Shock (S) 1 19.54* 61 
Movement (M) 1 12.08* 1.76 
SXM 1 11.68* 257 
Error MS 96 (2647) (13148) 
*p <.001. 


as the occurrence of an emotional response in 
an observer as the result of his exposure to the 
emotional manifestations of another person. The 
individual relevance of these possibilities cannot 
be evaluated in the present study. However, it 
seems likely that if either the first or the second 
alternative were true, the increments in arousal 
should occur as a result of the instructions; 
unless confirmation of delivery of the shock is 
held to be necessary, such increments in arousal 
should result in faster RT on the very first 
trial, which was patently not the case. The 
second possibility is further infirmed by the 
subjects’ reports that they did not expect to 
be shocked during the experiment. The incre- 
ment in the performance of Group SM after the 
first trial is, however, in line with the VI con- 
cept which predicts such increments to occur 
only following the subject’s exposure to the 
performer’s manifestation of pain. 

When viewed in terms of Berger’s (1962) 
paradigm, the present results suggest that VI 
manifests itself not only in emotional arousal 
(Berger, 1962) but also in behavioral changes. 
Such changes may be described as an example 
of a type of social facilitation which is of 
relevance to an analysis of the motivational 
aspects of social learning phenomena. 
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SEX DESIGNATIONS OF RIGHT 


AND LEFT BODY SIDES AND 


ASSUMPTIONS ABOUT MALE-FEMALE SUPERIORITY * 


SEYMOUR 


FISHER 


Upstate Medical Center, State University of New York, Syracuse 


The hypothesis was evaluated that the greater the degree to which the right 
side of one’s body is identified with one’s own sex and the left side with the 
opposite sex the more likely is one’s own sex to be seen as relatively superior 
to the opposite sex in power relationships. A semiprojective puppet technique 
was used to determine the degree of masculinity-femininity associated with 


each body side. Attitudes with regard to 


power relationships between the sexes 


were measured by asking Ss to judge whethor the male or female in a series of 


paired pictures was apparently the more 
males and 52 females, Consistent support 


There is considerable literature which specu- 
lates about the symbolic meaning of right and 
left and more specifically about the significance 
of the right and left sides of one’s body. It is 
typically suggested that the right side of one’s 
body is perceived as stronger and “better” than 
the left, Sometimes the difference between the 
two sides is depicted in terms of the right being 
associated with masculinity and the left with 
femininity. These speculations are variously de- 
rived from observations of schizophrenics and 
patients with organic brain pathology who de- 
velop delusions in which their right and left 
sides have different identities (Fenichel, 1945; 
Schilder, 1935); from folklore and myths 
(Hertz, 1960) in which right and left are given 
dramatic symbolic meaning; and from semantic 
analyses of words denoting right and left in 
different languages (Blau, 1946), Empirical sup- 
port for such speculations has been fragmentary. 
It is true that experimental evidence exists that 
individuals vary considerably in the degree to 
which they are aware of a differentiation be- 
tween their right and left body sides (Fisher, 
1958, 1960). It is also true that clear-cut right- 
left gradients of GSR reactivity have been 
measured which are significantly linked with 
such variables as degree of personal disorganiza- 
tion (Fisher & Cleveland, 1959), body-image 
disturbance (Fisher & Abercrombie, 1958), and 
qualities portrayed in figure-drawing produc- 
tions (Fisher, 1959). One must add that there 
are a few studies which indicate that the right- 
left directionality ascribed to moving stimuli 
may reflect trait characteristics (Fisher, 1961; 
Thurstone, 1952). However, in general it must 
be accepted that our empirical knowledge about 
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dominant of the 2. Ss consisted of 50 
for the hypothesis was obtained. 


the nature of the evaluative distinctions we 
make between the right and left sides of our 
bodies is quite limited. 

The present study is concerned with the rela- 
tive sex-role characteristics which the individual 
ascribes to the right and left sides of his body. 
It is assumed that persons do differ in the degree 
to which they attribute male and female quali- 
ties to their two body sides. Further, it is 
assumed, in keeping with the usual speculative 
formulation, that the right side of the body is 
perceived as stronger and more powerful than 
the left. In this context the specific hypothesis 
was formulated that the greater the degree to 
which an individual ascribes his own sex charac- 
teristics to his right side and those of the 
opposite sex to his left side the more likely he 
is to perceive his own sex as relatively superior 
to the opposite sex. That is, if one identifies the 
presumed “strong” side of one’s body with one’s 
own sex and the “weak” side with the opposite 
sex, this may be taken as an indication that a 
corresponding differentiation is made in th 
power status of the two sexes. { 

It should be emphasized at this point that the 
hypothesis is intended at a more general level 
to provide an element of support for a view 
expressed elsewhere (Fisher & Cleveland, 1958) 
that the value one assigns to a given body region 
reflects a projection upon the body scheme of 
a related attitude which has been adopted in 
relation to an issue or object which is of special 
significance in one’s environs. In this instance, 
the perception of a particular power difference 
between the sexes is considered to be reflected 
in the assignment of differential sex attributes to 


the right and left body sides. Initial studies © 


have already shown promise in demonstrating 
correlations between the individual’s values and 


I 


y 
P; 


attitudes and the patterning of his body scheme. , 
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Thus, it has been observed that the qualities one 
assigns to one’s body-boundary regions are 
linked with intensity of achievement motivation 
(Fisher, 1963). Also, evidence exists that an 
individual’s perception of his own height may 
be influenced by the degree to which he feels 
inferior or depreciated (Popper, 1958). Even 
further, it has been demonstrated that the size 
ascribed by an individual to his arm may be a 
function of whether it is passive or pointing at 
a meaningful target (McFarland, Wapner, & 
Werner, 1960). 


PROCEDURE R 


The degree to which the subject links mascu- 
linity or femininity with each of his body sides 
was measured by means of a previously used semi- 

= projective technique (Fisher, 1960). He was asked 
to place one narrow paper cylinder on the middle 
finger of his right hand and an identical cylinder 
on the corresponding finger of his left hand. He 
was told to imagine that each cylinder represented 
a puppet. He was to assign a name to each and 
compose a brief story in which the two characters 
he had named interacted in some manner. Four 
sets of characters and accompanying stories were 
first obtained spontaneously and without directive 
instructions. Then, one story was secured with the 
instwiction to name and imagine an interaction be- 
tween two characters “from some culture other than 
the United States.” Finally, a story was obtained 
with the instruction to imagine “two people who 
are involved in a very dramatic situation.” The 
special instructions given for the last two stories 
were adopted because preliminary trials indicated 
that it was difficult to motivate most subjects to 
create more than three or four stories completely 
spontaneously. 

Scoring of the subject’s productions was con- 
fined entirely to the sexes of the names he assigned 
to the total of 12 story characters he created. The 
masculine or feminine designation of the name 
applied to a body side was the basic scoring unit. 
For male subjects a score was tabulated which 
equaled the number of instances in which simul- 
taneously a male name was applied to the right 
hand and a female name to the left. The analogous 
Score derived for the female subjects was equal to 
the frequency with which there was simultaneously 
a female name applied to the right hand and a 
male name to the left. This scoring procedure was 
intended to reflect the specific fact of having as- 
signed one’s own sex to the right or “strong” side 
and the opposite sex to the left or “weak” side. 
When both right and left were simultaneously 
labeled as being of the same sex, the names were 
not included in the score total. This was also true 
for response patterns in which the right was identi- 
fied with the opposite sex and the left with one’s 
Own sex. It should be parenthetically noted that 
subjects often expressed overt conflict and indecision 

. + about whether to choose a male or female name 
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for a given body side. Some would debate for as 
long as several minutes before reaching a decision. 
The stories which were related by the subjects about 
each pair of puppets were not analyzed. These 
stories were obtained simply to insure that the 
subjects would experience some degree of ego inf 
volvement and not see the task as one merely 
requiring the assignment of names to right and left. 

Of course, underlying the entire procedure was 
the assumption that each puppet took on special 
meaning for the subject in terms of its right versus 
left placement. The tenability of this assumption 
will be shown at a later point in terms of the 
fact that any but a right-left perspective on the 
data fails to explain the results obtained. 

The hypothesis under consideration required a 
measure of the subject’s concept of the relative 
power status of the male versus the female. The 
technique adopted to secure this measure was based 
on judgments of male and female pictures; and 
may generally be compared to analogous procedures 
involving ratings of faces which have variously 
been used to tap prejudice (Sherif, 1935) and emo- 
tional states (Murray, 1933). Each subject was 
presented with a series of 40 paired pictures of 
faces (front view). Twenty of the pictures involved 
male-female pairs. Ten involved male-male and 10 
female-female pairs. The subjects were given the 
following instructions: 


You will be looking at a number of pairs of 
pictures. Each pair is presented on a card that 
is numbered. Also, each picture is identified by a 
letter. Study each pair and try to imagine from 
the pictures what sort of person each of the 
pictured persons is. Then, imagine that they were 
to meet each other and set up a personal rela- 
tionship. If this happened, which of each pair 
would probably turn out to be the dominant one, 
the one who took leadership? 

On the sheet of paper you have indicate next 
to each number the letter of the person you think 
would be the dominant one. Consider this to be 
a guessing game. Don’t worry about whether you 
are right or wrong, 


It was assumed that the greater the degree to 
which a subject designated one sex as likely to be 
dominant over the other sex in his judgments of the 
20 pairs involving a male and a female figure the 
more he was indicating his assumption of relative 
superiority for that sex. 

The pictures used were taken from a college year- 
book in which poses and photography were standard 
for all. Male and female pairs were randomly com- 
posed from a pool of pictures, but a few were 
eliminated in which one of the pair seemed obvi- 
ously to have a more aggressive expression than 
the other. In half of the pictured pairs the male 
was on the right and in half the male was on the 
left. The male-male and female-female pairs were 
introduced into the series in order to conceal from 
the subject that his judgments were to be analyzed 
with relation to male-female differences. The 40 
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TABLE 1 


CHI-SQUARE ANALYSIS OF RELATIONSHIP OF RIGHT- 
Lerr PUPPET SEX SCORES TO JUDGMENTS 
OF MALE-FEMALE SUPERIORITY 


Males Females 
Number 
ste Male puppet right Female puppet right 
ae and female left and male left 
superior 
He L xe H È x? 
H 20 8 | 3.5* 9 17 | 4.7% 
L 10 12 18 10 


aH = above median, L = at median or below. 
*p <.05, one-tailed test. 
* p < ,02, one-tailed test. 


pictures were presented in a randomized order which 
was the same for all subjects. 


Subjects 


The subjects consisted of 50 males and 52 female 
college students who were recruited by payment 
of a fee. The median age in both groups was 20. 
To control for handedness all subjects were right- 
handed. In order to qualify as right-handed, a 
subject had to indicate that he wrote and ate with 
his right hand and also considered it to be stronger 
than the left. All the procedures were administered 
on an individual basis. 


RESULTS 


In the male group the median number of 
instances in which the right-hand puppet was 
designated male at the same time that the left 
was designated female was O (range: 0-4), In 
the female group the median for labeling of 
right as female and left simultaneously as male 
was 1 (range: 0-5). 

As for the picture superiority judgments, it 
was found that the median tendency in the male 
group was to choose 11 males as superior to 
females (range: 5-19), The corresponding 
eae in the female group was 12 (range: 
7-17). 

As is apparent in Table 1, the results are 
supportive of the hypothesis in both sex groups. 
Thus, male subjects with above median fre- 
quency of puppet designation patterns in which 
the male is on the right and the female on the 
left exceed those at or below the median in this 
respect with regard to the frequency with which 
they judge males to be superior to females 
(x? =3.5, p <.05, one-tailed test), Relatedly, 
one finds that female subjects who are above 
the median in labeling the right hand as female 
and the left as male surpass those at the median 
or below in this respect in ascribing relative 


superiority to female pictures (x? = 4.7, p< 
.02, one-tailed test). 

Since there were also puppet designations in 
which simultaneously the right puppet was 
identified with the opposite sex and the left 
puppet with the subject’s own sex, the question 
does arise as to how the results would be influ- 
enced if such designations were balanced 
against those in which the right puppet is identi- 
fied with the subject’s own sex and the left with 
the opposite sex. It could be argued, for example, 
that if a male subject gives two stories in which 
the right puppet is male and the left female 
he should not be classified as perceiving his 
right side as masculine if he simultaneously 
gives as many or more stories in which the 
obverse sex designations of the right and left 


puppets occur, An analysis was therefore under-“ 


taken of all cases in which both types of right- 
left puppet response patterns were present. This 
involved 44 males and 51 females. In each sex 
group, whenever the number of responses depict- 
ing the same sex on the left and the opposite 
sex on the right equaled or exceeded the num- 
ber in which the same sex was on the right 
and the opposite sex on the left, a score was 
derived by subtracting the first mentioned from 
the second. If the first was less than the sevond, 
no change was made in the second. These 
derived scores were related to the picture judg- 
ments by means of chi-square. It was found 
that in the male group those who were above 
the median in labeling the right puppet as male 
and the left as female exceeded those who were 
at or below the median in this respect in 
choosing males as superior to females (x? = 3.0, 
~<.05, one-tailed test). However, in the 
female group those who were above the median 
in portraying the right puppet as female and 
the left as male were significantly more inclined 
than those at or below the median in this respect 
to perceive females as superior to males (x? 
=2.7, ~<.05, one-tailed test). Thus, al- 
though the hypothesis under consideration was 
phrased specifically in terms of a count of 
puppet response patterns in which one’s own sex 
is on the right and the opposite sex on the left, 
it is generally supported by the more complex 
scoring procedure just described. 

It seemed important to establish that the over- 
all results were unique to the matter of right- 
left differentiation and could not be duplicated 
merely by counting the gross frequency with 
which male and female names were applied to 
puppets. Therefore, in each sex group the total 
number of male puppet names was related to 
the picture judgment scores; and the same was 


om 
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done for the total number of female puppet 
names. In all four of these comparisons the 


` chi-square values proved to be of a chance order. 


Discussion 


The results provide considerable support for 
the hypothesis. It is apparent that the specific 
equation of right-side puppets with one’s own 
sex and left-side puppets with the opposite sex 
is indicative of a trend to place one’s sex as 
relatively high in the male-female power rela- 
tionship. It has been assumed that the sex roles 
applied to the puppets reflect attitudes that the 


. individual has developed with regard to the right 


and left sides of his body. While there has been 
no direct demonstration of such a relationship, 
the fact of its probability has been enhanced 
by showing that only a data analysis based on a 
right-left differentiation of response provides 
support for the hypothesis. It could, of course, 
be argued that similar results might have been 
obtained by the use of puppets in right-left posi- 
tions which were not attached to one’s body in 
any fashion. That is, the mere matter of right- 
left differentiation rather than right-left dif- 


` ferentiation of one’s body areas might be viewed 


as thg prime parameter. However, while it 
would be interesting to repeat the present study 
with puppets in right-left positions that were 
not attached to the body, it would not in any 
way invalidate the present data interpretations 
if a similar pattern of results appeared. One 
must consider that right and left have meaning 
only with reference to one’s body as an axis or 
marker. Presumably, the same or similar right- 
left differentiations in the body scheme would 
be tapped in making right-left judgments about 
objects “out there” as upon one’s own body. 
Indeed, Gerstmann (1942), Benton (1959), and 
Critchley (1953) have called attention to the 
intimate connection between the ability to dis- 
criminate the right-left sectors of one’s body and 
to perceive accurately the right-left dimensions 
of non-self-objects, 

‘ At a general level, the results provide another 
Increment of support for the view that the body 
Scheme is an expression in body terms of atti- 
tudes about significant issues and objects in 
one’s environs. This view suggests that one’s 
body is unique as a perceptual object because 
it is part of the perceiver. As such, it is closer 
to the perceiver and likely to elicit a higher 
level of ego involvement from him than most 
other perceptual objects, In this unique position 
it easily becomes, as do other highly ego- 
involving objects, a target upon which the indi- 


_ Vidual projects feelings, values, and attitudes. 
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Thus, one body sector may be assigned values 
related to attitudes about incorporation. Another 
sector may be linked with feelings related to 
the expression of anger. While in the present 
instance, values have apparently been assigned 
to the right and left sides as a function of 
assumptions about male-female power relation- 
ships. Of course, this model regarding the body 
scheme is speculative, but it offers abundant 
opportunity for the formulation of testable 
hypotheses. 
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“INTERPERSONAL” EXPERIMENTS, WITH A COMPUTER MODEL 
OF PERSONALITY * 


JOHN C. LOEHLIN? 
University of Nebraska 


2 copies of a computer model of personality were placed simultaneously in a 
computer and allowed to interact, the responses of one serving as the stimuli 
for the other, and vice versa. In several experiments, the initial attitudes of 
the models, personality traits, roles, and situational variables were manipu- 
lated, and the resulting patterns of interaction observed. The output sequences 
bore at least some resemblance to human interaction patterns, being complex, 
phasic, and sensitive to some of the kinds of variables that affect human 


interaction. 


The possibility of using digital computers to 
simulate complex psychological processes is by 
now well known to psychologists (e.g., Hovland, 
1960; Newell, Shaw, & Simon, 1958; Taylor, 
1960; Uhr, 1963), but it is still far from clear 
just how much and in what ways this model- 
building method will contribute to theorizing in 
the area of personality and social psychology 
(Tomkins & Messick, 1963). A potential merit 
of representing psychological processes by com- 
puter programs is that (at least in principle) 
such programs, once written, can be combined 
in various ways as subunits of larger programs, 
with possibly interesting emergent properties. 
Thus, for example, various specific models of 
cognitive, motivational, and learning processes 
might be combined into a general behavior model, 
or two or more models of individuals combined 
to form groups. The present paper reports some 
experiments of the latter variety in which two 
copies of a computer model of personality were 
placed together in a computer and allowed to 
interact with each other. 


1 Portions of the material in this paper were pre- 
sented at a symposium at the APA meetings in St. 
Louis in 1962. I wish to express my gratitude to the 
University of Nebraska Computer Center for mak- 
ing available the computer time required for these 
experiments, 

2 Now at the University of Texas, 


The personality model employed for these ex- 
periments has been described elsewhere (Loehlin, 
1962, 1963). Briefly, the model—named Aldous— 
operates as follows: Stimulus situations (coded 
as 7-digit numbers) are presented to it, and vari- 
ous subsystems within the model mediate its re- 
sponse, There are three main subsystems: “recog- 
nition,” “emotional reaction,” and “action prepa- 
ration,” plus immediate and permanent memories. 
In these various subsystems the input is recog- 
nized, an emotional reaction is developed to it 
(based on previous experience with similar situa- 
tions, plus the model’s current emotional state), 
and an appropriate action is selected and exe- 
cuted—that is, printed out by the computer. 
The model has three emotions: one positive, 
corresponding to love, desire, or attraction, and 
two negative, corresponding to anger and fear. 
Associated with these are a positive action, ap- 
proach, and two negative actions, attack and 
withdrawal. The model may also show indiffer- 
ence or conflict. Depending on the action taken 
in a given situation, the model may or may not 
receive further consequences—if so, these will 
modify its memory of the situation, and on sub- 


sequent encounters with this or similar situa-. 


tions its emotional reaction—and hence its be- 
havior—may be different. 

One may speak of the “personality” of such a 
model in two senses. First in terms of attitudes: 


ra 
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as a result of the particular set of experiences it 
|. has had, a given model has a particular set of 
attitudes toward the situations it has encoun- 
tered, as well as more general attitudes toward 
+ classes of objects or situations, and broad, per- 
vasive attitudes toward its environment as a 
whole. Second, one may speak of personality in 
terms of traits. In the subsystems within the 
model are several constants which govern vari- 
ous features of its behavior: the number of 
errors made in recognition, the degree to which 
specific or general features of the stimulus situa- 
tion are reacted to, the vigor with which emotion 
- is translated into action, the rapidity of memory 
change, and so on, By setting these constants at 
various values, we can have models with differ- 
sent personality patterns: a hesitant Aldous, a de- 
cisive Aldous, an impulsive Aldous, an intellec- 
tualizing Aldous, and so forth. 

While this description of the model has been 
largely in psychological rather than computer 
language, it should of course be kept in mind 
that each of the psychological terms corresponds 
to some computational or logical feature of the 

/ . computer program—usually quite a simple one. 
Each of Aldous’ emotions, for example, is repre- 
sented by a single number which may vary in 
7s magnitude from 0 to 9; the justification for re- 
ferring to one of these numbers as “fear” and 
another as “anger” lies solely in the nature of 
their relationship to other events in the system— 
“fear,” for instance, is linked more closely to 
variables like “injury” and “withdrawal,” and 
“anger,” to variables like “frustration” and “at- 
tack.” Likewise, the change from a “hesitant” to 
a “decisive” personality is again the change of 
a single number: the constant that determines 
how vigorously action follows from emotion. Ob- 
Viously no pretense is made that the present 
model reflects the full complexity of person- 
,  ality—just that it includes enough to be inter- 
esting. 

It may be useful to contrast Aldous as a per- 
sonality model with some other existing attempts 
at computer models of personality processes, 

` such as those of Abelson (1963) and Colby 
(1963). Aldous is much simpler and much more 
abstract than either of the other two models. 
Both Colby and Abelson attempt to stay rela- 
tively close to the moment-by-moment flow of 
Psychological events; the attitudes of their mod- 
els have much more elaborate content; and the 
‘mechanisms for attitude transformation are 
_ Specified in considerable detail. By contrast, 
» Aldous is less a detailed simulation, and more 
a schematic model of personality. 
Also closely related to the present research is 
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the work of Gullahorn and Gullahorn (1963, 
1964). They have focused directly on the in- 
terpersonal situation, constructing their model 
around several propositions of Homans concern- 
ing interpersonal interactions. Again, the repre- 
sentation of the attitudes of the participants and 
the details of the interaction process are at a 
considerably finer level of detail than in the 
present model. 

The question of what might be an optimal 
level of detail for a computer model of person- 
ality is a difficult one, to which there is doubtless 
no simple answer. At this point, however, all 
that need be argued is that a number of some- 
what conflicting considerations are involved, and 
what a less complex model loses in fidelity may 
at least partly be gained back in greater man- 
ageability. 


INTERPERSONAL EXPERIMENTS 


The responses of a model like Aldous can be 
studied in various “environments,” by supplying 
appropriate series of “situations” to it. Some ex- 
periments of this kind have been described else- 
where (Loehlin, 1963). The experiments reported 
in the present paper, however, examine the be- 
havior of the model in an environment provided 
by another model of its own kind; the distinc- 
tion between “model” and “environment” thus 
disappears, and one simulation serves for both. 

The situations responded to by the models 
were arbitrarily specified as sequences of two 
actions: the other model’s act in the context of 
the model’s own previous action—thus, for ex- 
ample, an attack by B was evaluated differently 
if it was in response to A’s attack than if it was 
in response to A’s positive approach. While the 
effect of varying the length of sequence re- 
sponded to was not systematically explored, 
some preliminary work with response to single 
actions suggested that under this condition fairly 
uninteresting behavior would result. 

The general arrangement for the experiments 
to be described was as follows: Two models are 
placed in the computer, along with a third pro- 
gram which represents the experimenter and cer- 
tain features of the real world which determine 
the consequences of various interaction se- 
quences. Three kinds of consequences may oc- 
cur: injury, satisfaction, and frustration. In gen- 
eral, injury results from the other model’s attack, 
if steps are not taken to avoid or counter it; 
satisfaction, from the other model’s positive ap- 
proach (in neutral or positive circumstances) or 
from successful attack or escape behavior; and 
frustration, from an incongruent reply to a posi- 
tive or negative approach, or from interference 
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with attempted attack or escape. Each response 
sequence is begun by the control program in- 
ducing one of the models to act—the model and 
the initial act are selected randomly. The second 
model then reacts to the action, the first replies, 
and so on. After each action, the control pro- 
gram evaluates the consequences which would 
ensue from that action in response to the other’s 
action; these consequences are supplied to the 
model and modify its subsequent emotional re- 
action, An interaction sequence continues until 
a sequence of acts occurs which terminates it: 
these include vigorous withdrawal on the part 
of either participant, mild withdrawal if ignored 
by the other, mutual indifference on a series of 
trials, and consummatory sequences of mutual 
strong affection or attack. (Indeterminate se- 
quences are also stopped after some specified 
number of trials.) After each interaction se- 
quence the attitudes of each model toward the 
other are measured and recorded by the con- 
trol program, and another interaction sequence is 
started. A particular experiment continues until 
some apparently terminal relationship between 
‘the models is reached, or until a set time limit 
is exceeded. Two experimental runs will be de- 
scribed in order to suggest something of the 
flavor of the interactions which occur, and then 
a number of further experiments summarized 
briefly. The various outcomes described should 
be regarded as illustrative rather than conclu- 
sive, since a local change of computers made it 
infeasible to evaluate fully the effects of some 
of the variables discussed. Still as the results 
stand they may serve to suggest something of 
the potential scope of a model of this kind. 


Experiment I: Both Attitudes Negative 


In the first experiment to be described, two models 
(A and B) were used, both of which began the ex- 
periment with mainly fearful and hostile attitudes, 
the fearful attitudes being somewhat stronger. Each 
had its traits set to yield a fairly impulsive and 
readily modifiable personality, 

The first 14 interaction séquences were brief, usu- 
ally only one or two responses; each sequence was 
terminated by B’s retreat. A’s behavior, in the se- 
quences where he got to show any, was attack or 
conflict (presumably between hostile and fearful 
tendencies). During this period, B showed an in- 
creasing predominance of fearful over hostile atti- 
tudes. A’s fear, on the other hand, decreased towards 
the level of his hostility. On Sequence 15, for the 
first time, A did not permit B to retreat but fol- 
lowed him up aggressively—this led to a hostile ex- 
change finally terminated by B’s escape. The next 
9 sequences were somewhat similar: aggression with 
occasional admixtures of conflict or even positive 
approach behaviors on the part of A, and ageres- 
sive and fearful behavior on the part of B; each 
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sequence was terminated after from 1 to 19 inter- 
actions by B’s escape. During this time B’s attitudes 
were becoming less fearful and more hostile; A 
showed comparatively little change. On Sequence 25, 
after a couple of tentative retreats, B stayed for the 
first time and fought back—the sequence ended in 
mutual strong attack. This happened more fre- 
quently in the next few sequences, and on Sequence 
29, A, after showing signs of severe conflict, made 
his first (and only) escape. At this point fearful 
and hostile attitudes were very nearly equal for 
each model, From here on until the end of the ex- 
periment a pattern of mutual hostility developed 
and gradually increased in strength: the interaction 
sequences would usually start with mild aggressive 
behavior, which would build up to a strong mutual 
aggression over a few trials. Once in a while B 
would retreat but usually he would stay and fight 
it out. At Sequence 50, when the experiment was 
terminated, hostile attitudes predominated in both 
models, no response other than an aggressive one 
had occutred in the last 5 sequences, and the rela- 
tionship appeared to have settled down into its 
terminal form. 


Experiment II: Both Attitudes Positive 


The next experiment to be described was similar, 
except that the two models were started off with 
fairly strongly positive attitudes. Each had a sec- 
ondary moderate level of hostility and lory fear. 
Things started off beautifully. During the first 26 
interaction sequences (one might describe it as the 
honeymoon) mutual approach behavior prevailed, 
positive attitudes got stronger and stronger, and 
what fear and hostility there was weakened. There 
were minor exceptions: during this time 5 sequences 
were ended by B avoiding, and 1 by A, and A 
showed hostility once, but in the great majority of 
sequences an initial mild positive approach by one 
or the other led only to more vigorous positive in- 
teraction. Potential trouble existed, however, in the 
fact that the models had not learned how to cope 
with aggression, and this trouble broke out on Se- 
quence 27 in the form of a long “lovers’ quarrel”. in 
which the two oscillated among attack, approach, 
and conflict behaviors. B was the main aggressor in 
the sequence. Both showed a rise in hostile attitudes 
as a result. Six sequences of mutual affection fol- 
lowed this incident, and then came another “quar- 
tel,” this time with A the main aggressor. Two se- 
quences later, on Sequence 37, there was a third, 
with aggression on both sides—still interspersed with 
affection and conflict. There were two more such 
episodes, on Sequences 43 and 51, before the models 
worked out a pattern of adjustment in which the 
threat of severe aggression was met with prompt 
withdrawal, and the threat of mild aggression 
averted by continued positive response. When the 
experiment was terminated at Sequence 78, the 
models had gone without serious trouble for 27 se- 
quences, safely passing several times through situa- 
tions which had earlier precipitated trouble. At this 
point hostile and fearful attitude levels were very 
low, and behavior almost totally positive. 
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Other Experiments Involving Attitudes 


Various other combinations of initial attitudes 
were also tried. In one experiment an A with posi- 
tive attitudes was paired with a B with negative 
attitudes. Initially, B attempted to withdraw from 
A, but as the latter persisted in his approaches, B 
eventually became hostile. A, frustrated, replied, and 
a pattern of mutual hostility developed and eventu- 
ally became dominant. 

In another experiment an A of generally positive 
attitudes was paired with a B of mixed attitudes. 
Persistent cyclic “quarrels” of the type described in 
Experiment II developed early and prevented a posi- 
tive relationship from becoming established. Again 
the models ended up in mutual hostility. 


Experiments Involving Trait Differences 


A number of experiments were run in which con- 
stants were adjusted in one or both models, pro- 
ducing trait changes. (These experiments were mostly 
started from the initial position of Experiment II; 
that is, with both models having predominantly but 
not exclusively positive attitudes.) A change in “cog- 
nitive style” in one model (a decreased tendency to- 
ward abstraction) appeared on the whole to have 
little effect on the course of the relationship. A 
change of constants to make one model more “con- 
servative” (slower memory change, less weight given 
to temporary moods) appeared if anything to make 
the relationship more stably positive. However, a 
change in one model in the constant regulating the 
intensity with which emotions were expressed in 
action proved to have seriously adverse effects on 
the relationship, The interaction began positively 
enough, but the more strongly acting model, its 
Positive approaches not fully reciprocated by the 
other, eventually reacted with hostility, which led 
to a deterioration of relations and ultimately to a 
terminal condition of mutual hostility. 


Experiments Involving Roles 


Some experiments were done in which one model 
was allowed to react freely, while the other carried 
out a specified role, Some of these experiments rep- 
resented efforts to see if a model starting out with 
hostile and fearful attitudes could be “reformed” 
by a suitable partner. In one such experiment, a 
model with negative attitudes was paired with a 
model (informally dubbed “Saint Aldous”) that 
Was set to respond in a uniformly positive way 
regardless of the first model’s acts, The saint, alas, 
bid fair to become a martyr: the first model re- 
Sponded to the situation by settling rapidly into a 
Pattern of vigorous and persistent attack behavior. 

In another experiment, the model with negative 
attitudes was paired with a model that was set to 
carry out a specified therapeutic role. The therapy 


» Selected was a highly directive one: withdrawal 


Was met by mild positive approach, mild aggression 
contained by stronger aggression, and strong ag- 
gression neutralized by mild withdrawal. Neutral, 


. “mbivalent, or positive acts were given strong posi- 


tive support, The outcome of this “therapy” was a 
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marked change toward positive attitudes on the 
part of the treated model in two such experiments 
run. In both cases, however, the model, although 
responding strongly positively in the therapeutic 
relationship, ran into difficulties when later placed 
on its own with a normally responsive partner, In 
one of the experiments the model was then returned 
to interact further with the “therapist,” after which 
it was again paired with a normal model, this time 
with a successful outcome. 


Experiments Involving Situational Variables 


Several experiments were run to investigate the 
effects of chance events on interaction outcomes. 
This was done by varying the series of random 
numbers used in the selection of the initial acts of 
sequences, The results suggested that chance factors 
could indeed have significant effects on the inter- 
action—in at least one instance a change of random 
numbers led to several adverse encounters early in 
the series that reversed the outcome of the experi- 
ment. In some other instances the outcome remained 
the same, but was considerably retarded or ad- 
vanced by fortuitous events. While the evidence on 
the point was not very extensive, it appeared that 
chance events tended to have their greatest impact 
early in the series of encounters between two 
models. 

Another situational factor investigated was the 
cutoff point imposed on indeterminate sequences, 
which can be thought of as representing interrup- 
tion by events external to the interacting pair. Two 
different cutoff points were used: in the basic ex- 
periments an indeterminate sequence was stopped 
after 40 responses, but in a number of other experi- 
ments a cutoff after 10 responses was used. The 
major effect of the quicker interruption was to 
terminate some adversely developing sequences be- 
fore their effects became too severe. In two experi- 
ments a relationship that was not viable under a 
long cutoff survived and stabilized under a short 
one; in three others the short cutoff seemed to have 
little effect, beyond spreading the same number of 
trials out over more sequences. 


Discussion 


The experiments described in the present paper 
obviously do not assert any new facts about hu- 
man interpersonal behavior, nor, indeed, do 
they even represent a comprehensive exploration 
of the properties of a particular model, They 
do, however, suggest that a computer model 
originally constructed to represent individual 
personality may under appropriate conditions 
produce behavior sequences bearing at least 
some resemblance to those occurring in human 
interactions, and showing sensitivity to at least 
some of the same kinds of variables. Despite the 
limitations of both model and experiments, one 
or two general features of the present results 
may be worth brief further comment. 
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In the first place, while the relationship be- 
tween the two models appeared to approach 
some asymptotic condition in each experiment, 
movement toward this condition was by no 
means always—or even usually—a simple, linear 
affair, Rather it occurred in stages; one phase of 
the relationship giving place to another, often 
quite different phase, and this in turn to further 
phases. Such differing stages in a relationship 
are obviously characteristic of much human 
interaction, In the present instance they are not 
a conspicuous characteristic of the personality 
model itself, which behaves quite stably in a 
generalized environment (Loehlin, 1963), but 
appear rather to arise as a result of the shifting 
balance of relationships in the interaction situa- 
tion. When the behavior of one of the models is 
constrained (as in the role experiments) such 
shifts are less frequent, although some still occur. 

Second, it proved much easier to introduce 
conditions that would lead to a negative relation- 
ship than to a positive one; it was also much 
commoner for a potentially positive relationship 
to be disrupted by adverse events than for a 
negative trend to be reversed by chance favor- 
able encounters. At best, a positive relationship 
between two models was a somewhat delicate 
affair to establish, although once well developed 
it could survive some degree of adversity. In 
computer model interpersonal relations, as in 
other complex systems, there are more ways for 
things to go wrong than to go right. 

There was a marked tendency for negative 
relationships in the present experiments to ter- 
minate as hostility rather than fear. This may 
have been to some extent a result of the greater 
symmetry and hence stability of a hostile rela- 
tionship, but was doubtless also a function of 
the experimental procedure, which repeatedly 
brought the two partners into contact and did not 
permit either to withdraw permanently from the 
situation. 

Some possible extensions of the present ex- 
periments are obvious—for example, to three or 
more interacting models, Such experiments in- 
troduce certain additional problems—such as the 
order of noticing and responding to the acts of 
the various others—which are conveniently by- 
passed in the two-person case. At the same time, 
new phenomena, such as subgroup formation, 
can emerge (cf, Gullahorn & Gullahorn, 1964). 
Probably of greater urgency than such exten- 
sion, however, is the development of techniques 
for matching models to the parameters of par- 
ticular individuals, to permit direct empirical 
comparisons between model interactions and 
person interactions. A particularly intriguing 
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possibility for such matching would be by direct 
interaction of subject with computer—the com- 
puter systematically adjusting the parameters of 
the model until it predicts the subject’s re- 
sponses. However, such a method poses a num- 
ber of practical problems, and conventional 
psychometric techniques probably offer more im- 
mediate promise as a means of assessing indi- 
vidual parameters. Alternatively, one could avoid 
the problem of matching models to individuals 
by an approach using samples of subjects and 
manipulating situational variables (such as con- 
straint to return to the situation, or frequency 
of interruption). In any of these approaches the 
outcomes of particular computer experiments 
represent hypotheses—ultimately testable against 
the empirical data of human interpersonal be- 
havior. 
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AUTHORITARIANISM, DISPLACED AGGRESSION, AND 
SOCIAL STATUS OF THE TARGET 
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Personality and situational parameters related to displaced aggression were 
evaluated by investigating the following independent variables upon aggression 


(the administration of shock 
authoritarianism, (c) S’s sex, 


xX 2 factorial design was employed with 


in a serial learning task): 
and (d) victim’s socioeconomic status. A2X2X2 
5 Ss in each of 16 experimental condi- 


(a) frustration, (b) 


tions. The results generally confirmed the predictions that: (a) high authori- 
tarians (Fs) would be more aggressive than lows towards both high- and low- 


status targets; 
towards the low-status relative to 
constituted the major target for the 


Although the displacement hypothesis is a 
cornerstone of major conceptualizations regard- 
ing aggression, its vagueness regarding both the 
personality traits of the potential aggressor and 
the stimulus characteristics of the target tends 
to limit its explanatory power. As a result of this 
ambiguity, experimental investigations of dis- 
placed aggression have yielded contradictory and 
inconsistent results (Cowen, Landes, & Schaet, 
1959; Stagner & Congdon, 1955). Recent re- 
search, however, indicates the potential useful- 
ness of exploring relevant personality character- 
istics: generalized prejudice (Berkowitz, 1962; 
Stricker, 1963), anti-Semitism (Weatherley, 
1963) as well as the stimulus characteristics of 
the target: prior dislike (Berkowitz & Howes, 
1959), 

_The current study assumes that maximal pre- 
diction may be obtained through the simultane- 
ous investigation of personality and situational 
parameters within a systematic theoretical frame 
of reference. A major assumption in the Au- 
thoritarian Personality (Adorno, Frenkel-Bruns- 
wik, Levinson, & Sanford, 1950) is that authori- 
tarian attitudes result from harsh and punitive 
Parental discipline. Since these practices may 
result in an excessive sensitization to power re- 
lations of strong versus weak, superior versus 
inferior, as these dimensions may be culturally 
defined by social status, it is likely that punitive 
discipline will signify that it is relatively more 
Justifiable to aggress against low-status as com- 


. Pared to high-status persons. This conceptuali- 


zation leads to the following hypotheses: rela- 
tive to low authoritarians (Fs) high Fs will dis- 
pace more aggression towards both high- and 
-low-status targets, and whereas the high Fs will 


interaction points to the conditional nature of these results. 
tionship between frustration and aggression was attributed primarily to the 
inhibitory tendencies aroused in college students by task failure. 


(b) relative to low Fs, highs would express more aggression 
the high-status target. The high-status target 
low authoritarians. A significant 3rd-order 


The inverse rela- 


displace more aggression towards a low- relative 
to a high-status target, the low F's displaced 
aggression will not be differentially influenced 
by the victim’s social status. 

Whereas a converging body of evidence indi- 
cates that high-status frustraters tend to arouse 
inhibitions regarding retaliatory aggression (Co- 
hen, 1955; Roberts & Jessor, 1958), data regard- 
ing the relationship between the victim’s social 
status and displaced aggression are virtually non- 
existent. 

Furthermore, since the validity of the F Scale 
continues to be a focus of controversy, a signifi- 
cant goal of this research is to evaluate the pre- 
dictive or construct validity of the F Scale 
within a behavioral situation. Although a great 
deal of debate has centered on the relative con- 
tribution of response style versus content vari- 
ance to responses to the F Scale (Jackson & 
Messick, 1958), few attempts have been made 
to resolve this controversy empirically. 


METHOD 
Subjects 


Authoritarianism was measured by the 30-item F 
Scale, which was group administered to 240 under- 
graduate students enrolled in introductory psy- 
chology courses at Wayne State University. The 
high authoritarianism group (N =40) consisted of 
the 20 highest male and female scorers, whereas the 
low group (W=40) consisted of the 20 lowest male 
and female scorers, Thus, in addition to authori- 
tarianism the major independent variables were: 
level of frustration, sex of the subject, and socio- 
economic status of the victim: high versus low. 
Thus, a 2X2X2X2 factorial design was em- 
ployed with five subjects in each of 16 experimental 
conditions. 
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TABLE 1 


MEANS AND VARIABILITIES OF SHOCK INTENSITIES BY 
EXPERIMENTAL CONDITION 


Frustration No frustration 
Condition 
High Low High Low 
status status status status 
Males 
HighF | 1314 128.0 122.8 148.0 
(12.6)* | (16.4) (9.6) (11.4) 
Low.F 90.6 122.2 139.6 115.0 
(7.8) (23.6) (22.8) (17.2) 
Females 
High F 114.4 109.6 125.4 139.2 
(6.8) (7.4) (9.0) (6.6) 
Low F 101.8 92.1 115.8 104.0 
(15.0) (9.8) (4.4) (12.8) 


a Standard deviations are enclosed in parentheses. 


Behavioral Situation 


Aggression was defined operationally as a response 
which delivers noxious stimuli to another person, A 
modified “aggression machine” (Buss, 1961) was 
employed so that varying intensities of electric 
shock could be administered to a “victim.” Each 
subject was told that he would play the role of an 
experimenter in a study on the effects of shock upon 
learning. He was also told that another subject (an 
experimental accomplice acting as a victim seated 
behind a one-way vision mirror) would be presented 
with a serial learning task on a memory drum, The 
subject was requested to shock the victim whenever 
he incorrectly anticipated a stimulus word during 
the serial presentations, The subject was given a 
copy of a 10-item, meaningful word list which was 
presented to the victim via the memory drum. At 
this point, the subject was introduced to a small 
box which ostensibly controlled 5 shock intensities. 
Each intensity could be activated by pressing one 
of five buttons clearly labeled from low to high. It 
was emphasized that the subject could press any one 
of these buttons during the experiment. 

Depending on the appropriate condition, a low- 
or high-status male victim was introduced to the 
subject. In the low-status condition, the victim 
wore old, disheveled clothes and responded to an 
orally administered questionnaire so as to reveal the 
following information about himself within hearing 
distance of the naive subject: family income, less 
than $1500 per annum; parental occupation, unem- 
ployed; parental education, grade school; subject’s 
future plans, drop out of school. In the high-status 
position, the same accomplice appeared well dressed 
and responded in the following manner: family in- 
come, $20,000 per annum; parental occupation, 
vice-president of an advertising firm; parental ed- 
ucation, college graduate; ‘future goal, graduate de- 
gree in business administration. In order to maintain 
the subjects’ naivete, they were told the data 
regarding the victim’s social background would be 
correlated with his learning ability. 
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Several procedures were employed in order to 
convince the subject that the aggression machine 
was operative. Prior to the first trial, the subjects 
were encouraged to touch the victim’s electrodes 
and receive a shock which represented the lowest 
intensity. In addition, the subject observed the ex- 
perimenter carefully place the electrodes on the 
victim’s wrists and fingertips. Prior to the experi- 
ment, the accomplice was introduced to the subject 
as another subject. A rigged coin toss was employed 
to determine which subject would be the victim 
and inevitably, of course, the accomplice “lost.” 

The victim made a programed series of responses 
such that 32 shocks were given for incorrect re- 
sponses in the 7-trial learning series. Unknown to 
the subject, however, a switch in the experimenter’s 
room was pulled so that no shock was actually 
delivered. Recordings of the shock intensities were 
made by observing a series of differentially colored 
lights located in the experimenter’s room and wired 
to the aggression machine. 


Frustration Manipulations 


Prior to the behavioral situation, half of the 
sample was presented with the frustration condition 
which consisted of the Cowen et al. (1959) dot 
problem, the Block Design and Digit Symbol sub- 
tests of the Wechsler Adult Intelligence Scale. The 
subjects were told that these were valid measures of 
mental ability which are usually administéfed in 
order to predict academic achievement. These tests 
were now being administered to college students in 
order to collect further normative data. In order 
to maximize the frustrating effects of these tasks, 
inadequate time limits were established so that 
failure was inevitable and the experimenter reacted 
with surprise and amazement at the subjects’ fail- 
ures. In order to divorce the frustration condition 


TABLE 2 


ANALYSIS OF VARIANCE OF AGGRESSION SCORES AS A 
FUNCTION OF AUTHORITARIANISM, FRUSTRATION, 
Sex, AND SOCIAL STATUS : 


Source df MS F 
Authoritarianism (A)| 1 | 5,695.9 | 39.18% 
Frustration (B) 1 4,667.1 32.10** 
Sex (C) 1 | 3,063.4 21.07% 
Status (D) 1 128.1 — 
AXB 1 52.8 T 
AXC 1 21.0 = 
AXD 1 750.3 5.16* 
BXC 1 90.8 = 
BXD 1 70.3 = 
CXD 1 465.0 3.20 
AXBXC 1 449.1 3.09 
AXBXD 1 | 3,237.3 | 22.26% 
BXxCxD 1 533.9 3.80 
AXCXD 1 102.3 = 
AXBXCXD 1 | 3,590.8 | 24.69% 
Within 64 145.4 

*p <05 
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Frustration, Hi Status 
Frustration, Low Status 
No Frustration, Low Status 
No Frustration, Hi Status 


High 


AUTHORITARIANISM 


Fic. 1. Effects of authoritarianism, frustration, and status upon aggression. 


from the shock condition, subjects were told that 
they evould participate in two separate studies: the 
first involving the administration of mental ability 
tests, and the second a study of the learning proc- 
ess, 


RESULTS 


For the purpose of intergroup comparisons, 
Table 1 summarizes the means and standard 
deviations of the shock intensities for the 16 
experimental groups. The assumption of homo- 
geneity of variance was tested by the Fmax test 
(Walker & Lev, 1953) and on the basis of this 
analysis (F = 28.7, df = 16/4, p> .05), it was 
concluded that this basic requirement for em- 
ploying parametric tests of significance had been 
fulfilled. The analysis of variance is summarized 
in Table 2. 

A significant first-order interaction between 
social status and authoritarianism upon shock 
intensity was obtained (F = 5.16, df = 1/64, p 
<.05). With the exception of the “nonfrus- 
trated, male group,” this interaction suggests that 
the high Fs were more aggressive towards both 
the high-status victim (t= 2.52, p < .05) and 
the low-status victim (t= 5.76, p<.01), rela- 
tive to the low Fs. However, the significant in- 
teraction basically stems from the high F's 
greater aggression towards the low-status victim 
relative to the high-status victim. 

Table 2 indicates a significant second-order 


. «interaction between authoritarianism, frustra- 


tion level, and the victim’s social status upon 
level of shock. The nature of this interaction is 
graphically represented in Figure 1. 

Analysis of this interaction indicates that 
under the nonfrustration condition, high Fs were 
more aggressive towards the low-status victim 
relative to the high-status victim (¢= 5.94, p < 
.01), whereas low Fs were more aggressive 
towards the high rather than the low-status vic- 
tim (= 2.99, »<.05). Furthermore, low Fs 
expressed significantly more aggression towards 
the high-status victim (ż= 5.39, p<.05), 
whereas high Fs expressed more aggression 
towards the low-status victim under the “no- 
frustration” condition as compared with the 
“frustration” condition (t= 6.4, p< .01). In 
summary, a pattern emerges whereby high and 
low Fs were less aggressive under the frustra- 
tion condition relative to nonfrustrated subjects; 
differential aggressiveness as a function of the 
victim’s status was minimal under the frustra- 
tion relative to the nonfrustration condition; and 
under the no-frustration condition, the targets of 
aggression were reversed for these groups—high 
Fs expressed more aggression towards the low- 
status victim, low Fs expressed more aggression 
towards the high-status victim. The third-order 
interaction suggests the necessity for a cautious 
evaluation of these results. The finding that high 
Fs were more aggressive towards high- and low- 
status victims relative to low Fs does not hold 
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for the nonfrustrated male condition. Further- 
more, the finding that high and low Fs were less 
aggressive under the frustration relative to the 
nonfrustration condition does not hold for males. 
,Although the highest order interaction is not 
readily explicable, it does serve to highlight the 
complex factors related to aggression. 


Discussion 


The prediction that high Fs would be generally 
more aggressive is supported insofar as the highs 
generally expressed more aggression towards 
both the high- and low-status targets. This result 
is congruent with previous research (Anisfeld, 
Munoz, & Lambert, 1963; Berkowitz, 1962) 
which indicates that authoritarian hostility is a 
generalized phenomenon across situations. The 
finding that high Fs expressed more aggression 
towards the low-status victim, whereas the high- 
status victim constituted the primary target for 
the low Fs suggests the inappropriateness of con- 
ceptualizing authoritarianism as a unidimensional 
concept whereby the personality characteristics 
of low scorers are viewed as the obverse of high 
scorers, These results suggest that both highs 
and lows may be sensitized to the social status 
of others. However, the direction of the hostil- 
ity associated with this sensitivity may differ 
between high and low Fs: the former group ex- 
pressed more aggression towards a low-status vic- 
tim whereas the latter group’s hostility was di- 
rected towards the high-status victim. These 
findings support the assumption that some low 
scores on the F Scale may be indicative of an 
“authoritarianism of the left” which is charac- 
terized by hostility and rejection of groups that 
appear economically privileged or powerful 
(Shils, 1954). 

Two unexpected findings in this study were 
an inverse relationship between frustration and 
aggression, and minimal difference in aggressive- 
ness as a function of the victim’s status under 
the frustration relative to the no-frustration con- 
dition. These results indicate the desirability of 
reeyaluating the direct and coercive relationship 
between frustration and aggression which has 
been postulated by the Yale group (Dollard, 
Doob, Miller, Mowrer, & Sears, 1939). This re- 
evaluation has already been stimulated by in- 
consistent findings regarding the effects of frus- 
tration upon aggression (Buss, 1961). A variety 
of conditions may have contributed to the in- 
verse relationship between frustration and dis- 
placed hostility. Due to prior socialization ex- 
periences which encourage responsibility for 
failure on intellectual tasks, college students 
are likely to view such failure as a function of 
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their own incompetence rather than as an arbi- 
trarily induced frustration. The nonarbitrarily 
perceived frustration is likely to inhibit extra- 
punitive aggression (Pastore, 1952). Whereas 
the displacement hypothesis has been supported 
by studies in which the frustrater’s aggressive 
behavior provided cues which indicated that ag- 
gression towards the target might be viewed as 
appropriate by the experimenter, the frustrater 
in the current study did not constitute an ag- 
gressive model which the subject might imitate 
in relation to the target. Furthermore, it is possi- 
ble,that subjects who have experienced frustra- 
tion, defined in terms of failure on intellectual 
tasks, will fail to view their aggressive behavior 
as potentially effective or instrumental in ena- 
bling another person to copy with a learning 
problem. Thus, task failure might result in some 
reluctance to employ high shock intensities. In 
general, these findings support the assumptions 
that frustration may result in the inhibition of 
extrapunitive aggression through the induction of 
anxiety, guilt, or other intropunitive tendencies; 
and frustration may increase the probability of 
aggression against only specific rather than all 
targets. 

A major controversial issue involves the, rela- 
tionship between attitude structure and overt 
acts (Adorno et al., 1950; LaPiere, 1934). The 
current study suggests the significance of situa- 
tional variable, namely, the target’s social status, 
in determining the relationship between a spe- 
cific attitudinal structure, that is, authoritarian- 
ism, and overt acts such as aggression. In con- 
clusion, this study points to the usefulness of 
exploring the joint influence of relatively stable 
psychological characteristics and situational fac- 
tors upon the development of aggression. 
Whereas previous research (Epstein & Komo- 
rita, 1965) has demonstrated that the social 
status of an outgroup may contribute to the 
development of prejudicial attitudes, the cur- 
rent study indicates that aggressive acts may 
also develop as a function of the social status of 
the victim. A major implication of these results 
for the modification and control of aggressive 
behavior is that the objective or perceived social 
status of the target must be changed as well as 
the psychological functioning of the so-called 
authoritarian individual. 
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AVOIDANCE OF DISSONANT INFORMATION * 


JUDSON MILLS? 
University of Missouri 


Data which point to an explanation for a previous failure to show avoidance of 
dissonant information and which provide evidence that dissonant information 
is avoided were collected in 2 experiments with the same basic procedure. Col- 
lege women ranked different products according to desirability, made a choice 
between 2, and then rated their interest in reading advertisements for each of the 
. products. The rankings of desirability of the products and the ratings of inter- 
est in ads about them were highly correlated. When the ratings of interest for 
the chosen and rejected products were adjusted to eliminate differences due to 
their initial desirability, interest in ads for the chosen was positive; interest for 
the rejected was negative, that is, less than would be expected from the de- 


sirability of the product. 


In their comprehensive review of the evidence 
Concerning Festinger’s (1957) theory of cogni- 
tive dissonance, Brehm and Cohen (1962) note 


+ This study was supported by a grant from the 
National Science Foundation and was also aided 
by a United States Public Health Service Special 
Research Fellowship awarded to the author. The 
author is indebted to Patricia L’Ecuse and Lucinda 
Lade for their assistance in collecting the data and 


to M. H. Quenouille and Ezra Stotland for helpful 


povice concerning the analysis. 
Formerly at Syracuse University. 


that there is one apparent disconfirmation of 
the theory, dealing with avoidance of dissonant 
information. Concerning the studies of exposure 
to information by Ehrlich, Guttman, Schönbach, 
and Mills (1957), Mills, Aronson, and Robin- 
son (1959), Rosen (1961), and Adams (1961), 
they state, 


In each of these studies, while subjects sought out 
dissonance-reducing information, they did not neces- 
sarily avoid dissonance-increasing information [p. 
93]. 
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In order to show that dissonant information 
is avoided, one must compare exposure to in- 
formation which is dissonant with exposure to 
information which is neutral, that is, neither 
consonant nor dissonant. Of the previous studies, 
only the one by Ehrlich et al. attempted such a 
comparison. Ehrlich et al. investigated reader- 
ship of automobile advertising following the 
purchase of a new car and found that reader- 
ship of advertisements for the chosen car was 
significantly greater than readership of adver- 
tisements for other cars. In addition they also 
compared readership of advertisements for the 
rejected cars with readership of advertisements 
for cars which were not considered. Since favor- 
able aspects of the rejected cars were dissonant 
with the choice while favorable aspects of cars 
which were not considered were irrelevant, it 
was expected from dissonance theory that read- 
ership of advertisements for the rejected cars 
would be less than readership of advertisements 
for the cars not considered. The results were 
in the opposite direction. However, the differ- 
ence was relatively small and nonsignificant. 

Data from the present study suggest an 
explanation for the failure of Ehrlich et al. to 
find less readership of ads about rejected cars 
than of ads about cars which were not con- 
sidered, A very strong relationship was found 
in the present study between the desirability of 
products and interest in reading ads about the 
products. The new car owners in the Ehrlich et 
al. study probably regarded the cars they con- 
sidered but rejected as more desirable than the 
cars which they never considered. This could 
have increased their readership of ads about the 
rejected cars enough to offset a decrease in 
readership resulting from avoidance of dis- 
sonance, 

In the present study, which was designed to 
investigate some of the determinants of interest 
jn information, differences due to the initial 
desirability of the products were eliminated 
from the measures of interest in ads for chosen 
and rejected products. 


EXPERIMENT I 
Method 


The experiment was conducted under the guise 
of a market research project, as in a study by 
Brehm (1956). After the subjects ranked 10 dif- 
ferent products according to desirability, they made 
a choice between 2 of them and then rated their 
interest in reading advertisements for each of the 
10 products. The 2 products between which the 
subject could choose and the instructions concern- 
ing the difficulty of changing the choice were varied 
for different subjects. 
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The subjects were 70 women attending a uni- 
versity summer session. They were interviewed in 
their dormitory rooms. After introducing herself and 
eliciting the subject’s cooperation, the experimenter 
explained that she was trying to find out how 
popular some new products for personal care will 
be. The subject was asked to read cards describing 
10 products and then to rank the products “accord- 
ing to how much you personally like them.” The 
products were: hand cream, .cleansing cream, 
deodorant, creme hair rinse, hair spray, toothpaste, 
wave-set lotion, shampoo, face soap, mouthwash. 
The following is an example of one of the 
descriptions: 


Cleansing cream. This facial cream reaches deep 
down dirt which soaps skip ov», It will leave the 
delicate skin of the face soft and clean, Special 
action with water serves to close pores to further 
dirt. 


After the ranking of desirability was completed, 
the experimenter told the subject that for helping 
with the study everyone receives a free gift. The 
subject was given a choice between 2 of the 10 
products. Some of the subjects were given a choice 
between the products they had ranked second and 
third in desirability, others between the products 
ranked second and ninth, and others between the 
products ranked eighth and ninth. In an attempt 
to vary the difficulty of changing the choise, to 
some of the subjects the experimenter said, 


Please write your choice here and then sign your 
name. Once you choose it will be impossible to 
change. (impossible variation) 


To other subjects the experimenter said, 


Please write your choice here and then sign your 
name. Once you choose it will be very difficult to 
change but if you should decide that you would 
definitely prefer the other product you can write 
to this address indicating the change. There will 
be a handling charge of 75 cents in that case. 
(difficult variation) è 


And to other subjects the experimenter said, 


It wont be necessary for you to tell me your 
choice now. I'll contact you in a day or two to 
find out which you would prefer. I’m sort of 
curious though. Do you have any idea which of 
the two you would prefer? That isn’t at all 


binding, of course. (easy variation) ` 


Because the time during which data could be 
collected was limited, only six of the nine possible 
combinations were run. They were: second versus 
third, impossible; second versus third, difficult; 
second versus third, easy; second versus ninth, im- 


possible; second versus ninth, easy; eighth versus ~ 


ninth, impossible. The subjects were randomly 
assigned to one of the six choice variations. i 
Following the subject’s choice, the experimenter 


went on to say that she was also trying to find out . 
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just how much interest people will have in seeing 
ads for the different products so that plans could 
zbe made for how much each of them should be 
advertised. The subject was given a rating sheet 
and asked to indicate her interest in reading ads 
. about each of the 10 products by circling a number 
from O (not at all interested) to 20 (extremely 
interested). 
At the conclusion, the experimenter mentioned 


™ to the subject that there was something more to 
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the study than she had been told and asked if she 
had any idea what it was. The replies of 4 subjects 
indicated suspicion about the procedure. These 4 
subjects were eliminated from the analysis, leaving 
11 subjects in each of the six choice variations. 
. Finally the experimenter explained the true purpose 
and got the subject to promise not to discuss the 
experiment with anyone. 


_ Results 


» Examination of the data revealed a very 
strong relationship between the initial rankings 
of the desirability of the products and the 
ratings of interest in reading ads about the 
products, The mean rating of interest over all 
66 subjects was 15.4 for whichever product the 
Subject ranked first, 12.1 for whichever product 
was ranked fourth, 11.0 for whichever was 
ranked’ fifth, 11.0 for whichever was sixth, 8.2 
for whichever was seventh, and 5.3 for which- 
ever was tenth. Correlations between the rank- 
ings of desirability and ratings of interest for 
the eight products not involved in the choice 
were calculated separately for each subject. The 
average correlation for all the subjects was de- 
termined with the use of the z transformation. 
Tt was —.67. Sixty of the 66 separate correla- 
tions were negative. 

The strong relationship between the rankings 
of desirability and the ratings of interest in 


ba reading ads makes inappropriate a direct com- 


parison of the ratings for the chosen product 
and the ratings for the rejected product with 
the ratings for other products. Differences in 
interest could be merely the result of differences 
in the initial desirability of the products. 

For this reason the ratings of interest in ads 
_ about the chosen and rejected products were 
adjusted to eliminate differences due to the 
` initial ranking of the products. This was done 
in the following way. The linear regression of 
the ratings on the rankings for the eight prod- 


~~ UctS\not involved in the choice was calculated 


oe ately for each subject. On the basis of 
y a Tegression equations, predicted ratings were 
anes for each subject for the rankings of 
cm chosen and rejected products. The adjusted 

gs were determined by subtracting the pre- 


_ dicted ratings from the actual ratings. A positive 
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TABLE 1 


MEANS OF ADJUSTED RATINGS OF INTEREST IN ADs 
FOR CHOSEN AND REJECTED PRODUCTS 


Difficulty of changing 
Alternatives and ads for 
Impossible | Difficult Easy 
Experiment I 
Second versus third 
Chosen +1.60* —.13" —.29" 
Rejected —.05* —,.999 | —2.40* 
Second versus ninth 
Chosen +1.48" —.50* 
Rejected —.35* — 1.06" 
Eighth versus ninth 
Chosen +1.26" 
Rejected — 4.415 
Experiment IT 
Second versus third 
Chosen +1.38 | +1.71° | +1.59 
Rejected —1,50> —.90° | —2.39> 


Based on 11 subjects. 
b Based on 24 subjects. 
e Based on 25 subjects. 


adjusted rating indicates greater interest in 
reading ads about the product than would be 
expected from the initial desirability of the 
product. A negative adjusted rating indicates less 
interest than would be expected from the initial 
desirability of the product. 

An illustration may help to clarify the pro- 
cedure. The data for the first subject in the 
second versus third, easy variation were as 
follows: 


Ranking 1 2 3 4 5 6 7 8 910 
Rating 1513101012 5 6 8 9 1 


The product ranked second was chosen; the one 
ranked third rejected? The b value for the 
regression equation based on the other eight 
products was —1.18. The predicted rating for 
the product ranked second was 13.27, for the 
one ranked third, 12.09. The adjusted ratings 
for the chosen and rejected products were —.27 
and —2.09, respectively. 

Means of the adjusted ratings of interest in 
ads for the chosen and rejected products for the 
six choice variations are presented in Table 1, 
under the heading Experiment I. Analyses of 
variance revealed no significant differences be- 
tween the six choice variations for the adjusted 
ratings for either the chosen or the rejected 


8 The lower ranked alternative was chosen by 22 
of the 66 subjects. They were distributed as follows: 
second versus third, impossible, 5; second versus 
third, difficult, 5; second versus third, easy, 8; second 
versus ninth, impossible, 1; second versus ninth, 
easy, 1; eighth versus ninth, impossible, 2. 
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products. The mean of the adjusted ratings for 
the chosen product for all 66 subjects combined 
was +.58. The overall mean of the adjusted 
ratings for the rejected product was —1.54. The 
overall mean for the chosen product did not 
differ significantly from 0. However, the overall 
mean for the rejected product was significantly 
less than 0 (p <.05, two-tailed), It can be seen 
from Table 1 that the means of the adjusted 
ratings for the rejected product were negative 
in all six of the choice variations. 

Although there were no significant differences 
between the choice variations in Experiment I, 
there was one trend in the data that was quite 
striking. As can be seen from Table 1, interest 
in ads for the chosen product was consistently 
positive in the three impossible variations; it 
was slightly negative in the one difficult varia- 
tion and the two easy variations. In order to 
verify this trend in the ratings of interest for 
the chosen product, a second experiment was 
conducted. 


EXPERIMENT II 


The procedure for Experiment II was exactly 
the same as that for Experiment I except that 
all subjects were given a choice between the 
products ranked second and third in desirability, 
only the instructions concerning the difficulty of 
changing the choice were varied for different 
subjects. The subjects were 73 freshman women 
who were interviewed during the regular school 
year, Twenty-three of the subjects were ran- 
domly assigned to the impossible variation; 24 
to the difficult variation, and 23 to the easy 
variation. The instructions for the impossible, 
difficult, and easy variations were exactly as 
before, as was the rest of the procedure. A dif- 
ferent woman played the role of the experi- 
menter. None of the subjects gave any indication 
of suspicion about the procedure. 

As in Experiment I, there was a strong rela- 
tionship between the rankings of the desira- 
bility of the products and the ratings of interest 
in reading ads about them. The mean rating of 
interest over all 73 subjects was 15.4 for which- 
ever product was ranked first, 13.1 for whichever 
was ranked fourth, 11.3 for whichever was fifth, 
11.8 for whichever was sixth, 10.9 for whichever 
was seventh, and 7.2 for whichever was tenth. 
The average correlation between the rankings of 
desirability and the ratings of interest for the 
eight products not involved in the choice was 
—.61, Sixty-eight of the 73 separate correlations 
‘were negative. 

Means of the adjusted ratings of interest in 
ads for the chosen and rejected products for the 
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three variations in Experiment II are presented 
in Table 1, under the heading Experiment IL 
Analyses of variance revealed no significant dif- 
ferences between the three choice variations in 
Experiment II for either the chosen or rejected | 
products, As can be seen from Table 1, the trend 
observed in Experiment I of more positive inter- 
est for the chosen product in the impossible 
variations did not occur in Experiment II; the 
means for the three choice variations were quite 
similar, 

The overall mean of the adjusted ratings for 
the chosen product was +1.56, which was sig- 
nificantly greater than O (p< .01). When the 
p value was combined with the p value for the 
same comparison in Experiment I by Stouffer's: 
method (Mosteller & Bush, 1954), the resulti 
p was < .01. The overall mean of the adjusted J 
ratings of interest for the rejected product was 
—1.59, quite close to the overall mean for 
Experiment I which was —1.54. The overall 
mean for Experiment II was significantly less 
than 0 (p<.02) and the combined p for the 
two experiments was < .01. 

It should be noted that the differences in the 
adjusted ratings for the chosen and rejected 
products cannot be explained by assuming that 
the true relationship between the rankings of 
desirability and the ratings of interest is non- 
linear and that the proper expected ratings for 
the second and eighth ranked products are really 
higher than those predicted on the basis of 
linear regression, while the proper expected 
ratings for the third and ninth ranked products 
are lower. If this assumption were true, then 
the adjusted ratings for the second and eighth 
ranked products should be positive when those 
products were rejected and the adjusted ratings 
for the third and ninth ranked products, negative | 
when those were chosen. This did not occur. The 
mean of the adjusted ratings for the second 
ranked product was +.76 when it was chosen 
(N=74, Experiments I and II combined) and 
—1.16 when it was rejected (V=54). The 
mean for the third ranked product was +1.55 
when it was chosen (N =52) and —1.99 when’ 
it was rejected (N = 54), The corresponding _ 
means for the eighth ranked product were +1.77 
(N=9) and —8.02 (N=2); for the ninth 
ranked product, —.06 (N=4) and —1.08 , 
(N =29). ʻi 


4 Twenty-four of the 73 subjects in Experiment 
II chose the lower ranked alternative; 14 in the 
second versus third, impossible variation; 8 in the 
second versus third, difficult variation; and 12 in í 
the second versus third, easy variation. 
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The results of the two experiments establish 
that dissonant information is avoided and also 
provide additional evidence of the tendency to 
seek out consonant information. Unfortunately, 
they do not shed much light on the factors which 
influence the magnitude of the tendencies to seek 
consonant and avoid dissonant information. 
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A STUDY OF POSTURAL-GESTURAL COMMUNICATION * 


B. G. ROSENBERG AND 


Bowling Green State University 


JONAS LANGER 
University of California, Berkeley 


The present study examined the communicative value of postural-gestural 
expression, 25 stick figures varying in posture were matched by 87 Ss to 6 
referent dimensions (feeling, achromatic color, chromatic color, vertical direc- 
tion, horizontal direction, and stability). A high degree of consistency obtained 
between Ss’ matchings. The results were compared to a previous study in 
which Ss consistently matched vocal-gestures to similar referent choices. It was 
concluded that nonverbal means of expression may be used to support verbal- 


linguistic communication. 


It has long been accepted that postural-ges- 
tural expressions antedate verbal-linguistic com- 
Munication in ontogenesis. Such gestures may 
Serve adjunctive or substitutive functions in 
adult communication (cf. Buhler, 1934; Paget, 
1927; Werner, 1948; Wundt, 1912). Notwith- 
Standing the widespread agreement regarding 
the importance of nonverbal communication, 
there is little systematic evidence regarding the 
communicative value of postural-gestural expres- 
Sion. Studies of interaction analysis (Miller & 
Swanson, 1960), psychotherapy (Ekman, 1964; 
Sainesbury, 1955), and gestural-motor activity 
(Piaget, 1951) attest to the increasing interest in 
this area. 

Some of the earliest investigations of postural- 
gestures focused upon such behavior as a source 


* This Study was conducted while the first author 
Ha a Visiting Associate Professor of Psychology at 
e University of California, Berkeley, 1962-63. 


of inference regarding personality structure (All- 
port & Vernon, 1933; Darwin, 1929). Later 
studies of gestural behavior emphasized its use 
as expressive style (Reich, 1949; Wolff, 1943), 
perceptual preference (Roe, 1951), underlying 
defenses and dynamics (Allport & Vernon, 1933; 
Wallach & Thomas, 1963), etc., rather than as a 
communicative code per se. 

The present study is based upon the assump- 
tion that gestures may be used as a relatively 
uniform, albeit rudimentary, means of communi- 
cation. It seeks to assess the communicative value 
of postural-type gestures when all other charac- 
teristics of the human situation are removed. It 
was hypothesized that these gestures would be 
consistently matched with various dimensions of 
referent choice when subjects are asked which 
they represent. The dimensions utilized were 
those for which consistent agreement obtained in 
matching them with vocal-type gestures (Langer 
& Rosenberg, 1964). 
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TABLE 1 
NUMBER or STICK FIGURES SIGNIFICANTLY MATCHED WITH REFERENT DIMENSIONS 


Color Direction 
py aumo aot ems [Feeling Flighty-stable 
Chromatic | Achromatic Up-down Fonasa 
ps 05 18 22 22 20 24 18 
p> 05 7 3 3 5 1 7 
METHOD Subjects 


Instrument Eighty-seven advanced undergraduates at the 
The Stick Figure Test (Sarbin & Hardyck, 1955) University of California, Berkeley (57 females, 30 

was utilized. It was developed in order to assess males) served as subjects. 

conformance and empathy as a dimension of role- 

playing ability. The test consists of 43 line-drawn Procedure r 

stick figures depicting people in varying states of The subjects were tested as a group. Each stick 

posture. Twenty-five of the figures were selected for figure was projected on a screen for 20 seconds under 

the present study. Any slide which contained lines six conditions, in the following order: matching the 

(or cues) in addition to the figure itself was elim- 25 stick figures to feeling (positive, neutral, nega- 

inated, tive), vertical direction (up, down), chromatic color 


oe ee 
Tooo = 
E ee) nh ee, a Aa 
o 8 
a oe 


Fic. 1. Stick Figure Test. 


ee 
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TABLE 2 
SUMMARY or Stick FIGURES, MODAL CHARACTERIZATIONS, AND REFERENT CHOICES 


Modal 


i Chrom: 
No | perceneaal Se eUe 
12 Curious Gray 
(51) 
16 Puzzled Green Gray 
: (39) (53) 
18 Indifferent Yellow Gra: 
(51) (85 
19 Rejecting red Black 
(39) (83) 
22 Watching | green-yellow Gray 
e|” (34; 36) (57) 
23 Dancer red-yellow White 
(35, 36) (97) 
24 Welcoming | green-blue White 
(35, 38) (57) 
25 Modeling White 
p (68) 
26 Military red-green 


attention (33, 36) 
29 Searching green-blue Black 


30 Watching blue-yellow | gray-White 


31 Relaxed blue Gray 
k (41) (54) 
32 Attentive gray-White 
(43, 49) 
a4 Violent Red Black 
anger (75) (90) 
35 Excited Red black 
ite (79) (54) 
36 Directing White 
(62) 
38 Sneaking Black 
(80) | 
39 Shy blue gray-white 
(41) (48, 39) 
40 Baseball Red Black 
umpire (62) (76) 
44 Stretching yellow gray-white 
) (46) (44, 51) 
46 Determined R 
S A Stealthy black-gray 
(43, 43) 
48 Affected yellow-blue Gray 
(36, 37) 65 
49 Balancing 
50 Thinking green-blue Gray 
(31, 47) (56) 


cee) | stapaiey ecan | Tateetion’ 
Stable Down Forward 
(47) (67) (90) (79) 
Stable Up Forward 
(58) (86) (89) (70) 
+ Flighty Backward 
(56) 9) 
= Stable Down Backward 
(93) (76) (83) 
aS Down 
(33, 35) (86) 
Flighty Up Forward 
(94) (86) (95) (92) 
Stable U Forward 
(70) 87) CD (89) 
Up Forward 
(64) (81) (87) 
=+ Stable Up Forward 
(39, 40) (88) (79) (76) 
- Down Backward 
(62) (95) (60) 
flighty . up Forward 
(38, 56) (61) (63) 
stable down Backward 
(40, 54) (62) (63) (84) 
Stable Up Forward 
(56) (19) (80) (70) 
— Flighty Down Forward 
(89) (66) (70) (78) 
a Flighty U 
(64) (85) (69) 
on Up Forward 
(57) (91) (77) 
- Flighty Down Backward 
(90) (84) (78) 
Flighty Down Backward 
(77) (79) 
= Stable Down Forward 
(67) (71) (91) (70) 
as Up 
(54, 38) (90) 
Stable up 
(45) (84) (65) 
— Flighty Down 
(58) (7) (84) 
Flighty Down 
(79) (71) 
Flighty Down Forward 
(83) (72) (82) 
+ Stable Down 
(52) (85) (79) 


are Note.—Uppercase notations are used only when agreement was overwhelming for a particular choice. Lowercase notations 
ane tsed when predominant, but not overwhelming, agreement obtained for a particular choice. When this predominance was 
s 


red by two choices, both are given. Numbers 


(red, green, yellow, blue), horizontal direction 
(backward, forward), achromatic color (black, gray, 


, White), and stability (flighty, stable). The following 


type of instructions were given: 


We have found that people can express the 
meaning of an idea without using words. One can 
use gestures, postures, line patterns, etc. What we 


ughout represent percentage of total sample (V = 87) cl 


‘oosing referent. 


are interested in here is the use of stick figures to 
represent an idea. Your task will be to decide for 
each of the stick figures you see, which of the 
four colors—RED, GREEN, YELLOW, or BLUE—it best 
represents. Please indicate your choice for each 
row [stick figure] in one of the four columns [the 
four ideas]. 
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The analysis utilizes the chi-square technique for 
“goodness of fit” for each of the stick figures by 
referent (conditions). The theoretic frequency 
for each cell, that is, the choice marked, was ob- 
„tained by examining the total frequency with which 
it has been chosen by the sample. Thus, the expected 
frequencies against which observations were tested 
were not derived from the less stringent assumption 
of a null relationship, that is, equiproportional dis- 
tribution of choice. Rather they were based on total 
choices. 


RESULTS 


The overall summary of responses to the 25 
stick figures under the six conditions is presented 
in Table 1.2 It is apparent that the stick figures 
are consistently matched with a variety of re- 
ferent choices, Highest agreement obtained when 
the stick figures were matched with vertical di- 
rection, achromatic color, and feeling, and least 
with chromatic color and horizontal direction. 
However, these differences were not significant 
with the exception of vertical direction as com- 
pared with chromatic color and horizontal direc- 
tion (p <.05). 

Figure 1 presents the stick figures used in 
the present study. 

Table 2 presents the stick figures and the 
referent choices by dimension. Whenever a cell 
has an entry (e.g., G for green under chromatic 
color), the figure yielded consistent agreement 
by the subjects in a statistically significant man- 
ner. Almost all of the stick figures were con- 
sistently matched by the subjects on several di- 
mensions. 

Table 2 also includes the modal perceptual 
characterizations of the stick figures obtained 
by previous investigators (Sarbin & Hardyck, 
1955), There is a certain logical consistency be- 
tween the choices of our subjects on the several 
dimensions of representation employed in this 
study and previous modal perceptual character- 
izations of the figures. For example, the major- 
ity of subjects agreed that the best description 
for Figures 34, 35, and 40 were, respectively, 
violent anger, excited, and baseball umpire. Al- 
though our subjects were not provided with 
these descriptions, they nevertheless consistently 
used them to represent red, black, and negative. 


Discussion 


This initial attempt to systematically examine 
the communicative value of postural gestures 
indicates that a high degree of consistency ob- 
tains among subjects in matching such gestures 
with a variety of referent choices. These find- 


2The authors are indebted to John Boaz for his 
aid in the analysis of the data. 


ings are particularly interesting since the situa- 
tional context in which such behaviors are ordi- 
narily observed was absent. They permit the 
conclusion that postural gestures may function 
as an independent source of communication, Con- 
sequently, it reinforces the theoretical notion 
that such expressions may be used by adults in 
conjunction with or as a substitute for verbal- 
linguistic communication. 

A variety of referent dimensions were con- 
sistently matched with stick figures. All of the 
stick figures were consistently employed to rep- 
resent at least three different referents. Thus, it 
is apparent that all the figures are plurisignifi- 
cant, that is, they may consistently convey @ 
variety of referents. Of all the dimensions used 
in the present study, the vertical choices seem on 
a priori grounds to be most explicitly tied to 
the stimulus material which consisted of two- 
dimensional presentations of postural gestures, 
This is consistent with the findings that highest 
agreement in matching obtained for the vertical — 
dimension. If one uses magnitude of intersubject 
agreement as an empirical index of the degree 
to which symbols explicitly or implicitly repre- i 
sent referents, then the results suggest that two- q 
dimensional postural gestures most explicitly 
represent vertical directions and most implicitly” 
represent horizontal directions and chromatic ` 
color, while achromatic colors, feelings, and sta- 
bility are somewhere in between. 

It is noteworthy that those stick figures for 
which greatest agreement obtained have been 
characterized earlier (Sarbin & Hardyck, 1955) 
by somewhat intense inner dynamics, for ex- 
ample, rejecting, welcoming, violent anger. On 
the other hand, those figures which resulted 
in least intersubject consistency were previously — 
characterized as possessing less intense inner 
dynamics, for example, watching, modeling, — 
stretching, stealthy. At the same time, the par- “a 
ticular choices made by the subjects in the pres- 
ent experiment are sometimes obviously con- 
sistent with the modal perceptual responses ob- — 
tained by Sarbin and Hardyck, as for example, 
Figure 19 which was perceived as rejecting and 
matched with red, black, negative feeling, stable, 
downward, and backward. Others are not as ob- 
viously related in a consistent fashion, although 
they are not inconsistent, for example, for Figs 
ure 16, the modal perceptual response was — 
puzzled, while the referent choices were green, f 
gray, positive, stable, upward, and forward. It is 
particularly the latter type of result which 
suggests the necessity for further investigation, 
namely, to determine the configurational char 
acteristics of the postural gestures which medi- 3 
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ate the modal percepts and the referents so that 
they are consistently related to each other for 


" the subjects. 


This study takes on added significance when 
considered in relationship to our previous find- 
ings with vocal-type gestures (Langer & Rosen- 
berg, 1964). It was found that speaker subjects 
could create vocal symbols as new names for 
chromatic color referents and complex spatial 
referents, for example, tall and wide, short and 
narrow, etc. Listener subjects matched these 
vocal gestures with referent choices, including 
color and complex space. Determination of the 


_ magnitude of speaker-listener agreement was pos- 


sible since the listener’s choices were from the 
same referents for which the speakers produced 
the symbols. A significant degree of speaker- 
listener agreement obtained for both referent 
dimensions. In addition, it was possible to assess 
the magnitude of agreement between listeners for 
the color and complex spatial referents, as well 
as on other referent dimensions, for example, 
direction. Again high consistency obtained. These 
findings suggested that speakers may indirectly 


„ communicate a variety of implicit meanings via 


the gestural-intonational qualities of their vocali- 
zations, when conditions such as the delimitation 
of the ‘boundaries of the dimension of reference 
obtain (see Langer & Rosenberg, 1964, for a 
fuller discussion). 

In the present study it was not possible to 
study postural-gestural communication between 
producers and observers; it was only possible to 
determine the magnitude of agreement between 
observers which was high. It is therefore likely 
that the same postural qualities, used by the ob- 
servers to determine their referent choices, would 
be used by producers when creating postural ges- 
tures for those referents. After all, in the ordi- 
nary communicative situation the same individual 
acts as both producer-speaker and observer-lis- 
tener. Nevertheless, this hypothesis is subject to 
empirical determination. 

Both studies are relevant to the psychology of 
esthetics, particularly, to the study of artistic 
Production and comprehension. For example, 


the findings are consistent with long-held notions 


m dramatics that actors may supplement the 
meaning of their verbalizations by variations in 
Intonation and posture (Aldrich, 1963). Vocal- 
gestural properties are used as a means of con- 
Veying qualities such as color, direction, etc.; 
and postural-gestural properties are used as a 
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medium for denotational and connotational ref- 
erence. y 

In conclusion, the present findings on postural 
gestures and the previous findings on vocal ges- 
tures suggest that nonverbal expressions may re- 
sult in uniform judgments. This is consistent with 
the theoretical presumption that nonverbal ex- 
pressions may be used to support verbal-lin- 
guistic communication (Werner & Kaplan, 1963). 
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INTERNAL VERSUS EXTERNAL CONTROL OF REINFORCEMENT 


AND DECISION TIME* 


JULIAN B. ROTTER anp RAY C. MULRY 


University of Connecticut 


Previous research has shown generality of individual differences in an expect- 
ancy that reinforcement is contingent on one’s own behavior (internal control) 
versus an expectancy that reinforcement is determined by luck, chance, fate or 
powerful others (external control). The hypothesis was investigated that “in- 
ternals” and “externals” differ in the value placed on the same reward depend- 
ing upon whether it is perceived as contingent upon chance or skill. To test this, 
decision time was measured in a difficult matching task, and described to 4 the 
Ss as skill, and to + as chance determined. Results show significant’ interaction 
between internal-external control and chance vs. skill instructions. As hypothe- 
sized, internals take longer with skill instructions, externals with chance instruc- 
tions. The study extends the construct validity of the internal-external control 
variable and has implications for personality, cultural differences, and decision 


theory. 


A series of previous investigations has dem- 
onstrated that the perception of a situation as 
controlled by chance, luck, fate, or powerful 
others will lead to predictable differences in 
behavior, in comparison to situations where a 
person feels that reinforcement is controlled by 
his own behavior (Holden & Rotter, 1962; James 
& Rotter, 1958; Phares, 1957, 1962; Rotter, 
Liverant, & Crowne, 1961). Other studies have 
indicated that individuals differ reliably in the 
degree to which they perceive reinforcement in 
a variety of ambiguous social situations to be 
controlled by their own characteristics and/or 
behavior versus by external forces. Such differ- 
ences have been observed in children and adults 
and appear to be generalized over a wide variety 
of social situations. 

Results of these investigations have shown 
that the individual who tends to perceive rein- 
forcement as contingent upon his own behavior 
is more likely to take social action to better his 
life conditions (Gore & Rotter, 1963), is more 
likely to attend to, and to learn and remember 
information that will affect his future goals 
(Seeman, 1963; Seeman & Evans, 1962), and is 
generally more concerned with his ability, par- 
ticularly his failures (Efran, 1964). The indi- 
vidual who seems to be more internal also ap- 
pears to have a greater need for independence 
(Crowne & Liverant, 1963) and is resistive to 
subtle attempts at influence (Getter, 1963; Gore, 


1The data for this research were collected while 
both authors were at the Ohio State University. The 
investigation was supported in whole by the United 
States Air Force, under Contract No. AF49(638)- 
741 monitored by the Air Force Office of Scientific 
Research, Office of Aerospace Research, 


1963; Strickland, 1963). Ethnic, cultural, and 
social class differences among groups on this 
variable appear to be consistent with the above 
description (Battle & Rotter, 1963). 

In social learning theory (Rotter, 1954), this 
characteristic is conceptualized as a generalized 
expectancy regarding the relationship between 
one’s own behavior, efforts or characteristics and 
reinforcement. It is related to the sociological 
concept of alienation in the sense of “power- 
lessness” (Seeman, 1959), 

Earlier studies of internal-external control as 
a personality variable have been concerned 
primarily with the generality of this character- 
istic and with construct validity. The present 
study is concerned with a different aspect of 
this problem, namely, that of potential differ- 
ences in the value or importance placed upon 
different kinds of reinforcements by individuals 
who can be characterized as more or less 
internal in their general attitudes. 

Previous research by Barker (1946) and 
Lotsof (1956) has shown that if expectancy for 
receiving a reinforcement is held constant, then 
decision time for choosing one of a closely 
matched pair of possible reinforcements in- 
creases with the importance of the reinforce- 
ment. This research, along with common-sense 
considerations, suggests that individuals who 
place greater value on an outcome would take 
longer in making a difficult discrimination. 
Specifically, the study tests the hypothesis that 
internals will take longer to make a difficult 
discrimination in a task which they perceive to 
be skill determined and that, relative to others, 
externally oriented individuals will take longer 
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to be determined by luck or chance. Essentially, 
_ it seems logical that an individual who felt 
` that what happened to him depended on his own 
skills would place higher value on demonstra- 
tion of skill (since it indicated a promise for 
future rewards) than would a person who felt 
that reinforcements were arbitrarily dispensed 
independently of his own actions. The latter 
individual might, on the other hand, regard 
“luck” as a personal although unstable attribute 
and would have greater concern with whether 
or not he was a lucky or unlucky person. In- 
creased value in turn would lead to longer 
, decision time in both cases. Since the discrimina- 
tion was very difficult the person with more 
involvement in being correct would spend more 
A time comparing the alternatives. 

The measurement of individual differences has 
been accomplished primarily by a forced-choice 
questionnaire (the I-E Control Scale) which 
has evolved through a series of refinements to 
a 29-item test including 6 filler items.? The test 
has demonstrated adequate internal consistency 
and test-retest reliability. 

The test primarily samples general attitudes. 
` None of the items deal directly with the values 
placed, upon skill reinforcements versus luck or 
chance reinforcements. A sample item follows: 


a. In my case getting what I want has little or 
nothing to do with luck. 

b. Many times we might just as well decide what 
to do by flipping a coin. 


The score on this test is simply the number of 
“external” choices made by the subject. Typical 
college student means are approximately 8.5 and 

medians 8.0, 

The present study was designed to provide 
Subjects, dichotomized into internal and external 
control groups, with a difficult discrimination 
Problem. Half the subjects were instructed that 
the discrimination was so difficult as to be a 
matter of luck. The other half were told that 
although the discrimination was difficult previous 
Tesearch had shown that some people are more 
|: skilled in making such discriminations. 

To test the hypothesis it was necessary to 
control the frequency and order of reinforce- 
Ment. Previous research by James (1957) has 
indicated that it was possible to control rein- 

> forcement and to have subjects accept the 
instructions of chance or skill control in an angle 


Baas scale was developed by S. Liverant, J. B. 

‘otter, M. Seeman, and D. Crowne. An article pro- 

f% Viding extensive data on the development, validity, 

and ze pe ity of this test is now in preparation by 
Otter. 
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matching task. In this task there were actually 
no correct matches, but for every angle to be 
matched there were four equally close alterna- 
tives. After each trial the experimenter merely 
said right or wrong and subjects accepted 
the experimenter’s statement without apparent 
questioning. 


METHOD 
Subjects 


The subjects included 61 females and 59 males. 
The subjects were obtained from the elementary 
psychology course at the Ohio State University 
where all subjects were required to take part in 
some research but were able to choose, on the 
basis of minimal descriptions, the experiment they 
preferred. 


Procedure 


The subjects were tested by two experimenters.? 
The subjects were randomly assigned to either 
chance or skill groups, Half received the I-E Con- 
trol Scale before the experiment, and the other half 
after the experiment. Without scoring the tests in 
the case of the former group the subjects were 
assigned on an alternating basis to either the chance 
or the skill group and given the angle matching 
tests. For the first eight trials, referred to as the 
training trials, all subjects received the same order 
of right and wrong reinforcements. They were told 
they were correct on 75% of the first eight trials. On 
the second and fourth trials they were told they were 
wrong. Following this sequence an extinction series 
was given on which they were told they were wrong 
on every trial. Fifty extinction trials were given 
and the experiment was terminated if the subjects 
did not extinguish before this point. Before each 
trial the subject was asked to state a probability 
from O to 10 as to whether or not he expected to 
be successful on the next trial. The points along 
this continuum were defined for the subject. During 
the extinction series if the subject stated an 
expectancy of O or 1 for two consecutive trials 
he was considered extinguished and the experiment 
terminated.* 

The task consisted of 28 standard cards mounted 
on a board. On each 6 X 6 inch card was an angle 
made by black tape. The standards were spaced at 
5-degree intervals. The smallest angle was 40 de- 
grees and the largest was 105 degrees. Two standards 


8 The subjects were tested by the junior author 
and by Mary Kay Moats. Grateful acknowledgment 
is made to Mary Kay Moats for her contribution 
to this study. 

4 Extinction is used throughout this article to 
refer to the reduction of a verbalized expectancy 
for success related to continuous failure trials. 
Holden and Rotter (1962) have previously shown 
that behavioral choice criteria (other than verbal) 
are affected in a similar fashion to verbalized ex- 
pectancies with continuous failure experience. 


600 


for each angle were made. The matching stimuli, 
referred to as samples, consisted of 13 angles 
mounted on 4X4 inch cards beginning with 42.5 
degrees and spaced at 5-degree intervals so that the 
largest was 102.5 degrees. None of the samples 
matched any of the standards but for each sample 
there were four, nonadjacent equally close angles 
(2.5 degrees from the sample) with their apexes 
turned in different directions and the sides of un- 
equal lengths. The task was so difficult, therefore, 
that the experimenter could say right or wrong on 
any trial and the subject did not apparently dispute 
his statement. Postexperimental interviews with pre- 
test subjects indicated that subjects only rarely 
(1 or 2 of a group of approximately 20) questioned 
the accuracy of the experimenter’s judgment during 
extinction only and none questioned whether or not 
a true match was possible. 

Since there were only 13 sample angles and the 
experiment could continue for a maximum of 58 
trials, standard samples were presented in order and 
after Trials 13, 26, 39, and 52 the standards were 
rotated 90 degrees and the same sequence repeated. 

Decision time was measured on a running second 
hand, the experimenter standing slightly behind the 
subject so that the subject was not aware that he 
was being timed. Timing began when the subject 
was handed the sample card and ended as soon as 
he indicated which of the standards was the correct 
match, 


Instructions 


The verbatim instructions given to the subject are 
presented below: 


We are doing a series of experiments to test 
visual discrimination under conditions of per- 
ceptual confusion. In this experiment we are 
concerned with the judgments of degrees of angles 
when the differences between them are small and 
when there are a number of confusing elements 
present. 

[For skill groups only]: We have found that 
some people have a special skill at this and do 
consistently better than others. 

[For chance groups only]: We have found that 
under these conditions success is entirely a matter 
of chance, 

The way in which the test works is as follows: 

There are a number of angles on this board. 
They are of different degrees and are set in 
varying positions. As you can see, some of the 
angles on the board are very similar to each other. 

I also have here a series of cards such as this 
one [experimenter shows sample card] with vari- 
ous angles on them. The angles on these cards are 
of different degrees and are placed in varying 
positions on the cards, 

On each trial I will show you a card such as 
this. The idea is to look at the angle on the card 
and then to choose, from the angles on this board. 
the angle which you think is the same degree Be 
the one which I show you. Thus, if you think 
this angle [experimenter holds up sample card] 
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is the same degree as this one on the board you 
would say [experimenter points to the M card] 
M. I have the set of answers here and I will let 
you know whether you are right or wrong. The 
jdea is to concentrate only on the degree of the 
angle. All of the cards are of approximately equal 
difficulty. 

Do you have any questions so far? [If the 
subject asked a question, the relevant parts of the 
instructions were repeated to him.] 

The idea of the test, of course, is to get as 
high a score as you can. I will be keeping score 
and at the end of the experiment I will give you 
your total score and let you know how you did 

“compared to other people. 

Now there is another factor which can affect 
your total score. Besides scoring the angles which 
you get right, I am also interested in how accu- 
rately you can predict how you will do on each 
trial. Before each trial I would like you to state 
what you feel the probability is of your being 
right on that trial. You can rate this on a scale 
going from 0 to 10. For example, if you feel sure 
that you will be right you would state a high 
number like 9 or 10. If you feel only moderately 
that you will be right, you would state a lower 
number like 5 or 6. If you feel that you won't be 
right, you would state the lowest numbers like 
0 or 1. You might consider those numbers which 
you state as being estimates that you are, making 
on the degree of confidence you have that you 
will be right. If you are right on the trial, the 
number you stated will be added to your total 
score; if you are wrong on the trial, the number 
you stated will be subtracted from your total 
score, From this you can see that it is very im- 
portant that the estimates you make correspond 
closely with how you really feel you will do. 

Do you have any questions? 

[For skill groups only]: Although the discrimina- 
tions required here are at a difficult level, we 
have found that some people are highly skilled 
at this and are able to get consistently high 


scores. The results depend entirely upon your ` 


ability. Do as well as you can and we will see 
if you have some skill at this. 

[For chance groups only]: Although the discrimi- 
nations required here are at a level which makes 
it entirely a matter of chance, some people: are 
lucky and get high scores. The results depend 
entirely upon guessing. Do as well as you can 
and we will see if you are lucky at this. 


Summary of Design 


Four groups of subjects were compared on a series 
of dependent variables. To obtain these groups sub- 
jects were divided into “internals” and “externals” 
by splitting the groups at the median and then 
further subdividing on the basis of whether they 
were given chance or skill instructions. Since the 
determination of whether they were internals °° 
externals was not made until after the experimen 
procedure was completed (thereby avoiding possible 
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TABLE 1 
+ MEANS AND STANDARD DEVIATIONS OF Major DEPENDENT VARIABLES FOR THE Four EXPERIMENTAL GROUPS 

= 
3 Internals Externals 
' $: 
Decision time 
y Chance Skill - Chance Skill 
$ (W = 29) (N = 36) (N = 29) (N = 26) 
A Training trials 
Seconds 
M 24.6500 37.7400 28.6800 26.2000 
SD 16.4400 25.1100 17.0100 14.7300 
Logs* 
iL 1.2985, 1.5090, 1,3847a» 1.3414, 
SD 3020 P 2440 «2500 -2504 
_ Extinction trials G 
"Seconds 
M 29.2700 49.1000 35.4800 32.7900 
on SD 18.3600 30.5200 25.3200 18.4800 
€ Logs! 

S M 1.3850» 1.5978, 1.4591» 1.4379, 
g SD .2845 3070 3710 -2690 
~ Mean expectancy 

First 8 trials 
qj M 5.9400 6.1800 5.8700 5.7600 

SD 1.5300 1.5200 1.6200 1.5400 
Number of trials to extinction 
16.1700 18.3600 19.1000 16.7300 
SD 14.2600 14.7400 14.7000 10.9800 
Unusual shifts 
First 10 trials 
M -7590 7770 1.2060 1.1920 
SD° 1.2700 1.4600 1.1630 1.8400 


f a Significant differences in means at the .05 level or better are indicated by letter subscripts, as determined by the Duncan 
Tange test. Means with the same subscript are not significantly different from each other. 


bias), the groups are slightly unequal. While the 

main comparison among groups was decision time, 

additional indices were computed to determine 
whether or not any other factors might be influ- 
© encing decision time. These included the mean ex- 
_ Pectancy for the training trials and number of trials 
i to extinction of the verbalized expectancy for 
4 Success. In addition, the frequency of unusual shifts 
in expectancies was computed. The latter measure 
is Simply a count of the number of times that a 
Subject raises his expectancy after a failure or 
lowers it after a success. Previous studies have 
shown that externals tend to make more such 
y Unusual shifts than internals but findings have not 
always been consistent. 


RESULTS 


__ The average I-E Control Scale score was 8.48 
‘and the median was 8.0. Therefore, all sub- 
Jects with scores of 9 or more were classed as 
els and with 8 or less as internals. The 
tour experimental groups, then, included 29 sub- 
jects classed as internals who were given chance 
structions and 36 subjects classed as internals 
4, Who were given skill instructions, There were 29 
Sie who had been given chance instructions 

26 externals who had been given skill in- 


structions. The 59 males and 61 females were 
randomly distributed among the four groups. 
Analysis of variance indicated that males and 
females did not differ significantly on any of 
the experimental measures and, consequently, 
sex did not appear to be a factor in accounting 
for any of the results. Analyses of variance were 
performed on all variables to determine whether 
data collected by the male experimenter dif- 
fered in any way from data collected by the 
female experimenter. No significant differences 
were obtained. Similar analyses of variance were 
performed on the data obtained from the 60 
subjects who took the test before in comparison 
with the 60 subjects who took the I-E Scale 
after the experimental procedure. In this case 
a significant difference in decision time only 
was found. Subjects who took the test before 
the experimental procedure had significantly 
longer (p < .01) decision times on the first eight 
trials of the experimental procedure. Decision 
time for the extinction trials was also longer, 
but not significantly so, for those subjects who 
took the test before. There was no significant 
interaction in these effects with the chance and 
skill or internal-external control variables. That 
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is, the effects were “across the board” and did 
not account for any of the differences between 
internals and externals or chance and skill 
groups. What these findings suggest is that by 
giving tests first the subjects became more 
involved in the experimental procedure than 
when the testing followed the angle matching 
task. 

The major experimental results are given in 
Table 1. Decision time for the first eight trials 
and extinction trials are given in seconds and in 
logs. Conversion to logs of mean decision times 
was considered advisable to reduce the effect 
of a few extreme scores. While these subjects 
did not, on the whole, run counter to the rest 
of the group in the direction of their deviance, 
they tended to raise the variance for the group 
in which they appeared and therefore affected 
group differences to an unjustified degree. Analy- 
sis of the data was made using log conversions. 

The main hypothesis of this study was that 
internals would place greater importance on 
whether or not they were correct in the skill 
instruction situation than in the chance instruction 
situation and vice versa for externals. The most 
important test of the hypothesis was the 
decision time for the first eight trials in which 
all the subjects received the same order and 
number of positive reinforcements (75%). To 
be sure that differences in decision time were 
not dependent upon significant differences in 
mean expectancy for success, which could also 
influence decision time, a test was made of 
differences among the four groups on mean 
expectancies for success for the first eight trials. 
The means are presented in Table 1. Analysis 
of variance indicated no significant difference 
on any of the variables so that it may be 
concluded that differences in decision time among 
the four groups were not a function of differences 
of expectancy for success. 

An analysis of variance for log decision times 
was computed. Neither internal versus external 
control nor chance versus skill alone was a sig- 
nificant variable on decision time, but the pre- 
dicted interaction of these two was highly 
significant (F = 6.6667, df=1/116, p< .025). 
The mean differences are shown in Table 1. 
Internals take much longer to decide on the 
correct match when they are instructed that the 
task is a matter of skill than when they are 
instructed that it is a matter of luck or chance, 
Externals, on the other hand, take longer to 
decide on the correct stimulus when they are 
told it is a matter of luck than when they 
are told it is a matter of skill but not signifi- 
cantly so. The Duncan range test for internals 
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under chance versus skill is significant. The dif- 
ference is not significant and is numerically — 
smaller for externals under chance versus skill, 
It is clear that most of the variance is con- 
tributed by the internals who take much longer 
when they are told it is skill than do the other < 
groups. Internals in the skill condition are also 
significantly different from externals with skill 
instructions. 

A further comparison on decision time could 
be made for the extinction trials if the number 
of such trials was not significantly different for 
the various groups. The mean trials to extinction 
are given in Table 1. Analyses of variance show i 
that neither internal-external control, chance- 
skill, or an interaction is significant. The number 
of trials to extinction was not significantly 
related to decision time (r = —.16). 

Decision time for the extinction trials then 
could serve as a cross-validation of decision 
time for the training trials. An analysis of vari- 
ance for log decision time was computed. The 
chance-skill variable approaches significance but 
only the interaction is clearly significant (F 
= 6.850, df =1/116, p< .025). It can be seen” 
from the means in Table 1 that the results were 
essentially the same as those for the eight train- 
ing trials, but decision times in general were 
somewhat longer during extinction. The longer 
decision times in extinction could be expected as” 
Lotsof (1958) has shown that decision time in- 
creases with increased expectancy for failure: 
However, since the groups had approximately 
the same mean number of trials to extinction, 
one could assume that they had equivalent ex- 
pectancies for failure. Again, according to the 
Duncan range test the internals took significantly 
longer to decide the correct match under skill 
instructions than under chance and significantly 
longer in the skill condition than externals in the” 
skill condition. 

Previous studies of internal-external control 
have shown that externals tend to make shifts” 
in expectancy up after failure or down after 
success with greater frequency than do internals. 
The tendency is somewhat stronger with children 
(Battle & Rotter, 1963; Bialer, 1961) but has 
also been observed with adults (James, 1957; 
Phares, 1957). Mean tendencies in the present 
study are in the same direction. The mean 
unusual shifts for the four groups are shown in 
Table 1. Since all subjects had only the first 
10 trials in common, unusual shifts for only the 
first 10 trials were calculated. The majority © 
subjects made no unusual shifts. "i 

Analysis of variance of unusual shifts indi- 
cated no significant variable although intern: 


= 


= 
© 
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versus external control was at the <.20 level. 
A chi-square analysis of those subjects who 
showed two or more unusual shifts versus one 
or none indicates a tendency for externals to 
more often have two or more unusual shifts 
(p<.10). The findings are consistent with 
previous research. Externals more often make 
unusual shifts in expectancy, presumably fol- 
lowing the “gambler’s fallacy” of raising the 
expectation after failure or dropping it after 
success. The trend is not strong, however, with 
college student subjects and only approaches 
significance, 4 


° 
DISCUSSION 


The findings of this study are relatively clear- 
cut. Individuals who can be characterized as 
internals from scores on the I-E Control Scale, 
take longer to decide in a matching task when 
the task is defined as skill controlled than when 
it is defined as chance controlled. The opposite 
tendency is found with subjects who are classed 
as externals. Externals tend to take longer to 
decide on the correct match when the task is 


+ defined as chance than when it is defined as 


skill controlled. The interaction is highly signifi- 
cant koth for a predominantly positive reinforce- 
ment sequence of training trials and for a series 
of continuously negative extinction trials, Inter- 
nals under skill conditions are significantly dif- 
ferent from intervals under chance conditions, 
and internals under skill conditions are signifi- 
cantly different from externals under skill condi- 
tions. On the other hand, while externals differ 
under chance and skill conditions in the pre- 
dicted direction, these groups are not significantly 
different. In other words, most of the difference 
1s attributable to the longer time taken by 
internals under skill conditions. Perhaps this is 
ndt surprising since we are dealing with a select 
Population which is probably more internal than 
the general population. The college students we 
are characterizing as externals, relative to the 
other subjects, are probably closer to the middle 
of a theoretical distribution. This study extends 
the construct validity of the internal-external 
Control dimension and indicates that people dis- 
tributed along this dimension not only differ 
Predictably in expectancies regarding their own 
behavior-rein forcement contingencies but also 
in the degree to which they value chance 


determined and skilled determined rewards. 


j These results have some interesting implica- 
tions for the study of cultural differences. If it 
18 true that groups who learn to expect chance 
or fate or powerful others to control the en- 


+ vironment tend also to place value on reinforce- 
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ments which they see as controlled by these out- 
side influences than those which they perceive 
as a function of their own skill, it would imply 
that they would be less motivated towards an 
increase in skill or achievement. The findings 
also suggest support for Strodtbeck’s (1958) 
hypothesis that differences in value placed upon 
achievement in Jewish and Italian subcultures 
relate to a generalized attitude concerning the 
individual’s ability to control the environment 
which he felt differentiated these subcultures. 

Internals may tend to select activities in 
which they can demonstrate skill, and externals 
activities in which they can demonstrate luck. 
This is a testable hypothesis. Studies of prefer- 
ence for games and different types of games and 
gambling activities could readily be accom- 
plished. Similarly, the concern with superstitious 
signs of good and back luck will be related, and 
we would expect to find that externals more 
frequently consult fortune tellers, have their 
horoscope read, etc. 

This study also has implications for decision 
theory. Feather (1959) makes the assumption 
that reinforcement in a chance task is based 
on the value of the reinforcement alone but in 
skill tasks it is a combination of the value of 
the reinforcement and the value of success 
which he sees as based on achievement need. 
Presumably the ego involvement would always 
be higher in skill tasks when other factors are 
controlled. The finding that externally oriented 
individuals do not decrease decision time under 
chance instructions, but tend if anything in the 
opposite direction, suggests that Feather’s results 
will obtain only in predominantly internally 
oriented samples. 
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REACTION TIME AND HEART RATE AS FUNCTIONS OF MAGNI- 


TUDE OF INCENTIVE AND 


ROGERS ELLIOTT 
Dartmouth College 


The effects of 3 probabilities of success (PS) and 3 magnitudes of incentive 
(MI) upon reaction time (RT) and heart rate (HR) were examined in 3 Ss, 
each repeating the matrix of conditions 10 times. Results depended upon Ss’ 
experience, with significant motivational effects on HR during the early ses- 
sions, but not later, and significant effects on speed during the later days, but 
not the earlier, The effects of PS and MI upon RT were interactive. j 


This experiment was done to assess the influ- 
ence of two motivational variables upon human 
perceptual-motor performance, and upon arousal. 
The two motivational variables are probability of 
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PROBABILITY OF SUCCESS* 


success (PS) and magnitude of incentive (MI). 
The former refers to the expectation on the part 
of the subject that his response will satisfy some 
requirement for success in some task; the latter 
refers simply to the size of reward given for 
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successful responses. The arousal response was 
heart rate (HR), and the perceptual-motor re- 
sponse was reaction time (RT). 

Both Atkinson (1958) and Zipf (1961) have 
studied human performance as a function of PS 
and MI, among other variables. Atkinson found 
that performance was better at the higher of two 
monetary incentives and also better at a PS of 
.50 than at any PS either higher or lower, There 
was, in his study, no interaction of effects of PS 
and MI. Zipf’s design was more complex, but 
her findings with regard to MI and PS were 
these: there was no interaction between them 
(in common with Atkinson’s study); MI had no 
effect upon performance (different from Atkin- 
son); and quality of performance was a mono- 
tonic decreasing function of PS (also different 
from Atkinson). 

The two studies were different in many ways 
in design and detail—but there is still no obvious 
reason why MI was an effective variable in one 
study but not in the other. Furthermore, the 
failure of PS and MI to interact in effects upon 
performance (the one finding common to both 
studies) is a surprising one. A person might not 
work at all at any level of PS save certainty if 
the incentive is very small; but for a very large 
inceftive, the function of performance on PS 
would certainly be different, in some as yet un- 
known way, from the function obtaining at the 
very small incentive. Some of the disharmony 
and unconvincingness of the results may have 
been due to the fact that the use of a single 
session with verbal or written instruction to 
establish the levels of independent variables may 
not have been sufficient to persuade subjects that 
the experimenters meant just what they said. As 
Zipf pointed out, “need to achieve, desire to 
please the experimenter, and other personality 
factors” may have been involved. 

The present design was constructed to obviate 
some of the difficulties just pointed out. Repeated 
sessions replaced the single session; money was 
paid as it was won, and it was won with the 
probability which the experimenter had estab- 
lished, so that the subjects could become con- 
vinced of the reality of variables. The range of 
incentives was larger than that used by Zipf or 
Atkinson, so that any interaction that exists be- 
tween MI and PS might appear. And, finally, 
the fluctuations in mean daily payoff were a 
function of how “good” the subject was that 
day at his “work,” so that the laboratory task 
had some relevant extralaboratory qualities. 

The variables of MI and PS might well have 
effects not only upon speed of reaction, but also 
upon the arousal of a subject. Although it is 
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sometimes hypothesized that arousal itself has 
effects upon performance (e.g., Duffy, 1962), it 
is probably simpler in most cases to conceive of 
both the performance and the arousal response 
as affected, each in its own way, by some indes 
pendent variable, like the motivational ones 
employed here, and to examine the covariation 
of overt and covert effects without any implica- 
tion of causality. The present design offers an 
opportunity to examine in what ways, if any, a 
vegetative response differs motivationally from a 
perceptual-motor one. 


METHOD 
Subjects 


The subjects were three young adult males, two 
undergraduates and one graduate student at McGill 
University. All had had extensive experience with 
the lab and the RT program in previous studies, 
and so had produced the distribution of reactions 
needed by the experimenter to predict the proba- 
bilities with which they might perform above a 
certain speed. 


Apparatus 


The RT system was activated by a power switch, 
which started a cam rotating. The rotating cam 
activated a microswitch which closed two relay 
circuits: one circuit started the operation of the 
Dymec Model Dy 2500 variable time base counter, 
accurate to 1 millisecond, and the other circuit turned 
on the audio oscillator, which presented the 1000 
cycles-per-second stimulus tone. The gain of the 
audio oscillator was set to deliver a tone 70 decibels 
above the average threshold of two observers of 
normal hearing. 

Pressure on the reaction key broke the timer 
circuit, stopping the counter. The relay circuits con- 
necting the reaction key with the timing devices had 
a release time of 4-8 milliseconds. When the counter 
stopped, a Hewlett-Packard digital recorder (Model 
560A) printed the RT in milliseconds on a paper 
tape. Rotation of the cam broke the circuit pre- 
senting the stimulus tone after the tone had been 
on for .70 second, and further rotation continued to 
control the stimulus program and timing circuits 
until the power was turned off. 

One rotation of the cam delivered 24 tones, The 
24 interstimulus intervals (tone-onset to tone-onset) 
varied from 3.2 to 4.7 seconds, with a mean interval 
of 4.0 seconds. Half of the intervals were shorter 
than this mean, and half longer, and these long and 
short intervals occurred in random order. 

The reaction key of this apparatus was attached 
to the right side of the subject’s chair, and its posi- 
tion was adjustable so that the subject’s finger could 
rest comfortably upon it. One ink-writing galva- 
nometer recorded the onset and termination of each 
tone and each response on one of the three chart 
drives of a custom built Edin EEG-EMG, described 
elsewhere (Malmo & Surwillo, 1960). 
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The HR was also recorded on the same chart 
drive by means of a continuous electrocardio- 
graphic tracing. The HR electrodes were of cellu- 
lose sponge soaked in electrode jelly, and at- 
tached by elastoplast tape to the right and left 
biceps of the subject. 


Procedure 


Each subject came for a session each day for 10 
days, Tuesday through Saturday, and Monday 
through Friday. Before the first session the nature 
of the experiment was explained—that is, that the 
subject would be reacting for varying amounts of 
money with varying probabilities of winning. The 
HR electrodes and earphones were then put on, the 
subject was seated comfortably, and was given a 
warm-up block of 21 trials (reaction tones). The 
warm-up block was introduced with the following 
instructions: 


For this set, you have a one in three chance of 
being faster than (say) 140 milliseconds. You 
ought to beat 140 milliseconds about 7 times, and 
each time you do, you win 3¢, or about 21¢ in all. 


The warm-up condition allowed the experimenter 
to adjust his predictions of the subject’s perform- 
ance to take into account day-to-day variation in 
speed, After this condition there were nine others, 
consisting of the combinations of three levels of 
PS—.10, .50, and .90—and three levels of MI— 
$.01, $.02, or $.04 per winning reaction. Instructions 
like that given before the warm-up block but modi- 
fied to fit the particular treatment combination 
were given before each new block. After each block, 
the experimenter announced to the subject his num- 
ber of winning trials and the total won. It can be 
seen that the expected range of payoffs per block 
was from about $.02 to about $.72, and that the 
expected mean for the session (excluding the warm- 
up block) was $2.10. 

The method of constructing the order of the nine 
treatment combinations can be best expressed by 
example, For Sı, the first day order was 5-4, 9-2, 
1-1, 9-1, 1-4, 5-2, 1-2, 5-1, 9-4. Each third of this 
series has all levels of MI and of PS. On the sec- 
ond day, the order was reversed. On the third day, 
the same plan was employed with a different order, 
with care that the extreme conditions of Days 1 and 
2 were no longer extreme. Thus, a pair of days was 
counterbalanced, and more than one pair assured 
some degree of control over the adventitious effects 
of a particular order. The different subjects had 
different orders. 

The method of establishing the level of PS de- 
pended, as noted, upon the previous distribution of 
reactions given by the subject, with adjustments on 
the basis of the daily warm-up, In addition, the 
experimenter allowed himself 3 milliseconds plus or 
minus the stated speed requirement for the purpose 
of keeping the number of wins close to the pre- 
dicted level. Thus, at PS=.50, the subject might 
have to beat 145 milliseconds. He might then have 
17 RTs faster than the requirement, but only 14 
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TABLE 1 


SIGNIFICANT EFFECTS IN THE OVERALL ANALYSES 
oF REACTION Time AND HEART RATE 


Effect af F 

Reaction time 

Blocks 4/8 17.77*** 

MI 2/88 4.27* 

Subjects X blocks 8/88 Blots 
Heart rate 

Blocks 4/8 5.02* 

Subjects X blocks 8/88 45.11*** 

Subjects X MI 4/88 5.88*** 

Blocks X MI 8/88 2.16* 

*p <.05. 
> <.01. 
wD <.001. 


faster than 142 milliseconds. The experimenter 
would use the harder requirement, calling out the 
number of wins (14) closer to the prediction. These 
small adjustments were made on one-third of the 
blocks, They were indiscriminable to the subjects, 
who when interviewed reported never having sus- 
pected them. 


Analyses 


An overall analysis of variance was done on: both 
RT and HR data using the dimensions 3 (subjects) 
X 3 (levels of PS) X 3 (levels of MI) X 5 (counter- 
balanced blocks of days). The results of this overall 
analysis were refined somewhat, to bring out the 
influence of experience, by two further analyses 
based on the data of four pairs of days, distinguished 
as early and late: that is, the combined Days 1-4, 
and the combined Days 7-10. The RT score for any 
subject for any treatment combination was the 
mean of the medians of the 21 RTs given to that 
condition on each of the four days being combined. 
And the HR score for any subject for any treatment 
combination was the mean of the four rates ob- 
served in that condition on each of the four Days 
1-4 or 7-10. All rates were based upon a count of 
all the QRS deflections occurring during the 80-sec- 
ond period starting just after the first stimulus tone 


and ending at just about the onset of the last stimu- 
lus tone. 


RESULTS AND Discussion 
Prediction of PS 


Subjects succeeded, or won, 12% of the time 
at the predicted .10 level, 45% of the time at 
the predicted .50 level, and 85% of the time at 
the predicted .90 level. At the .10 and .90 levels 
of PS the obtained percentages of wins varied 
only slightly from subject to subject and from 
one level of MI to the next. A considerable 
error on prediction occurred, in fact, on only 
one of the nine treatment combinations—that is, , 
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TABLE 2 
AVERAGE REACTION TME (MILLISECONDS) OF THE SUBJECTS UNDER ALL CONDITIONS 
DURING Days 1-4 COMBINED AND 7-10 COMBINED 
Days 1-4 Days 7-10 Y 
Ebaby Magnitude of incentive Magnitude of incentive 
$.01 $.02 $.04 $.01 $.02 $.04 

-90 
Si 145.75 146.75 145.00 129.00 131.75 131.25 
S2 151.75 150.00 - 152.75 143.00 146.00 139.25 
Ss 160.00 162.75 » 155.00 139.25 144.00 140.75 
M 152.50 153.17 150.92 137.08 140.58 137.08 

50 
Sı 144.00 141.75 144.50 132.25 131.25 128.00 
Sa 151.50 145.75 145.75 144.75 141.00 139.25 
S3 159.75 161.25 154.25 148.25 141.50 139.25 
M 151.75 149.58 148.17 141.75 137.92 135.50 

.10 
Sı 142.00 144.00 146.50 132.00 130.50 133.75 
S: 153.25 153.00 146.75 140.00 138.50 141.75 
Sa 153.25 150.75 149.50 145.00 138.75 143.00 
M 149.50 149.25 147.58 139.00 135.92 139.50 


when PS was .50 and MI was $.01, subjects suc- 
ceedèd only 39% of the time. The source of the 
error is not known, but accuracy of prediction 
will probably improve when predictions for a 
given condition are based only upon responses 
made in that condition. The subjects, in post- 
experimental questioning, believed that the level 
of PS stated was in fact the level they could ex- 
pect. 


Efects of PS and MI 


The major results of the overall four-way 
analyses of variance are shown in Table 1, for 
both RT and HR. 

The main effect of blocks for RT reflects the 
fact that RT declined steadily in all subjects 
over the 10 days, an average of 20 milliseconds. 
For HR, the main effect of blocks reflected the 
fact that HR rose for all subjects on Day 5 or 
6, and then declined to original levels. In every 
case the rise corresponded to a fairly steep in- 
crease in reaction speed. For RT, there was an 
Overall effect of MI upon speed, and for HR, the 
effect of MI depended upon the point in the ex- 
periment one observed. A clearer idea of what 

ppened over time can be gained by examining 

. Separately results from the early (Days 1-4) 
ma Re late (Days 7-10) periods of the experi- 
ent, 


The breakdown of the overall effects into such 


-e early and late periods is shown in Tables 2 and 


3. Each cell represents a treatment combination 
and contains the score of the three subjects and 
the cell mean. 

In the case of RT, a 3 X 3 analysis of variance 
(Walker & Lev, 1953, pp. 363-372) revealed no 
significant main effects or interactions for Days 
1—4. It can be seen from the left panel of Table 
2 that subjects responded idiosyncratically to 
levels of MI and PS. But the analysis of the 
performance scores for Days 7-10 revealed a 
significant interaction effect of PS with MI (F 
= 8.06, df=4/8, p<.01), and the effect of 
MI alone was itself of borderline significance (F 
= 2.90, df = 2/12, p<.10). One can see from 
the right panel of Table 2 that the subjects came 
to resemble each other in the pattern of per- 
formance, presumably because of some common 
effect of the motivational variables, especially of 
MI. 

These results indicate that what one finds in 
a study of the effects of PS and MI upon per- 
formance is dependent upon the subjects’ experi- 
ence with the experimental situation and task. 
There is no way to tell whether the large groups 
used by Zipf and Atkinson for one session can 
be considered in any sense comparable to the 
small group used here for 10 sessions. If, indeed, 
the studies are comparable, then the present re- 
sults serve to resolve some of the discrepancies 
of the two earlier investigations. That is, the 
interactional influence of PS and MI upon per- 
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TABLE 3 
AVERAGE Heart RATE (BEATS PER MINUTE) OF THE SUBJECTS UNDER ALL CONDITIONS 
DURING Days 1-4 COMBINED AND 7-10 COMBINED 
5 Days 1-4 Days 7-10 
Pr cuauilty. Magnitude of incentive Magnitude of incentive 
$.01 $.02 $.04 $.01 $.02 $.04 
90 
Si 80.34 79.83 79.64 84.38 82.78 84.56 
Se 63.19 62.53 66.28 62.62 62.91 62.06 
Sa 88.22 87.94 91.59. 95.34 95.53 99.75 
M 77.25 16.77 79.17 80.78 80.41 82.12 
+50 
Si 79.59 79.41 81.09 82.97 85.22 83.81 
So 63.66 62.15 65.34 63.75 63.38 62.25 
Ss 87.38 89.25 91.69 95.53 99.38 97.69 
M 76.88 76.94 79.38 80.75 82.66 81.25 
10 
St 77.48 80.39 81.75 83.16 83.34 84.84 
Sa 62.34 63.09 63.38 62.91 62.16 62.81 
Ss 86.72 87.75 92.81 97.41 96.56 98.62 
M 75.52 17.08 79.31 81.16 80.69 82.09 


formance may have occurred in this study, but 
not in the earlier ones, because it includes the 
different results of each of the earlier studies. To 
illustrate, using the data of Table 2, when MI 
was $.02 per reaction, quality of performance 
was a decreasing function of PS—the result 
found by Zipf; but when MI was $.04 per re- 
action, quality of performance was an inverted 
U shaped function of PS—the result found by 
Atkinson. Since the two incentive values used by 
Zipf were both lower than the lowest of the two 
used by Atkinson, these results, based on a broad 
range of incentive values, provide replication of 
both earlier studies. 


Effects of PS and MI on HR 


Table 3 illustrates the results concerning moti- 
vational effects upon HR. In the case of the 
vegetative response, as distinct from the per- 
ceptual-motor one, there is a direct main effect 
on HR associated with MI during the early days 
of the experiment (F = 11.87, df=2/12, p< 
.005), but no effect whatever during the last 4 
days, As can be seen from Table 3, the early 
effect arose primarily because of the strong ac- 
celeration in HR when the stake per reaction was 
$.04. 


Relations between RT and HR 


It is already clear that the two responses, 
overt and covert, did not respond to the inde- 
pendent variables in parallel fashion. While speed 


of reaction became less and less variable from 
subject to subject and more and more under‘ the 
control of the motivational variables, heart rate 
came, with experience, to respond with a varia- 
bility which could not be accounted for. 

From the data of Tables 2 and 3 one can get 
some notion of the covariation of the two meas- 
ures within any subject. For Sı there was a 
strong inverse relation between HR and speed 
for Days 1-4 combined (rho=—.69, N = 9), 
such that RT was fastest at the lowest heart 
rates; but during the last 4 days of the experi- — 
ment, RT was best at the middle ranks of heart ` 


rate, so that Sı manifested the inverted U relation: | 


hypothesized by activation theory (Duffy, 1962). 
But in Sə there was no relation between the two — 
responses in the early days; and a strong nega- 
tive (tho = —. 70) relation in the last 4 days. 
And in Ss there was no relation, monotonic or — 
bitonic, between HR and RT at any time. These 
results are similar in import to the recent find- 
ings of Kimmel and Schultz (1964), wherein, in 
a group of nine subjects, there were significant 
Positive correlations between GSR and verbal 
Tesponse in two subjects, a significant negative 
correlation in one subject, and no other nonzero 
relations. Given a state of affairs in which vari- i 
ations in both overt performance and covert 

physiological responses may or may not covary 
in linear or curvilinear fashion depending upon 
the subject one observes and the time one ob- _ 
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serves him, it seems unwise to describe any gen- 
eral relation between activation and performance. 
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THE INFLUENCE OF THE PERCEIVED CAUSAL LOCUS OF PARTNER’S 
DEPENDENCE ON THE USE OF INTERPERSONAL POWER + 


JOHN SCHOPLER anp MARJORIE WALL MATTHEWS 
University of North Carolina 


In recent years a number of studies have examined the conditions affecting 
the extent to which a powerful person, at some cost to himself, will help a 
totally dependent partner. This study was conducted to check the hypothesis 
o that a powerful person who perceives his partner’s dependence to be caused 
by “external” (environmental) factors will help more than a powerful person 
who perceives his partner’s dependence to be caused by “internal” (personal) 
factors. 48 Ss participated in a laboratory study which verified the hypothesis. 
The results are discussed in terms of the conditions arousing the norm of 


social responsibility. 


It has traditionally been assumed that the 
person who possesses more power than his part- 
ner will enjoy many interpersonal advantages, In 
a dyad the more powerful person receives def- 
erence and flattery, dominates the interaction 
and, if a conflict arises, is able to resolve it in 
his own favor. The empirical research document- 
ing such effects has been summarized by Schop- 
ler (in press). It has also been assumed that the 
likelihood of securing these advantages is directly 
related to the size of the power advantage pos- 
sessed by one of the participants. In the limiting 
case, if Person A has absolute power over Per- 
Son B, A can resolve every conflict to benefit 
himself maximally, regardless of the consequences 
to the partner, However, as Berkowitz and Dan- 
iels (1963) have suggested a person who has 
absolute power will not necessarily use it and 
may even sacrifice some personal benefit in order 


` This research was supported in part by Grant 
GS-182 from the National Science Foundation. The 
ata were collected by the junior author for a 
cenior honors thesis at the University of North 
Carolina, 


to help his partner. Berkowitz and Daniels con- 
tend the partner’s dependence is a cue for arous- 
ing the norm of social responsibility in the. pow- 
erful person. The norm specifies the limits to 
which self-seeking advantages can be appropri- 
ately exploited without regard for the partner. 
It has been demonstrated that a person with 
absolute power will work harder for (i.e., give 
more help to) a partner who is highly dependent 
on him, compared to one who is only minimally 
dependent (Berkowitz & Daniels, 1963, 1964; 
Berkowitz, Klanderman, & Harris, 1964; Daniels 
& Berkowitz, 1963). 

Other conditions impeding or enhancing the 
help obtained through being dependent have also 
been studied. Dependence is more effective in 
eliciting help if the powerful person likes his 
partner (Daniels & Berkowitz, 1963), has him- 
self received help prior to being in the situation 
in which he has absolute power (Berkowitz & 
Daniels, 1964), believes his own behavior is 
under the partners immediate surveillance 
(Berkowitz & Daniels, 1963), and incurs only 
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a small cost for helping (Schopler & Bateson, 
1965). 

The present study was designed to test an- 
other condition affecting yielding to dependence 
*—the perceived causal locus of the partner’s 
dependence. In the tradition of the Thibaut and 
Riecken (1955) experiment, differences between 
“internal” and “external” causation were studied. 
It was expected that someone who chooses to 
become dependent (“internal” locus) is not 
likely to arouse the social responsibility norm 
because he will be seen as responsible for his 
own fate. The powerful person would be ex- 
pected to offer him little help. In contrast, the 
partner who is forced by circumstances to be 
dependent (a characteristic of every experiment 
in this area) is more likely to be seen as some- 
one for whom the social responsibility norm is 
appropriate. His dependence is not of his own 
making and he may therefore be seen as “de- 
serving” of help. The specific hypothesis to be 
tested was: an individual who perceives his 
partner’s dependence to be caused by the en- 
vironment will yield more than an individual 
who perceives his partner’s dependence to be a 
matter of personal choice, 


PROCEDURE 


Subjects were recruited from an introductory 
psychology course in which participation in research 
was part of the course requirement. Forty-eight 
male undergraduates were used in the experiment in 
groups of four. Subjects were escorted to the experi- 
mental room and were seated in one of four adjoin- 
ing booths, separated by 7 foot high partitions, ar- 
ranged around a semicircle facing a common front. 
Each booth contained a set of headphones, which 
were attached to an Ampex tape recorder. Two 
slots were located in the front of the booth, These 
permitted the sending and receiving of messages. 

Standard procedure. When all booths were occu- 
pied the instructions were played on the tape re- 
corder, The subjects heard that they were taking 
part in a study of “partnerships, particularly of the 
different working arrangements and decisions part- 
ners make.” They heard that two of them would 
be “directors” and two would be “associates,” al- 
though every subject was in fact a “director.” Both 
members of the partnership would be working on 
separate puzzle tasks. Each puzzle represented an 
abbreviated form of a typical crossword puzzle and 
consisted of one horizontal and one vertical row. In 
the director’s puzzle the horizontal row contained 
four boxes and was intersected by a vertical row 
containing three boxes. Three letters were always 
placed in the boxes and therefore only three boxes 
remained blank. The director’s turn came first. He 
would always be given a pool of four letters from 
which he was to finish his own puzzle, He would 
be paid at the rate of $.01 for each letter, but only 
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for completed words. The puzzles had of course 
been pretested to insure their being sufficiently 
simple for all subjects to complete correctly. 

The instructions were then addressed, ostensibly, 
to the “associates.” The associates had similar, but 
shorter, puzzles. To complete them they would have 
to request letters from their directors. The inde- 
pendent variable, whether this was seen as an ex- 
perimental requirement or as an associate’s choice, 
was introduced at this point, and will be described 
in detail below. On each trial the associates were to 
receive a carbon copy of the director’s letters. They 
were entitled to request as many as two letters on 
any, given trial. Associates were also to be supplied 
with message forms on which they would write 
their requests. (These were prepared beforehand by 
the experimenter and were standard for all subjects.) 

Prior to working on the first item, each subject: 
received a “self-description form” presumably com- 
pleted by his associate. It contained innocuous back- 
ground information, along with a reminder of the 
independent variable manipulation. 

This entire scenery was necessary in order to give 
the subject power over a dependent partner and to 
control the cost of helping the partner. Out of a 
total of 15 trials, 12 of the associate’s requests in- 
cluded letters which the subject had placed in his 
own puzzle, Helping the associate therefore cost the 
subject money, because it destroyed a completed 
word, On three trials requests were made for non- 
functional letters in order to maintain the credibil- 
ity of the situation. It was of course essential that 
all subjects completed every puzzle correctly so that 
helping the partner incurred the same cost for all 
subjects. As a check on this requirement, subjects 
were instructed to write their earnings on the puzzle 
slips and to pass their completed puzzles to the ex- 
perimenter. The solution was checked and passed 
back to the subject along with the partner’s request 
slip. If the subject then destroyed a word, in order 
to give the partner a letter, he had to cross out the 
previous total and write in the new amount. 

After the 15 trials were completed, subjects filled 
out a postexperimental questionnaire. It assessed the 
independent variable manipulation and requested 
ratings of the partner and of the task. When these 
had been completed the experiment was finished. 
The purpose of the experiment was explained to each 
subject and arrangements for payment were made. 

Independent variable induction. In order to ma- 
nipulate the perceived locus of the partner’s de- 
pendence an “internal locus” (IL) condition and an 
“external locus” (EL) condition were used. The in- 
duction was introduced during the tape recording, 
repeated in the partner self-description form, and 
again in the message the subjects received after the — 
fifth trial. Subjects in the EL condition heard that 
associates could complete their puzzles only by re- 
questing letters from the directors; “the associates 
have no choice in this matter.’ The taped instruc- 
tions for subjects in the IL condition stated that 
associates had a choice between getting letters from 
a random pool or requesting letters from their , 


. 


` 
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directors. The choice had to be made before the 
first trial and could not be changed. The induction 
was repeated when the subjects saw the self-descrip- 
tion form their partners had completed. The last 
item asked, “Are you clear about the partnership 
arrangement?” In the EL condition the response 
read, 


I think so the experimenter said I had to ask 
my director for letters. It’s the only way I can 
complete the puzzles. 


In the IL condition it read, 


I think so the experimenter said I could either 
get letters from a random pool or ask ygu» for 
some. I figure my bes? bet is asking you. | 


Finally, after the fifth trial the associate in the EL 
*condition sent a message which said, 


I have no choice—please give me as many letters 
as you can. 


In the IL condition it read, 


I decided to ask you—please give me as many 
letters as you can. 


In all other respects subjects in both conditions were 
treated in identical ways. 


RESULTS 


o 

In the first item of the postexperimental 
questionnaire, the subject had to check one of 
three descriptive statements. In turn, these 
stated: the partner had no choice in requesting 
letters, had a choice in requesting letters, or 
that there was no partner. Each of the 48 sub- 
Jects endorsed the appropriate statement for his 
condition,? 

The mean yielding score for each block of 
five trials is presented in Table 1.8 It can be 
Seen that the IL condition produced significantly 
less yielding than the EL condition. Those sub- 
Jects who thought their partners had chosen to 
be dependent yielded significantly less than those 
who thought their partners had no choice. 

In both conditions the yielding scores increased 
during Trials 6-10, compared to Trials 1-5. In 
the EL condition 22 subjects yielded more, 1 
Subject yielded less, and 1 subject the same 
amount. In the IL condition the frequencies 
Were 16, 6, and 2, respectively. Contrasting the 


* Three subjects had to be discarded: one checked 
the incorrect induction and two said they had no 
Partner, 
ate Probabilities are based on two-tailed tests. 
lo e variances in the IL condition were significantly 
seis than in the EL condition. When two of the 
Ho for unequal population variances recommended 
2 Winer (1962, pp. 36-39) were calculated, the 
MBnificance levels remained unchanged. 
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TABLE 1 


Mean YæLDING Score By BLocks OF TRIALS 
AND £ VALUES FOR BETWEEN-CONDITION 


COMPARISONS 
. 
Trials 
Condition Total 
1-5 6-10 11-15 
Internal locus 3 1.2 8 23 
External locus | 1.1 3.3 2.7 71 
t 2.16* | 5.49** | 3.57** | 5.36** 
x ‘05. 
33 £85, 


subjects who increased their yielding with those 
who remained the same or who decreased, pro- 
duced a highly significant effect for the EL con- 
dition and a marginally significant one for the IL 
condition, This increase undoubtedly reflects the 
effect of the note requesting more help, which 
was sent after the fifth trial. 

The postexperimental questionnaire provided 
additional information consistent with the main 
hypothesis. The subjects in the EL condition, 
compared to those in the IL condition, saw their 
partners as less competitive (x? = 9.41, df=1, 
p<.01), more reasonable (x?= 5.37, df= 1, 
p <.05), and as having had more influence (x? 
= 5.33, df=1, p<.05). The two conditions 
did not differ with respect to ratings of the diffi- 
culty of the task or the enjoyment of the experi- 
ment. 

The postexperimental questionnaire also con- 
tained one item relevant to the speculation that 
the social responsibility norm would be aroused 
less in the IL condition than in the EL condi- 
tion. This was an open-ended item which asked 
the subject to state his reasons for whatever 
actions he had taken toward his partner. On the 
assumption that the arousal of the social re- 
sponsibility norm would lead subjects to explain- 
ing their behavior by reference to the partner, 
while an absence of the norm leads to explana- 
tions containing only self-references, responses 
to that item were coded according to whether 
they made only self-references or whether they 
also included reference to the partner. In keep- 
ing with the speculation, the IL subjects made 
significantly more self-references and fewer part- 
ner references than the EL subjects (x? = 10.22, 
df=1, p<.01). 

Perception of the locus of the partner’s de- 
pendence seems clearly to relate to the extent of 
help he can obtain. If the partner is seen as a 
victim of circumstances he will be helped more 
than if he is seen as being dependent of his own 
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volition, It has been assumed that voluntary de- 
pendence lessens the salience of the social re- 
sponsibility norm. It is likely that this effect has 
an important counterpart on decisions which are 
„made in such contexts as child rearing or punish- 
ing deviant behavior. The criminal who is seen 
as a victim of circumstances is more likely to 
elicit sympathetic, helping responses, than one 
who is judged to have voluntarily committed the 
deviant act. 
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MOOD CHANGE AS A FUNCTION OF STRESS OF PROTAGONIST 
AND DEGREE OF IDENTIFICATION IN A FILM-VIEWING 
SITUATION * 


PERCY H. TANNENBAUM anv ELEANOR P. GAER 2 
University of Wisconsin 


After a period of initial buildup of stress for the Protagonist the ending of a 
short film was manipulated to represent different degrees of stress resolution— 
a happy ending in which this stress is reduced, a sad ending in which it is 
increased, and an indeterminate ending. A fourth control group received no 
ending at all. Ss exposed to the respective film endings reported significant 
degrees of the corresponding stress state. Ss identifying more with the hero 
experienced significantly more stress induction than those identifying less, The 
same tendency—more of the corresponding stress state within a condition in 
high vs. low identifiers—existed in the stress resolution phase, although the 
difference was significant only in the happy-ending condition. 7 


The capacity to induce varying kinds and de- 
grees of emotional reaction has been attributed 
to dramatic presentation, both in popular folk- 
lore and by students of persuasive and esthetic 
communication. We speak of “being moved” by 
a stage or screen play, and the presence of vio- 


1 This research was conducted as a project of the 
Mass Communications Research Center, and was 
supported in part on a grant from the Graduate 
Research Committee with funds supplied by the 
Wisconsin Alumni Research Foundation. It consti- 
tuted part of the second author’s master’s thesis re- 
quirements. 

2 Presently a graduate student at the University 
of Ilinois. 


lence on television has recently generated con- 
siderable debate and some research regarding its 
relative instigational or cathartic properties (cf. 
Berkowitz, 1962, 1964). Such a concern is par- 
ticularly acute when children are involved, and 


is underlined by reports of stress arousal in chil- ” 


dren during the viewing of a dramatic film (e.g 
Dysinger, 1933; Thomson, 1959). 

: What prompts and promotes such emotional 
involvement and reactivity? A common thesis 
has been that advanced by Maccoby and Wilson 
(1957) who suggest that as the viewer becomes 
absorbed in the drama, he identifies with a main 
character and “covertly reproduces” many ele- 
ments of the perceived behavior and emotions of 
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that character. Similarly, Emery and Martin 
(1957) claim the viewer “tends to suffer, feel re- 
lief and affection as the figure is shown to be ex- 
periencing them,” The theoretical rationale is 
that, through the mechanism of identification, the 
viewers vicariously experience emotions similar 
to that of the protagonist. 

The present study was undertaken to test cer- 
tain consequences of such a theoretical position 
by varying the protagonist’s stress through ma- 
nipulated film endings, and examining for varia- 
tions in the stress state of the viewer. If, after 
an initial buildup of tension for the film’s, hero, 
his stress is resolved,ethe audience should report 
corresponding moods of reduced stress. If, on 
the other hand, his stress is heightened or main- 
tained, the viewers should report experiencing 
more depressive and stressful mood states. 

Moreover, if identification is the main mecha- 
nism linking viewer to protagonist, then viewers 
with a higher degree of identification with the 
hero should demonstrate more of the appropriate 
mood state than those who identify less. In the 
present study, we were not able to subject this 
variable to direct experimental manipulation, but 
we did obtain an identification measure antece- 
dent to the manipulated endings, and thus were 
able to contrast between different degrees of 
identification within each of the experimental 
groups. 


METHOD 
Subjects 


Male and female students in an introductory psy- 
chology class at the University of Wisconsin served 
as subjects in the experiment. All subjects were 
tested in their respective quiz-section meetings, a 
Separate quiz section for each of the four experi- 
mental conditions. A total of 123 subjects was origi- 
nally used. 


Film Materials and Experimental Conditions 


The film segment employed was a special short, 
distributed under the title Due Process of Law De- 
nied, which is actually a 30-minute cutting from 
one of Hollywood’s classics, The Ox-Bow Incident, 
issued in 1943, This film segment was selected for a 
number of reasons: its high dramatic quality; it ap- 
Peared to provide ample opportunity for identifica- 
tion with a well-defined hero; it presented a highly 
Stressful situation for the protagonist; and it also 
lent itself readily to minor editing in order to create 
the desired experimental conditions. That it was able 
to meet these criteria within a relatively short film 
Sequence also contributed to its usefulness. 

Although the film is a Western in setting, it is 
not stereotyped in plot and characterization, and is 
marked by rather fine acting. The cutting opens with 
a posse being formed to track down some rustlers 
who had murdered a local rancher and stolen his 


613 


cattle. Some conflict is apparent, with several mem- 
bers urging a delay until the sheriff arrives. How- 
ever, they are not heeded, and the posse subsequently 
encounters three men—the young protagonist (Don- 
ald Martin), his father-in-law, and a Mexican helper 
—who have the branded cattle in their possessiot. 
Martin claims that he bought the cattle from the 
rancher earlier that day. He is not believed but is 
adamant in his claim of innocence. The posse pre- 
pares to hang the trio, over strong objections from 
several members who again argue that they should 
await the sheriff’s arrival and have the men stand 
trial in a proper court of law. As Martin is being 
dragged to the rope he struggles and pleads. 

The above sequence takes approximately 20 min- 
utes to portray, and was shown to all subjects in 
each group. Subsequently, after a short testing in- 
terval, the three experimental groups were shown 
their respective endings. 

The sad-ending condition has the posse agreeing to 
Martin’s pleas to wait until morning. However, the 
sheriff does not arrive with the dawn and there is 
again agitation and conflict about proceeding with 
the hanging on the spot. A vote is held, the majority 
ruling to hold the lynching. The trio pleads and prays, 
but to no avail. The ending took approximately 7.5 
minutes, and is similar to the actual film. 

The happy-ending condition proceeds the same as 
the above until the time of the vote. The film was 
spliced at this point to a later sequence (not included 
in the sad condition) in which the posse, riding back 
to town, meets up with the sheriff, who tells them 
the rancher was only wounded and that the real 
rustlers have already been apprehended. This end- 
ing, lasting a little more than 5 minutes, leaves the 
clear impression that Martin and his companions 
are let free. 

In the indeterminate ending the scene is stopped 
after the men wait through the night for the sheriff’s 
arrival but prior to the vote. The viewer cannot de- 
termine the outcome, the ambiguity about whether 
the hanging will take place or not being quite pro- 
nounced. The ending lasted about 2.5 minutes. 

In the fourth condition, a no-ending control group 
saw all of the initial segment and then performed a 
variety of unrelated tasks (simple arithmetic, cross- 
ing out letters in a paragraph, and digit-symbol sub- 
stitution) for approximately 7.5 minutes. 


Mood Stress Measure 


The measure of mood change, the main dependent 
variable in the study, was derived from ratings on a 
set of 18 scales, selected from the work of Green 
and Nowlis (1957). Each scale is defined by a single 
adjective, the subject being asked to rate the appro- 
priateness of the adjective to his mood at that time 
in one of five categories—definitely applies, some- 
what applies, slightly applies, undecided, and defi- 
nitely does not apply. 

The selected scales, and the factors they repre- 
sented, had been shown in earlier research on films 
(Green & Nowlis, 1957; Nowlis, 1959) to be sensi- 
tive to variations in induced mood states. Since the 
direction of change could be specified ahead of time 
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in terms of higher or lower mood stress, and since 
each scale was scored on the same equal-interval ba- 
sis, a single composite score was derived for each 
subject at each test period—the higher the score, the 
greater the degree of stress. 


Identification Measure 


The assessment of degree of identification of a 
subject with the film’s protagonist was based on an 
index of the similarity in the subject’s perception of 
himself and of the hero, Martin, in the manner sug- 
gested by Lazowick (1955). Judgments of these two 
concepts, along with several others representing other 
characters in the film, were obtained from each sub- 
ject on a set of 14 semantic differential scales se- 
lected on the basis of previous research (cf. Osgood, 
Suci, & Tannenbaum, 1957) to be representative of 
the major dimensions of connotative judgment and 
deemed sensitive to the judgments of individual 
characters. The generalized distance function, D, be- 
tween the two profiles is taken as a measure of pro- 
file similarity or difference, and hence as a measure 
of identification—the smaller the value of D, the 
higher the degree of identification. 


Procedure 


Subjects were told at the beginning of the test 
session that the study was designed to determine if 
a set of measures “will give us valid reactions to 
films.” Each subject responded on the mood scale 
before the movie was shown (To) being urged to 
give “honest” ratings of their mood “at this mo- 
ment.” They were then exposed, in their separate 
group sessions, to the initial film segment. At this 
point, the film was stopped, and the second mood 
scale administration took place (T:). The semantic 
differential ratings, required for the identification 
index, were also obtained at this juncture. Each ex- 
perimental group was then exposed to its respective 
film ending, while the control group performed its 
assigned chores, followed by a third administration 
of the mood scales (Tz), To help maintain the alleged 
purpose of the study, several questions were asked 
of the subjects at Ta inquiring about their reaction 
to the film. These included a key query: “Have you 
seen the full-length movie from which this segment 
was taken?” 


RESULTS 


Affirmative responses to this last question 
about having seen the film before were used to 
eliminate subjects whose previous viewing and 
recall of the film might contaminate the intended 
experimental manipulations. Of the original 123 
subjects, 31 were discarded on this basis, Since 
the resulting differences in group size were rather 
minimal, all groups were reduced to the size of 
the smallest (W = 20), by randomly discarding 
cases where required, to facilitate the computa- 
tional work. 

‘ Prior to the data analysis, the various distribu- 
tions of the identification (ID) measures ob- 


Brier ARTICLES 


tained at T} were examined to determine the 
best manner of subdividing our experimental 
groups. A simple dichotomy of high versus low 
identifiers was decided upon—partially because 
of the relatively small sizes of the experimental 
groups, and partially because the distributions 
generally conformed to the normal curve. Since 
the distributions and median for each group were 
highly similar to those for all subjects, the di- 
vision was made on the basis of an above- versus 
below-median split on ID within each group. 


Stress Induction 


The main purpose of the first phase of the film 
exposure was to acquaint all subjects with the 
protagonist and his plight. Since Martin is shown 
to be undergoing extreme stress at the T, stage, 
it was expected that most subjects would exhibit 
an increase in mood stress level at this point. A 
sign test (Siegel, 1956) confirmed this expecta- 
tion, indicating a significant (p < .01) number of 
subjects increasing in mood stress. 

It was also hypothesized that subjects indicat- 
ing a higher degree of identification with Martin 
at this point would show more stress than those 
with lesser identification. To examine this pre- 
diction, the T} — Tọ change scores were used as 
the main dependent variable in a 4 X 2 design 
of the four experimental groups each subdivided 
into high and low ID. Moreover, since examina- 
tion of the Ty scores revealed some substantial 
differences in these original mood ratings, and 
since it was felt that some of the indicated 
change could be a function of initial position, 
an analysis of covariance was conducted of the 
Tı— To change scores, using the Tọ score as 
the covariate. 

The results supported the prediction for the 
high (adjusted M = 10.58) identifiers changing 
more than the low (5.90), the difference between 
the two being significant (F=6.97, df = 1/11, 
p<.02). Similarly, the interaction was not sig- 
nificant. 


Stress Resolution 


Here the concern is with the degree of mood 
change as a function of type of film ending and 
of degree of identification. Again, an analysis of 
covariance was conducted, this time using the 
Tə— T; change scores as the dependent vari- 
able with the T, score as the covariate. Table 1 
presents the adjusted means for the various con- 
ditions, while Table 2 presents the results of the 
covariance analysis. 

The results indicate a highly significant differ- 
ence in mood stress change as a function of the 
film endings. The sad-ending group actually in- 
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creases in stress level, while the other three 
groups show decreases, The stress reduction is 
most pronounced in the happy-ending group, but 
relatively moderate in the indeterminate-ending 
and in the no-ending control groups. The results 
of a Duncan (1955) range test applied to four 
group means are also included in Table 1, and 
indicate a significant difference between all groups 
except between the indeterminate and control. 
The fact that there is no significant difference 
between high and low identifiers across all groups 
is really not unexpected. The theoretical predic- 
tion is for differential change between high and 
low ID within eachegroup but not across all 
groups, the latter representing a canceling-out, of 
positive and negative changes within the different 
groups. i 
This differential direction and degree of change 
account for the significant interaction, and is 
made more apparent when we compare between 
high and low ID within each group separately. 
Within the sad-ending condition, both subgroups 
show an increase in stress. As the identification 
theory would predict, the increase is more 
marked for the high ID subgroup, but the dif- 
ference here (evaluated by ¢ test using the pooled 
error term from the analysis of variance) is not 
significant (= 1.26, p> .20). In the happy- 
ending condition, both subgroups reduce their 
mood stress, and, as predicted, the reduction is 
significantly greater for the high identifiers (t = 
2.24, b <.05). In the indeterminate-ending con- 
dition, with no apparent resolution obtaining, the 
high identifiers actually increase their stress level 
slightly while the low ID subjects show a de- 
crease. The difference, however, does not reach 
Statistical significance (f= 1.27, p> .20). We 
also find expected changes in the control group 
where the passage of time engaged in simple un- 
related tasks is accompanied by a decrease in 
mood stress—somewhat, but not significantly 


TABLE 1 


Apjustep Mrans IN Moop CHANGE ACCORDING 
TO DIFFERENT Firm ENDINGS AND 
IDENTIFICATION LEVEL 


Film ending High ID Low ID Total 
Sad +4.84 41.63 | +3.240 
Happy —9.45 3.71 — 6.58, 
Indeterminate | + .86 —2.40 — 7 
Control —136 —3.42 —2.3% 

Total —1.28 —1.97 —1.63 


cunote-—Positive changes indicate an increase, and negative 
M ses 2 decrease, in mood stress. ifi 
To i i x F 

cantly different at the 03 bus ee S Prip are nO sem 
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TABLE 2 


ANALYSIS OF COVARIANCE OF Moop CHANGE 
(T: — T, CHANGE WITH T As COVARIATE) 


Source af MS Foie. 
Between ID (A) 1 6.63 — 
Between endings (B)| 3 318.72 9.58** 
AXB 3 106.39 3.20* 
Error 71 33.28 

*p <.05. 
**p <.001. 


more so in this case, among the low identifiers 
(t < 1.00). Thus, while the results in ID level 
comparisons within each experimental group are 
in the predicted theoretical directions, the ob- 
tained differences are significant only in the case 
of the happy-ending group. 


Discussion 


The findings of the present study provide sup- 
port for the theoretical position that viewers of 
films are apt to experience similar, albeit re- 
duced, emotional reactions to those apparently 
experienced by film characters. Viewers exposed 
to different film endings reflecting different kinds 
of stress resolution of the central film character, 
reported correspondingly different kinds and de- 
grees of stress change within themselves. The 
sad and happy endings produced the most ex- 
treme—and opposite—changes, while the two 
conditions reflecting uncertainty of the outcome 
led to rather moderate stress reduction. 

It should be noted that in all conditions where 
stress reduction is reported, the reduction is sub- 
stantially less in magnitude than was the origi- 
nal stress induction in the initial phase of the 
film exposure. Even in the happy-ending condi- 
tion, where the mood stress reduction is greatest, 
it is significantly (p < .025, one-tailed Wilcoxon 
matched-pairs test, Siegel, 1956) less than the 
corresponding stress induction. Such a finding, it 
may be noted in passing, would tend to support 
the contention of some authorities (e.g., Ban- 
dura, Ross, & Ross, 1963; Berkowitz, 1962) re- 
garding the instigational rather than cathartic 
properties of filmed aggression. Even when the 
protagonist stress is seemingly resolved through 
the manipulation of a “happy, Hollywood-type” 
ending, there is still a residue of viewer stress— 
just as there is of aggressive tendencies even 
when the “bad-guy” aggressor is punished (Ban- 
dura et al., 1964b) or when the aggression ap- 
pears justified (Berkowitz, 1964). 

This experiment also provides some support 
for the thesis that identification with the pro- 
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tagonist is a mechanism through which such vi- 
carious emotional experience is accomplished. 
This is clearest in the stress induction phase of 
the present study where relatively high identifier 
subjects reported significantly more mood stress 
than those identifying less. In the stress resolu- 
tion phase, we find that while the results are con- 
sistent in direction with the theoretical predic- 
tions, they fail to reach accepted levels of sta- 
tistical significance, except in the happy-ending 
condition. The relatively small numbers of sub- 
jects, along with the high degree of variance 
within conditions generally characteristic of the 
types of measures such as the self-report mood 
scales employed here, may in part account for 
the lack of more substantial differences. 

Even if the findings were more impressive sta- 
tistically, the present investigation would still 
fail to provide more than circumstantial evidence 
for the identification theory. This is because 
the critical identification variable was not inde- 
pendently and systematically manipulated. Under 
these conditions, one could just as readily ac- 
count for the obtained differences between ID 
level in the initial stress induction phase in in- 
verse fashion: that a higher degree of stress in- 
duction led to—rather than was caused by—a 
higher degree of identification. To be sure, such 
a claim is less applicable to the stress resolution 
phase, since the ID measure was at least ob- 
tained prior to the experimental manipulations of 
film endings. But even here, lacking independent 
manipulation of ID level, the evidence is still 
circumstantial at best. 

A major difficulty is that the identification 
variable does not lend itself readily to experi- 
mental manipulation. For one thing, it is a rather 
amorphous concept in the psychological litera- 
ture, not always made distinct from such re- 
lated concepts as imitation (cf., e.g., the differ- 
ences in approach of Lazowick, 1955, and of 
Bandura et al., 1964a) or even pure attitude to- 
ward the model (as can be attested to by at- 
tempting to manipulate one without affecting the 
other). Zander (1958) has suggested a method 
of telling subjects that they are more or less 
similar to a model on a particular set of charac- 
teristics, Another Promising avenue, particularly 
if young children are involved, is through ma- 
nipulating the apparent success of the leader or 
model (eg., Bandura et al., 1964b; Zajonc, 
1954). Such attempts would seem to be indicated 
in view of the growing importance of identifica- 
tion as an explanatory concept in social learning 
(cf. Bandura, 1962). It is certainly indicated if 
the identification theory accounting for the vi- 
carious experiencing of emotion through film 
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viewing is to be subjected to more direct and 
critical examination. 
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RELATIONSHIPS OF TEST ANXIETY AND HOSTILITY TO 
DESCRIPTION OF SELF AND PARENTS? 


IRWIN G. SARASON anp KARL P. KOENIG 
University of Washington 


Questionnaire-assessed test anxiety and hostility were related to the verbal 
productions of male and female college students. Each S’s task was to describe 
himself, and his mother and father. These descriptions were content analyzed 
for positive and negative evaluations of self and parents. Test anxiety was 
found to be related to the incidence of negative self-references, but not to 
negative references to parents. Hostility was found to be unrelated to self- 
references, Fewer positive references were emitted by high than low hostility 


scorers, o 


If, as Cronbach (1960) has suggested, indices 
of personality should tap “typical behavior,” 
then one might hope for significant positive re- 
lationships between different measures of per- 
sonal characteristics, If a positive relationship 
were found between a simple and convenient, 
but highly structured index of personality and 
an index derived from a freer and more complex 
situation which possesses considerable face va- 
lidity, the use for prediction purposes of the 
more convenient index would seem quite defensi- 
bleo Put another way, if one wishes to regard a 
particular personality characteristic inferred from 
verbal behavior as an independent variable, then, 
if a positive relationship obtains between rela- 
tively convenient and inconvenient indices of 
the characteristic, use of the simple one would 
seem indicated, 

An example of what we are referring to may 
be found in recent studies in which negative self- 
references on a relatively simple true-false ques- 
tionnaire were related to negative self-references 
in a free verbalization situation (Sarason & 


„ Ganzer, 1962, 1963). The questionnaire was the 


Test Anxiety Scale (TAS), which has been in- 
terpreted as reflecting the tendency to make 
negative evaluations of one’s behavior in testing 
Situations (Sarason, 1960), The free verbaliza- 
tion index was the tendency to make negative 
Self-references (e.g, “I am a lazy person”) in 
the course of self-description in an interviewlike 
Situation. It has been shown that high TAS 
Scorers emit more negative self-references in 
Such a situation than do low TAS scorers. 

This result raises some interesting questions. 
One of these has to do with the generality of the 


_ * This research was facilitated by State of Wash- 
ington Initiative 171 funds for Research in Biology 
and Medicine and by Grant M8830 from the Na- 
tional Institute of Mental Health, United States 
Public Health Service. 


tendency to make negative self-references. Do 
TAS scores reflect the typical behavior of mak- 
ing negative self-references or do they reflect 
the tendency generally to make negative refer- 
ences? Current interpretations of anxiety have 
emphasized individual differences in self-involve- 
ment, self-derogation, and the perception of one- 
self in an unfavorable light. If, however, it were 
found that high-anxious scorers had as strong a 
tendency to describe others in a negative fashion 
as to describe themselves in a negative fashion, 
such an interpretation would be subject to ques- 
tion. 

In the present investigation, the verbal be- 
havior of high and low TAS scorers was com- 
pared. The comparisons, however, were not re- 
stricted to self-description. In addition to talk- 
ing about himself, each subject was asked to 
describe two other persons, his mother and 
father. If anxiety is a measure of specific kinds 
of self-preoccupation one might expect that TAS 
would be related to persons’ self-descriptions 
but not necessarily to descriptions of others. On 
the other hand, if TAS is simply an index of the 
tendency to make negatively toned comments, 
then it might be expected that not only would 
high scorers describe themselves more negatively 
than low scorers, but that they would describe 
their parents more negatively as well. The pres- 
ent experiment was designed to shed light on 
these possibilities. 

One other personality measure, besides test 
anxiety, was employed in the experiment. Its 
inclusion resulted from evidence that groups of 
persons differing in scores on a true-false Hos- 
tility (H) scale do not differ in their emissions 
of negative self-references (Ganzer & Sarason, 
1964; Sarason, Ganzer, & Granger, 1965). Since 
one can be critical of others without necessarily 
being critical of oneself it seemed worthwhile to 
compare high and low H scorers in terms of self- 
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descriptions and descriptions of others, in this 
case parents. Comparing the free verbalizations 
of subjects differing in TAS and H provided the 
possibility of finding out if persons differing in 
anxiety might be divergent in self-description 
and persons differing in hostility might be di- 
vergent in the description of others. 

In addition to studying the relationship of 
individual differences to self-description, the 
present study also involved one experimental 
variable, It was felt that a favorable evaluation 
of the subjects’ self-descriptions and descriptions 
of others might have an influence on their subse- 
quent verbal behavior. Previous research has 
shown that evaluative statements by the experi- 
menter exert a significant influence on the sub- 
jects’ performance and learning (Sarason & 
Harmatz, 1965). Each subject in the present 
experiment was asked to describe himself and 
others twice. For half of the subjects a favorable 
evaluation of his descriptive ability was given 
prior to the second sequence of speaking. The 
remaining subjects had a control condition in 
which they did not receive this favorable evalua- 
tion, 

The experiment reported here, then, was con- 
cerned with the relationship of two individual 
difference measures (actually there were three 
individual difference measures since sex was in- 
cluded as a variable), and one manipulated 
factor to verbal behavior in an interviewlike 
situation. 


METHOD 
Subjects 


The subjects were 24 male and 24 female under- 
graduate students enrolled in introductory psychol- 
ogy classes at the University of Washington, High 
and low test-anxiety scorers were drawn from the 
upper and lower thirds of the TAS score distribu- 
tions for males and females. When the subjects’ 
verbalizations were compared in terms of hostility, 
they were recategorized in terms of H scores. This 
H categorization was based on the median rather 
than extreme thirds of the distribution, In order to 
achieve equal Ns in the various treatment combina- 
tions, 8 subjects were eliminated in the H analyses. 
Thus, the W per cell was 6 for TAS comparisons 
and 5 for the H comparison. It should, of course, be 
noted that conclusions drawn on the basis of the 
H analyses must be regarded as weaker than those 
drawn from the TAS analyses since the former ones 
did not involve independent sampling. In Previous 
research, correlation between paper-and-pencil anxi- 
ety measures and H of the order of + .20 have been 
obtained (Sarason, 1961). 

Preliminary instructions to the subjects were simi- 
lar to those used by Sarason and Ganzer (1962). 
Each subject talked during eight 3-minute Periods. 
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These periods were defined by the topic the subject 
was asked to talk about. The first four periods in- 
volved the subject describing himself generally, 
himself in school situations, his mother, and his 
father. 

Following these four periods, half of the subjects 
received this communication from the experimenter; 


Well, this completes the first part of my study. 
[Pause] That was excellent! You really did a 
good job in your descriptions. I can see that 
you're a good observer of yourself and of the 
people around you. You seem to have a grasp of 
what makes people behave in the way that they 
do, 


+ 

This communication was considered to be a reas- 
surance condition, The remaining subjects received 
this communication: £ 


Well, this completes the first part of my study. 
We’ll go on to the next in just a moment, 


This was considered a nonreassurance or control 
condition. 

Following either the reassurance or control condi- 
tion all subjects again described themselves, their 
school experiences, their mothers, and their fathers. 

The content of each tape-recorded 3-minute seg- 
ment was rated without knowledge of the subjects’ 
TAS or H scores. (Similarly, the experimenter did 
not know the TAS and H scores of the subjects.) 
Content was categorized four ways for each topic: 

1. Positive references (those which describe the 
person under discussion in a favorable or desirable 
light) 

2. Negative references (those which describe the 
Person under discussion in an unfavorable or unde- 
sirable light) 

3. Ambiguous references (those which are relevant 
to the topic under discussion, but not assignable to 
the categories of positive or negative statements, 
e.g., self-references which are neither positive nor 
negative toned) 

4. Other references (those which are not relevant 
to the assigned topic, e.g., a reference to one’s social“ 
life while discussing one’s father) 

Using statement categorization criteria similar to 
those employed by Rogers (1960) and Sarason and 
Ganzer (1962), two raters independently analyzed 
four of the tape-recorded sessions. One of the raters 
was quite familiar with the system of content analy- 
sis employed; the other was not. In order to train 
the naive rater, the two raters collaborated in ana- 
lyzing four tapes (two of male, two of female sub- 
jects). However, for the four tapes used to establish 
interrater reliability, the two raters had no contact 
with each other. 

The reliability coefficients, over all topics, were 
-94 and .86 for the protocols of the two female 
subjects. For the male subjects, the Pearson product- 
Moment correlations were 82 and .75. These relia- 
bilities seemed sufficiently high to permit one of the 
Taters to analyze tapes of the remaining subjects 
himself. Two of these tapes were randomly selected — 


o 


o 
o 


( 


y 


è 
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and scored twice by this rater with a time interval 
between scorings of 6 months. The two reliabilities 
were .90 and .97. 


RESULTS 


The results were analyzed by means of 2 X 
2X2X 2 analyses of variance—High and Low 
TAS or H X Sex X Reassurance-Control Condi- 
tion X Two Sets of Topic Descriptions (trials). 
Analyses were made both for the numbers of 
responses in the four content categories and 
percentages of responses in the categories (i.e., 
responses in a given,category divided by “the 
total number of responses during a given 3- 
„minute segment X 100). Since the two methods 
of data analysis yield similar results, the find- 
ings will be presented primarily in terms simply 
of numbers of responses. 


Test Anxiety 


A series of analyses were made of the total 
number of responses and the numbers of posi- 
tive, negative, other, and ambiguous references. 
(Results concerning ambiguous references will 
not be presented here.) 

Tħe analysis of total number of responses 
yielded two significant Fs, those for Trials (p < 
.005) and for the TAS X Trials interaction (p 
<.025). The main effect was due to a smaller 
number of responses in the second as compared 
with the first set of topic descriptions. (This 
effect was observed both overall and for most 
individual response categories, e.g., positive ref- 
erences,) The interaction resulted from the fact 
that during the first set the two TAS groups did 
not differ, but during the second set low TAS 
Subjects gave fewer responses than the high TAS 
group. 

_Among other significant findings in the overall 
(ie., overtopics) analyses were these: 

1. Female subjects made more positive ref- 
erences than did male subjects (p < .05; M = 
21.02 and 15.21, respectively). 

2. High TAS subjects made more negative 
Statements than did low TAS subjects (p < .025; 
M = 13.13 and 7.79, respectively). 

3. There was a significant TAS X Sex inter- 
action for other responses (p < .05). Low TAS 
males made more other responses than high TAS 
males; the reverse was the case for females. 

The major results for self-description were as 
follows: 

„1. More negative self-references were made by 
high than low TAS subjects (p <.01; M = 4.00 


‘e nd 1.85, respectively). 
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2. There was a significant TAS X Trials inter- 
action for total number of statements (p <.025). 
Low TAS subjects made more statements of all 
types during the first trial than did high TAS 
subjects; the reverse was the case for the second 
trial. 

These findings were statistically significant for 
academic self-descriptions: 

1. High TAS subjects made a greater number 
of negative academic self-references than did 
low TAS subjects (p < .01; M = 3.48 and 1.77, 
respectively). 

2. Females emitted more positive self-refer- 
ences than did males (p< .05; M=4.23 and 
2.75, respectively). 

3. Males made more references to others dur- 
ing the periods calling for academic self-descrip- 
tion than did females (6 <.05; M=.83 and 
.31, respectively). 

Among the results for subjects’ descriptions of 
their fathers and mothers were these: 

1. Whereas high TAS subjects described them- 
selves more negatively than low TAS subjects, 
high and low TAS groups did not differ with 
regard to negative references to their fathers and 
mothers. i 

2. Females made more positive references to 
mother than did males (p<.025; M = 6.90 
and 4.67, respectively). 

3. There was a significant TAS X Sex inter- 
action for references to others during the periods 
devoted to description of mother (p< .025). 
For males, low TAS subjects made more other 
references than did high TAS subjects; vice 
versa was the case for females. 

4. For fathers, there was a significant Reas- 
surance X Trials interaction (p < .025). Under 
the control condition, more references to per- 
sons other than father were made during the 
second than during the first trial; under reas- 
surance, vice versa was the case. 

5. There was a significant TAS X Sex X Reas- 
surance interaction for negative references to 
father (p < .05). This seemed attributable largely 
to the fact that high TAS females made more 
negative references to father under the reassur- 
ance than control condition; vice versa was the 
case for low TAS females. 

6. The TAS X Sex X Reassurance interaction 
was significant for total number of statements 
made during the two periods devoted to descrip- 
tion of father (p < .05). This seemed attribut- 
able primarily to a greater number of state- 
ments by high TAS females under the control 
condition. Just the opposite was the case for 
low TAS females under these conditions. 
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Hostility 

The major results of the analyses were these: 

1. For total number of responses, high H sub- 
jects made fewer positive references than did 
òw H subjects (p < .025; M = 15.10 and 22.05, 
respectively); interaction (p < .05). 

2. For positive references during the period 
devoted to academic self-description, there was a 
significant H X Sex X Reassurance interaction 
(p <.05). Two groups contributed primarily to 
this result. Female low H subjects made more 
positive academic self-references under the reas- 
surance condition than did female high H sub- 
jects, and male low H subjects under the control 
condition made more positive academic self- 
references under the control condition than did 
male high H subjects. 

3. High H subjects made fewer positive state- 
ments about their fathers than did low H sub- 
jects (p < .01; M = 5.00 and 8.15, respectively). 

4. For negative references to father there was 
an interaction between H and Reassurance (p < 
.05), Under reassurance, low H subjects made 
more negative references to father than did high 
H subjects. Under the control condition, just 
the opposite was the case. 

5, For references to others, during the period 
devoted to description of fathers, there was a 
significant H X Sex interaction (p <.05). Low 
H males made more such references than all 
other groups combined. 

6. Only one significant F pertained to the sub- 
jects’ descriptions of their mothers. This was the 
interaction between H and Trials for references 
to others during the periods devoted to descrip- 
tion of mother (p < .05). Low H subjects made 
more references to others during the second 
period than did all other groups during either 
the first or second period. 


Discussion 


One of our aims was to determine the rela- 
tionships of two questionnaire-inferred person- 
ality characteristics, test anxiety and hostility, to 
persons’ descriptions of themselves and others, 
As was mentioned earlier, free verbalization 
studies have tended to focus on one response 
class, negative self-references. Test anxiety, but 
not hostility, has been found to be related to 
emission of this type of response. The present 
study corroborates this finding. High test-anxious 
subjects tended to be more self-critical and self- 
debasing in their descriptions of themselves than 
low test-anxious subjects. Groups differing in 
hostility scores did not show significant differ- 
ences in emission of negative self-references, 
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The present findings suggest, however, tha 
assessed hostility is related to certain aspects of 
verbal behavior. The overall results may be 
taken as suggesting that while TAS scores are 
related to the incidence of negative self- refer- 


positive references to others. High H scorers 
were found to be less likely to make favorable ~ 
comments about their fathers than low H scorers, 
Hostility as a main effect was found to be un- 
related to the subjects’ descriptions of them- 
selves, Interestingly, test anxiety while related ` 
to self-descriptions was unrelated as a main effect” 
to the subjects’ descriptions of others. 

The results for test anxiety are consistent 
with previous research and speculations in thal 
high TAS subjects seem to be more self-centered 
and self-critical and seem to communicate more 
often their personal preoccupations than low ~ 
TAS subjects. This greater need on the part of 
high TAS subjects to communicate was perhaps i 
reflected in the finding that while the total num- ~ 
ber of responses made by low TAS subjects de- 
clined sizably from the first to the second verb- 
alization period, the decline for high TAS sub- i 
jects was negligible. The fact that high TAS © 
scores were consistently associated with unfayor- 3 
able self-descriptions, and, in particular, un- 
favorable descriptions of the academic self, but T 
not with descriptions of others, shows that test- i 
anxiety scores are not a function simply of a 4 
tendency to emit socially undesirable statements. 
Rather, as already suggested, they seem to reflect — 
self-preoccupation and low self-esteem. A 

High-hostile scorers as contrasted with high- = 
anxious scorers do not seem to be especially = 
self-critical. They do seem to be relatively stingy 
in making favorable comments about others, in É 
the present instance, their parents. This is con- 
sistent with a conception of hostility scores as" 
reflections of oppositional or, at least, unrecep- ` 
tive attitudes towards others. It would be inter- A 
esting to compare persons differing in hostility 
with respect to their reactions to reinforcements 


others, 


The third individual difference variable in 
this study, sex, showed a noteworthy and sug- 
gestive consistency in relation to the verbal de- q 
scription task. Females tended to be more posi- 
tive and optimistic in their evaluations of them- 
selves and of others than males. Thus, female 
subjects when compared with male subjects 7 
made more positive statements overall, more 
positive statements about themselves as students, 4 
and more positive statements about their moth- 4 
ers. A number of interesting problems for future 


investigation stem from the question: What is 
the meaning of this sex difference? Is it a func- 
tion of a general difference in response style or 
sets? Are females simply more placid and opti- 
|, mistic than males? If so, what are the social and 
“perhaps, biological antecedents of these differ- 
ences? To what degree might the sex and status 
of the experimenter be related to females’ ap- 
parent optimism? 

While the experimental variable of reassur- 
ance entered into several significant interactions, 
it did not seem to exert a powerful influence 
over the subjects’ verbal behavior. Two facters 
„~ seem important in expetimental efforts to influ- 

ence the atmosphere in which persons describe 
themselves. One is the nature of the atmospheric 
manipulations, and the other is their timing. It 
seems possible that systematic investigation of 
these variables might lead to interpretable influ- 
ences over person description, 

A final word need be said concerning the ob- 
jects of persons’ descriptions. In this experiment 
all of the objects were “significant,” that is, they 
were persons (including the individual himself) 

+ with whom the subjects were highly involved in 
a personal and affectional sense. The tendency of 
high-hgstile subjects to withhold favorable evalu- 
ations of their parents may be either a specific 
tendency in relation to their mothers and fathers 
or a general tendency in relation to other per- 
sons. It would be worthwhile to compare descrip- 
tions of others by groups differing in hostility in 
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Numerous studies of the effect of reinforce- 
g; ment upon attitudes have been reported (Das & 

es 1963; Ekman, 1958; Hildum & Brown, 
_, 1956; Maccoby, Maccoby, Romney, & Adams, 
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a situation where the variable of the personal 
relevancy of the object is systematically varied. 
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VERBAL REINFORCEMENT OF ATTITUDE 


s CHESTER A. INSKO 
University of Hawaii 


Students in an introductory psychology course at the University of Hawaii 
were contacted by telephone and positively or negatively reinforced with 
“good” for agreement or disagreement with a series of opinion statements 

x relating to the creation of a Springtime Aloha Week. Approximately a wk. 
$ later a “Local Issues Questionnaire” was passed out in the introductory class. 
’s of the way through this questionnaire occurred an item measuring attitude 
toward the creation of a Springtime Aloha Week. It was found that verbal 
reinforcement produced an effect on the telephone responses that carried over 
to the attitude questionnaire. These results were interpreted as indicating that 
verbal reinforcement in an interview situation can produce a genuine effect 
upon attitude and not just a transitory effect upon interview responses. 


1961; McGuire, 1957; Scott, 1957, 1959a, 1959b; 
Staats & Staats, 1958; Staats, Staats, & Heard, 
1960; Weiss, Rawson, & Pasamanick, 1963). Of 
these studies two, Ekman, and Hildum and 
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TABLE 1 


ANALYSIS OF VARIANCE OF THE TELEPHONE 
Interview Data 


Source af MS F 
Reinforcement (R) 1 1020.01 15.31** 
Sex (S) 1 110,01 1.65 
Assistants (A) 5 51.69 0.78 
RXS 1 11.69 0.18 
RXA 5 45.78 0.69 
SXA 5 207.58 EN Phi 
RXSXA 5 60.98 0.92 
Total between 23 129.21 1.94* 
Within 48 66.63 

Total 71 
* 
) St 


Brown, were concerned with the effect of verbal 
reinforcement. Ekman found that “good” was an 
effective reinforcer for modifying agree-disagree 
responses to a series of opinion and attitude 
statements regarding capital punishment. The re- 
inforcements were delivered during a structured 
interview situation. Hildum and Brown reported 
that “good” but not “Mm-hmm” was an effective 
reinforcer for modifying agree-disagree responses 
to a series of opinion statements regarding the 
Harvard philosophy of general education. The 
reinforcements were delivered via a telephone 
interview. For both of these studies there is a 
question as to whether verbal reinforcement ac- 
tually produced attitude change or simply modi- 
fied the immediate responses in the interview 
situation. The present study was undertaken in 
an attempt to gain some information on this 
question. 


METHOD 


Six male graduate assistants? each contacted by 
telephone 12 randomly assigned students (6 male 
and 6 female) from the experimenter’s course in 
introductory psychology at the University of Hawaii 
(N = 72). The subjects were asked to either strongly 
agree, agree, disagree, or strongly disagree with a 
series of 14 statements regarding the creation of a 
Springtime Aloha Week.? The statements, which 
were so phrased that agreement with one half and 
disagreement with the other half indicated a posi- 
tive attitude, were assertions of opinion and not 
attitude. The statements did not say that the crea- 


1The author is indebted to Richard A. Haag, 
John W. James, Creighton U. Mattoon, Paul O. Mc- 
Gaffey, Clarence M. Souza, and Frances C, Wong for 
their assistance in collecting the data. 

2 Aloha Week is a festival in Hawaii 


occurrin; 
annually every fall. = 
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tion of a Springtime Aloha Week was good or bad 
but simply that it would lead to certain ends suc 
as more tourists dollars or growing traffic con 


indicating positive attitudes for 6 
males and 3 females) and responses ( 
negative attitudes for 6 subjects (3 males and — 
3 females). Reinforcement was given by saying 
“good” after the appropriate agree or disagree re- 
sponse. 

Approximately 1 week after the telephone inter- 
views were completed the experimenter passed out | 
a “Local Issues Questionnaire” in his introductory 
class, and requested that the 225 students in the 
class cooperate in giving him some information on 
student attitudes toward ‘various local issues. The 
questionnaire contained a series of 15-point attit 
tude scales dealing with such issues as the creation 
of a state subsidized interisland ferry, and higher 
salaries for public school teachers. The attitude 
scales ranged from “definitely in favor of” to “defi- 
nitely opposed to.” The key item concerned with 
the creation of a Springtime Aloha Week occurred 
two-thirds of the way through the questionnaire. FF 


RESULTS 


Each response to each statement in the tele- 
phone interview was given a score between O and 
3, with high scores indicating a favorable atti- 
tude. An analysis of variance of the three’ inde- 
pendent variables, type of reinforcement, sex, 
and assistants, is presented in Table 1. f 

As predicted the total between-groups F and 
reinforcement F are significant. The mean posi- 
tive reinforcement score is 28.22 and the mean 
negative reinforcement score is 20.69. The only 
other significant F is the interaction between 
assistants and sex. This interaction indicates that 
regardless as to the type of reinforcement some 
assistants obtained more positive opinion re- 
sponses from males and some more positive opin- 
ion responses from females. oa 

Twelve of the 72 subjects contacted by tele- 
phone were not present in class when the atti- 
tude questionnaire was passed out the first time. 
The experimenter attempted to get these 12 sub- 
Jects to fill out the questionnaire by requesting 
during the next class period that anyone who 
had not previously filled out the questionnaire 
do so after class. It was truthfully explained 
that this was necessary in order to have a non- 
biased sample. By this method completed ques- 
tionnaires from 10 more subjects were obtained 
giving a total N of 70. One of the missing sub- 
Jects was a male, reinforced for positive attitude, © 
and the other a female, reinforced for negative — 
attitude. Due to the existence of unequal Ws in 
the 24 cells and the nonsignificance of the rein- 
forcement interactions in the previous analysis, 
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the data were simply analyzed with a ¢ test. A 
comparison of the mean positive reinforcement 
score of 10.40 with the mean negative reinforce- 
ment score of 8.20 resulted in a significant ¢ of 
2.03 (p <.05). The mean positive reinforcement 
score is in the area of the scale labeled “‘some- 
what in favor of,” and the mean negative rein- 
forcement score is in the area labeled “uncer- 
tain.” 


Discussion 


The finding of an effect for reinforcement in 
the interview situation is consistent with the re- 
sults of Hildum and Brown (1956) and of Ek- 
man (1958). The additional finding of an effect 
for reinforcement on the attitude questionnaire 
‘gives support to the assertion that verbal rein- 
forcement results in attitude change and not just 
in a temporary modification of responses in the 
interview. From a strictly operational point of 
view what has been demonstrated is response 
generality. The experiment demonstrates gener- 
ality across time, experimenters (the experi- 
menter or graduate assistant), situational context 
(home or classroom), media of stimulation (oral 
or written), media of response (oral or written), 
‘and type of response (agreement-disagreement 
with “opinion statements or favorability-unfavor- 
ability to an attitude object). 
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The findings of the APA’s Project on Scientific Information Exchange in Psychology 
attention to the difficulties of the scientist in establishing and maintaining contact with cur 
work relevant to his interests. Accordingly, the project is sponsoring a series of changes dui 
the next 12-15 months which are designed to enhance scientific communication. These chan 
are planned as limited trials. Moreover, they are wholly supported by the Project and henc 
do not cut into our page allotment. i 

The Editor has agreed to make possible a test of one of the changes, that is, the titles z 
authors of accepted papers will be listed following the last articles in the journal. Such li 
will make it possible for our readers to become aware of research many months in the advance 
of its journal publication. At the end of the year’s trial the results of the trial will be evaluated 
and further consideration will be given at that time as to whether the service is worth continui 

This listing plus those published in the preceding 1965 issues are the entire backlog of manu 
scripts accepted by this journal through May 1965, E 
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EFFECTS OF EXPECTANCY ON WORKING AND 
WAITING FOR LARGER REWARDS* 


WALTER MISCHEL anp ERVIN STAUB ? 


Stanford University 


This study investigated the effects of situational and generalized expectancies 
for success on choices of immediate, less valuable, noncontingent rewards as 
opposed to more valuable contingent rewards. Measures of generalized ex- 
pectancy for success were administered to 8th-grade boys who later worked on 
a series of problems and obtained either success, failure, or no information for 
performance. Thereafter, each S chose between less valuable, noncontingent 
rewards and more valuable rewards: whose attainment was contingent on suc- 
cessful solufions of problems varying in their similarity-dissimilarity to the 
original problems and/or an additional delay period. As predicted, contingent 
rewards were chosen more after success than failure and Ss discriminated be- 
tween specific contingencies. The effects of situational success and failure 
tended to minimize the effects of generalized expectancies. Moreover, in the 
no-information condition children with high generalized expectancies for suc- 
cess chose more contingent rewards than those with low expectancies and be- 
haved like subjects in the success condition. Children with low generalized 
expectancies who received no information about their performance behaved 
like those with similarly low generalized expectancies who had obtained failure. 
Following failure, generalized expectancies for success affected willingness to 
wait for larger rewards even when their attainment was independent of 


a performance, 


Altkough there has been extensive specula- 
tion about the antecedents of delay of grati- 
fication (e.g., Freud, 1959), and widespread 
recognition of the importance of this aspect 
of self-control, experimental investigations in 
this area have been relatively scarce (e.g., 
Block & Martin, 1955; Livson & Mussen, 
1957; Mahrer, 1956). This study is part of a 
Program investigating the antecedents of 
choices in which immediate gratification is de- 
ferred for the sake of larger, more valued but 
Nonimmediate outcomes. The paradigm con- 
fronts subjects with choices between immedi- 


_ ately available, less valued rewards and de- 


layed, more valued rewards, The antecedents 
and correlates of such choices were found to 
be associated with other indices traditionally 
Subsumed under the “ego-strength” construct 
(€g., Mischel, 1961a, 1961b, 1961c; Mischel 


i This study was supported by Research Grant 

» 700830 from the National Institutes of Health, 
nited States Public Health Service. Grateful ac- 

vial agement is made to the administrative offi- 
Syst and teachers of the Palo Alto Unified School 

€m who cooperated in this study. 

Hey in the Department of Social Relations, 

vard University. 
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& Gilligan, 1964). The task now is to further 
isolate the variables governing this behavior. 

The conceptualization of this choice be- 
havior is based on a social learning theory 
(especially Rotter, 1954) in which each choice 
is a function of the expectancy that it will 
lead to particular reinforcement in a given 
situation and of the value of the reinforce- 
ment. The expectancy that the delayed re- 
ward will be forthcoming is a main determi- 
nant of the choice to defer immediate grati- 
fication for the sake of a larger delayed 
reward. Support for this comes from studies 
by Mahrer (1956) and by Mischel (1963), 
which showed that preferences for delayed 
rewards are affected by the individual’s ex- 
perimentally manipulated promise-keeping 
history, and the length of the delay interval 
(Mischel & Metzner, 1962). 

Previous work studied choice conflicts be- 
tween smaller, immediately available rewards 
as opposed to larger rewards which could not 
be attained without a delay period, but in 
most life situations the contingencies for 
attaining larger rewards involve more than 
simple waiting. The present investigation 
deals with contingent choices when the 
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smaller reward is immediately available but 
the larger reward is contingent on instru- 
mental activity, both with and without addi- 
tional delay. It was reasoned that when at- 
tainment of the more valuable reward is con- 
tingent on satisfactory performance on a task, 
the individual’s expectancy for success will 
be a main determinant of his choice. More- 
over, when attainment of the more valuable 
reward is contingent on both successful per- 
formance and an additional delay period, the 
individual’s expectancy for success, as well 
as his expectancy that the reward will be ob- 
tained in spite of the delay period, both enter 
as choice determinants. 

Consider the components of expectancy 
itself, According to social learning theory, 
expectancy is a function of specific situational 
expectancies and of generalized expectancies 
prior to the specific situation, based on the 
individual’s previous reinforcement history in 
similar situations (Rotter, 1954). Most stud- 
ies on the generalization of expectancies con- 
ducted within social learning theory focus on 
how verbal expectancy statements are affected 
by success and failure on related tasks (e.g., 
Jessor, 1954; Mischel, 1958a).* The present 
experiment examined the interaction of gen- 
eralized and situational expectancies and 
their effects on subsequent choice behavior. 

A measure of generalized expectancy for 
success was administered to eighth-grade boys. 
Three weeks later, these subjects worked on 
problems and obtained one of three kinds of 
information about their performance. In one 
treatment subjects obtained success, in a sec- 
ond failure, and in a third no information. 
Following this, the children chose between less 
valuable, noncontingent, immediately avail- 
able rewards and more valuable but con- 
tingent rewards. There were five variations in 
the contingencies for the more valuable re- 
wards and each subject chose under all varia- 
tions. Namely, the contingency was: (@) suc- 
cessful performance on a task similar to one 
of the treatment tasks, (b) same as a but 
with an additional delay period, (c) success- 
ful performance on a task dissimilar to one 


®There have been important exceptions (eg., 
Feather, 1959, 1961) but these were conducted in a 
motive-expectancy-value model (e.g., Feather, 1963) 
and not in social learning theory. 
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of the treatment tasks, (d) same as c but 
with an additional delay period, (e) only a 
delay period. 4 

It was anticipated that subjects would 
discriminate between choice conditions as a | 
function of the particular contingencies on 
which reward attainment depends. Contin- 
gent rewards should be chosen most when the | 
probability for attaining them is greatest. | 
Therefore, when the contingency for attain: 
ing more valuable rewards includes success- 
ful performance (work) as well as a delay pe- 
riod, they will be chosen less than when” 
there is only one contingency (either working | 
or waiting). d 

Second, rewards contingent on success 
a task similar to one on which the person 
previously succeeded will be chosen more 
frequently than those requiring success on a 
task similar to one on which he previously 
failed. 

This study also explored the extent 
which obtained success and failure on tasks) 


tasks dissimilar to the treatment tasks andi 
when it is nof contingent on the subject's) 
own performance. The extent of such general- 
ization is an empirical question, and no] 
directional hypotheses were formulated. The) 
question here was: does success or failure on 
tasks affect, or generalize to, choice prefer 
ences for larger rewards dependent on cone 
tingencies dissimilar to those on which suc 
cess or failure was obtained, and on coni 
tingencies independent of the individu 
own performance (i.e., delay only)? Further 
if such generalization effects occur, are 
related to generalized expectancies? 
Finally, it was reasoned that when situa 
tional expectancies are minimal the effects 0 
generalized expectancies will be strong am 
the converse. Thus, when the person succeeds 
or fails on tasks similar to those involved in 
the contingency for attaining the more valua: 
ble reward, specific situational expectancy 181 
large and generalized expectancy should have 
little effect on choice. However, in the abn 
sence of information directly relevant to th 
contingencies on which the more valued 
ward depends, generalized expectancies fo 
success should affect choices, with more Con 
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tingent large rewards chosen by subjects with 
high generalized expectancies for success. 


METHOD 
+ Subjects 


The data came from 89 eighth-grade boys from 
the physical education class of a public school in the 
Palo Alto, California, area. 


Overall Procedure 


A measure of generalized expectancies for success 
was administered to all subjects. Three weeks later 
these subjects worked on problems and obtained 

. either success, failure, of no information, After 
these treatments, the children chose between immedi- 
ately available, noncontingent smaller rewards or 

f contingent larger rewards, with five kinds of re- 
quirements for attaining the larger reward. 


Measurement of Generalized Expectancies for 
Success 


This assessment was conducted in large groups in 
two consecutive hours by the same male experi- 
menter. The subjects were told that a series of prob- 
„lems had been administered to many junior high 
school students. They were asked to estimate, as 
accurately as possible, by writing the appropriate 
numbews: “how many students at your grade level 
out of one hundred in your school” and “how many 
students at your grade level out of one hundred all 
over the country” would “have better results than 
you” on two different problems, described as re- 
quiring, respectively, “verbal reasoning” and “gen- 
eral information,” 


Assignment to Treatments and Design 


/ Assignment of subjects to treatment groups was 
independent of generalized expectancy scores. To 
avoid Possible differences between classrooms, all 
subjects were drawn by prior arrangement from a 
large _Physical education class divided randomly 
Into six subgroups, with 13-25 children in each. The 
eXperimental procedures were administered to each 
$ nbsroup in small classrooms in random sequence. 
Bae Subgroups received no information and within 
ee pepetoups half the subjects received the failure 
an alf _ the success manipulations, with an odd- 
tia _ Seating procedure used to select which subjects 
ained failure and which success. An approximately 
ee number of subjects were exposed to each 
_ Heatment, 
Note that the six subgroups served only as test- 
ee and do not correspond with the cells of the 
nee After completion of all experimental pro- 
dea, (described below), but before scoring the 
' subdi lent data, subjects in each treatment were 
ieas vided into those with high and low general- 
L abla eee The four expectancy statements 
mens from each subject were summed and the 
of thi or each subject was computed. The median 
g 1s distribution of mean expectancy scores was 


in, 
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79.5 and subjects were dichotomized into those 
with “high” as opposed to “low” generalized ex- 
pectancies on the basis of their position above or 
below the median. This provided the six groups of 
the design shown in Table 1: success, failure, and 
no information with high and low generalized ex- 
pectancy groups in each treatment, and a range of 
13-17 subjects in each group. To equalize the num- 
ber of subjects in each group, 11 subjects were 
randomly eliminated, reducing each cell to the size 
of the smallest (N =13 per cell). The total mean 
for all subjects with “high” generalized expectan- 
cies for success was 88.46 (N =39) and for those 
with “low” generalized expectancies it was 45.21 
(N = 39). The adequacy of the sampling procedures 
was supported by the fact that the mean generalized 
expectancy scores of subjects classified as high did 
not differ significantly between treatments. Likewise, 
the mean generalized expectancy scores of subjects 
classified as low were not significantly different be- 
tween treatments. The overall design thus contained 
five sets of dependent measures (described below) 
for six groups. 


Experimental Treatments 


The experimental session was conducted by a new 
male experimenter described as coming from a dif- 
ferent university and working on another project, 
so as to minimize the connection with the first ses- 
sion. All subjects were told that they would be 
tested on several abilities in order to compare their 
performance with other boys at their grade level. 
They were urged to do as well as possible and 
given four sets of problems. Three problems were 
adaptations from the Digit Span, Similarities, and 
Coding subtests of the Wechsler Intelligence Scale 
for Children. The fourth was called a “Verbal Rea- 
soning” test and consisted of groups of words which 
had to be rearranged to relate meaningfully to 
neighboring words. 

Subjects in the no-information condition were 
given only the above instructions and then per- 
formed the tasks without obtaining information 
about their performance. Subjects in the success or 
failure groups received false results after their 
performance on each problem. 

In order to make the manipulated scores seem 
more plausible, subjects were told that sometimes 
boys who usually do well in many school subjects 
do not do as well on these tests and similarly that 
boys who may not be doing so well on some of their 
school work may perform well on these special abil- 
ity tests. To increase the ambiguity of the scoring 
procedure and the plausibility of the false informa- 
tion, the time allowed for each test was too short to 
permit completion. 

At the completion of each problem, worksheets 
were collected and in the success and failure groups 
the “score” was returned to the subject on a slip of 
paper with a letter grade and a percentile score. In 
the success group children received a “B” on the 
first problem and an “A” on each of the remaining 
three, with percentile scores ranging from 80 on the 
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first problem to 90 on the last. The percentiles indi- 
cated their standing relative to other students at 
their grade level. In the failure group, subjects re- 
ceived a “D” on the first problem and an “F” on 
each of the remaining three problems with per- 
centile scores indicating that their performance was 
below 80-90% of the students at their grade level. 
Communication about grades was not permitted. 


Dependent Measures 


After completing the problems, subjects in all 
groups were told that the experimenter wanted to 


find out how children and adults of different ages 
choose when they are offered choices between 
different objects. This does not have anything to 
do with abilities, but please consider carefully 
each choice you make so that you will really 
choose what you prefer, 


All subjects were given a series of 25 choices, each 
between an immediately obtainable smaller or less 
valuable reward as opposed to a larger or more 
valuable reward. The more valued rewards could be 
obtained only under one of five conditions (Table 
1), There were five choices in each condition and 
each subject chose under all conditions. 

In four conditions attainment of the larger re- 
ward was contingent upon successful performance on 
a task. In two of these the task was described with 
the same general label as one of the tasks (verbal 
reasoning) on which success, failure, or no informa- 
tion had been obtained earlier. These are the “con- 
tingent similar” conditions (1 and 2 in Table 1), 
and in one of them (2) attainment of the larger 
reward was also dependent upon a waiting period 
(“contingent similar delay” condition), In the next 
two conditions the larger reward was contingent 
upon successful performance on a task described 
with a new label (general information) that had 
not been included in the series on which subjects 
had performed earlier, These are the “contingent 
dissimilar” conditions (3 and 4 in Table 1). Again, 
in one of these (4) the larger reward was also de- 
pendent upon a waiting period. Thus in Conditions 
1-4 attainment of larger rewards was contingent 
upon successful performance on tasks similar and 
dissimilar to the initial tasks, with and without an 
additional delay period. In the fifth condition, the 
more valuable reward was not contingent upon any 
task performance and depended only on willingness 
to wait (“noncontingent delay”), The same set of 
five choice pairs was used in each of these five 
conditions, resulting in a total of 25 choices. 

In each of the five conditions, subjects were pre- 
sented with the following five objects as smaller, 
immediate rewards: $1.00; two Mad magazines: 
small bag of peanuts; one hit tune record; plastic 
checker set. These were paired, respectively, against 
the following five corresponding larger rewards: 
$1.50; three Mad magazines; large can of mixed 
nuts; three hit tune records; wooden checker set. 
The delay conditions always involved a delay of 3 
weeks before the larger reward could be obtained. 
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For example, the contingent similar delay condition 
refers to solving a problem requiring verbal reason- 
ing plus waiting 3 weeks in order to obtain the 
larger reward, whereas the noncontingent delay 
condition merely required 3 weeks of waiting before 
the larger reward could be obtained. 

The 25 choices were presented in the same ran- 
dom sequence, in the group administration proce- 
dure described previously (Mischel & Gilligan, 1964). 
Children were provided individual booklets con- 
taining on each page a brief description of a given 
set of paired objects and the associated contingency. 
After the experimenter had displayed both rewards 
and explained the contingency, the children were 
instructed to record their choice, and to turn the 
page in preparation for the next set of items. The 
subjects were also advised to choose carefully and 
realistically because in one of the choices they would 
actually receive the item they selected, either on the 
same day or after the prescribed contingency, de- 
pending upon their recorded preference. After the 
choices, subjects worked on one more problem on 
which they obtained high grades, This was ex- 
clusively for “therapeutic” purposes and was in- 
cluded because the explanation of the procedure 
and the purpose of the experiment was postponed 
until all data were collected. All children were in- 


structed not to talk about the experiment. f 


RESULTS ¢ i 
Overall Analysis 


As a first step, an overall analysis of vari- 
ance was performed for the effects of general- 
ized expectancy, treatments, and choice con- 
ditions on choices in all conditions (Lindquist, 
1953, Type III). The results (Table 2) show 
a significant effect of generalized expectancy 
(p< .05) and significant effect of choice 
conditions (p < .001). Although none of the 
two-way interactions reached significance, the 
triple interaction was significant (p < .05). 
The effect of treatments (F = 2.82, df= 
2/72) tended to approach but did not reach 
acceptable significance (an F of 3.13 is re- 
quired for p < .05). 

A selected comparison between the com- 
bined task contingent choice conditions (1-4) | 
and the noncontingent choice condition (5) 
showed that larger rewards were chosen less: 
frequently when they were dependent 0? 
successful task performance than when they i 
required only waiting (¢ = 23.94, df = 1, 
$< .001). A selected comparison betwee? 
task contingent conditions without delay (! 
and 3 combined) and task contingent CON- , 
ditions plus delay (2 and 4 combined) showed 
that the addition of a delay period reduced 
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TABLE 1 


EXPERIMENTAL DESIGN AND MEAN CHOICES OF LARGE REWARDS BY 
ALL GROUPS IN ALL CHOICE CONDITIONS 


Choice conditions (contingencies) 


oe 


Large rewards 
Large rewards contingent on task noncontingent 
on task 
Treatment and general 
expectancy 
Similar to A Dissimilar to A 
= —] Delay only 
Without delay Plus delay Without delay Plus delay (5) 
i (1) 3) (3) (4) 
Success on A 
. High 3.39 2.69 2.92 2.92 4.08 
e Low 3.27 3.00 3.69 2.92 3.38 
Failure on A 
High 2.31 1.54 3.00 3.08 4.08 
Low 2.08 1.38 2.77 2.23 2.46 
No information on A 
High 3.77 2.85 3.54 2.85 3.54 
Low 2.08 1.54 2.15 2.15 3.23 


Note.—“A" refers to treatment tasks. 


the number of large reward choices (t= 
14.54, df = 77, p < .001). 


Effects of Success and Failure 


A main aim was to investigate the effects 
of success and failure on an initial task upon 
subsequent choices in which the larger re- 
ward is contingent upon successful perform- 
ance on a similar task. This required com- 
parison of the success and failure groups in 
the contingent similar task conditions (1 and 
2 combined). Further, a comparison was made 
of the number of large reward choices of the 
Success and failure groups in the contingent 
dissimilar task conditions (3 and 4 combined). 
The question here was: Did success and 
failure treatments affect choice preferences 
When the larger reward was contingent on 
the successful performance of tasks dissimilar 
to those on which success or failure was 


initially obtained? An analysis of variance 


(Lindquist, 1953, Type III) was performed 
to examine the effect of success and failure 
treatments, generalized expectancy, and task 


» Similarity-dissimilarity on the number of task 


Contingent large reward choices (Conditions 
1-4). The results, summarized in Table 3, 
show significant effects of success and failure 
treatments (p < .05). The effects of similar 


versus dissimilar contingent choice conditions, 
the interaction between choice conditions and 
treatments, and the three-way interaction 
were all significant. The effect of generalized 
expectancy was not significant. 

The effects of success and failure treat- 
ments were consistent with the hypothesis. 
Following success as compared to failure, all 
subjects (irrespective of generalized expect- 
ancy) chose significantly more large reward 
when the contingencies were similar (t= 
3.52, df = 50, p < .001), but not when they 


TABLE 2 


OVERALL ANALysIS OF VARIANCE OF LARGE REWARD 
CHOICES IN ALL GROUPS AND ALL CHOICE CONDITIONS 


Source df MS F 
Between subjects 77 
General expectancy (B) 1 29.36 4.68* 
Treatment (C) 2 17.70 2.82 
BXC 2 10.35 1.65 
Error (b) 12 6.28 
Within subjects 312 
Choice condition (A) 4 17.39 13.69% 
AXB 4 1.10 87 
AXC 8 1.88 1.48 
AXBXC 8 2.71 2,13* 
Error (w) 288 1.27 
Total 389 
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TABLE 3 


ANALYSIS OF VARIANCE OF LARGE CONTINGENT RE- 
warp CHOICES (CONDITIONS 1-4) FoR THE SUCCESS 
AND FAILURE GROUPS 


Source af MS F 
Between subjects 51 
General expectancy (B) 1 50 
Treatment (C) 1 66.24 6.33* 
BXC 1 9.22 
Error (b) 48 | 10.47 
Within subjects 52 
Choice condition (A) 1 25.01 7.15* 
AXB 1 40 
AXC 1 | 21.24 6.07* 
AXBXC 1 | 27.79 7.19% 
Error (w) 48 3.50 
Total 103 
*p <.05, 
“p <o 


were dissimilar (ż < 1). Hence the significant 
double interaction in Table 3. 

This double interaction (Treatment x 
Choice Condition) was even stronger for sub- 
jects with low than with high generalized 
expectancies. Hence there was a significant 
triple interaction (Table 3). Separate com- 
parisons for subjects with high and low gen- 
eralized expectancy in similar and dissimilar 
contignent conditions following success and 
failure are in Table 4. Both subjects with 
high and low generalized expectancy chose 
more larger rewards on similar contingent 
choices following success than failure. Al- 
though this mean difference was highly sig- 
nificant for low generalized expectancy subjects 


TABLE 4 


COMPARISON OF PAIRS OF MEANS FOR ALL TREATMENTS: LARGE REWARD CHOICES IN CONTINGENT SIMILAR 
DISSIMILAR CONDITIONS By SUBJECTS WITH HIGH AND Low GENERALIZED ExpECTANCIES 
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(t =3.40, p < .01), it just reaches accept- 
able significance for those with high gen- 
eralized expectancy (t= 1.84, p < .05, one- 
tailed test).* Comparison of the success and 
failure groups on dissimilar contingencies 
showed no differences (£ < 1) in the choices of 
children with high generalized expectancy 
but for those with low generalized expectancy ` 
there was a nonsignificant trend (f= 1.63, 
dj = 24, p < .15, two-tailed test) for more 
large reward choices following success. 


Comparisons of Contingent Choices by Sub- 
jects with High and Low Generalized Expect- 
ancies in Success and Failure versus Nos 
Information Treatment 


Table 4 compares the mean choices of sub- 
jects with high and low generalized expect- 
ancy separately in the success, failure, and 
no-information groups on similar and dis- 
similar contingent choice conditions. The re- 
sults for success versus failure already have 
been presented. Comparing subjects with low 
generalized expectancy in the success and thi 
no-information groups showed that ‘after 
success they chose more contingent reward: 
on similar contingencies (p < .01) and like- 
wise on dissimilar contingencies, although the 
latter difference approaches but falls short 


4All ¢ tests in this paper are two-tailed except for 
tests of the explicitly predicted differences on con- 
tingent similar choices following success as Oppo! 
to failure which are one-tailed. 


Treatment comparisons 

General il 

Soe, i Seo aa amean | a ES ran a 
Contingent Contingent Contingent Conti: t ti Contingent 
similar milar similar dissimilar eee dissimilar 

High 

t 1.84* <1 <1 1 1 

Direction I> 5 TS $ 
Eo Il > I 

t 3.406% 1.63 2.694% 1.96 1 

Direction I> I> I> 1> mM = z 


=; i F 
Note.—+ tests comparing success with failure on contingent similar are one-tailed (explicitly hypothesized differences); # 


other ¢ tests are two-tailed; df = 24 
eens if for each test, 
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of acceptable significance with a two-tailed 
test (£ = 1.96, df=24, p < .10). In con- 
trast, these comparisons did not reach or ap- 
proach significance for subjects with high 
generalized expectancy (both ¢ values <1). 
Comparing subjects with high generalized 
expectancies for success in the failure and no- 
information groups showed that after failure 
they chose fewer contingent rewards on 
similar but not on dissimilar contingencies. 
For subjects with low generalized expect- 
ancies for success, choices were not signifi- 
cantly different following failure and no in- 
formation. 
» In sum, in the absence of information about 
performance, subjects with high generalized 
expectancies behaved like those who ob- 
tained success, whereas subjects with low 
generalized expectancies behaved like those 
with similarly low expectancies who obtained 
failure. For subjects with high generalized 
expectancies, failure (compared to no in- 
formation) significantly reduced large reward 
choices only on similar contingencies. For 
thos® with low generalized expectancies, 
Success (compared to no information) sig- 
nificantly increased large reward choices on 
similar contingencies with a strong trend also 
on dissimilar contingencies. 


Effects of Generalized Expectancy 


Recall that a significant effect of generalized 
expectancy was found in the initial overall 
analysis of variance (Table 2). The three-way 
Interaction in the overall analysis of variance 
indicated, however, that generalized expect- 
ancies did not affect all experimental groups 
equally in all choice conditions and the 
analysis of variance for success and failure 
8roups only on task contingent choices showed 
no significant effects of generalized expect- 
ancies although there were interactions. 

It was anticipated that in the no-information 
8toup subjects with high as opposed to low 
Seneralized expectancies would choose more 
„ task Contingent large rewards (Conditions 
l-4 in Table 1). This prediction was sup- 
Ported (¢= 2.45, df= 24, p<.03). Com- 
Parable ¿ tests in the success and failure 
Stoups did not reach significance (¢ < 1). 
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Noncontingent (Delay Only) Choices 


The effect of generalized expectancies and 
treatments was also examined when the larger 
reward required delay but was not contingente 
on performance (Condition 5 in Table 1). It 
was not predicted that generalized expect- 
ancies for success would affect choices in this 
condition but the data indicated differences 
as a function of generalized expectancies (see 
Table 1). Analysis of variance for all treat- 
ment groups on choices in this condition 
showed a significant effect of generalized 
expectancies (F = 8.72, df = 1/72, p < .01). 
Treatment effect and interactions were not 
significant (F < 2). Subjects with high gen- 
eralized expectancies for success waited for 
larger rewards more than those with low 
expectancies in all treatments (see means in 
Table 1 for Choice Condition 5). However, 
t tests showed that this mean difference was 
significant in the failure group (t= 2.60, 
df = 24, p < .02) while it did not approach 
significance in the success group (¢ = .92, 
df = 24, ns) and was negligible in the no- 
information group (¢ = .48). That is, follow- 
ing failure, subjects with high generalized 
expectancies for success chose to wait for 
larger rewards which were independent of 
performance more than those with low gen- 
eralized expectancies. Following success and 
no information, generalized expectancies did 
not affect choices in this condition. 


Discussion 


The highly significant effects of choice con- 
ditions clearly demonstrated the importance of 
the specific contingencies for attainment of 
larger or more valued rewards as determinants 
of waiting and working for them. Larger re- 
wards whose attainment required only waiting 
were chosen more frequently than those that 
required successful work. Moreover, when 
larger rewards were contingent on waiting as 
well as successful work they were chosen less 
frequently than when they required only 
successful work. The findings show that ac- 
curate predictions about this aspect of self- 
control require detailed analysis of the specific 
contingencies for attainment of the more valu- 
able outcome. The results suggest that be- 
haviors frequently used as indices of “ego 


632 WALTER MISCHEL 
strength” and treated as if they were re- 
ferents for relatively stable, general, and 
situation-free traits may largely be determined 
by situational contingencies. 

Multiple contingencies (successful work 
plus additional delay) presumably reduced the 
subjective probability for attaining contingent 
rewards. It is also possible that the combined 
contingencies altered the reward value of the 
contingent larger rewards, by making their 
attainment more aversive rather than less 
probable. The present study was not designed 
to differentiate between these possibilities and 
illustrates only that individuals discriminate 
between contingencies in these choices, with 
additional risk contingencies decreasing will- 
ingness to work and wait for larger rewards. 
Likewise, the fact that “delay only” resulted 
in the greatest number of large reward choices 
should not lead to generalizations beyond the 
sample and tasks used. In some circumstances 
or cultures it is likely that contingencies in- 
volving successful work (even with a high 
risk of failure) would be more preferred than 
those requiring only a waiting period. This 
seems especially plausible when trust in the 
“promise-maker” is minimal or when waiting 
is a highly noxious activity (Mischel, 1958b). 
Obviously there are great individual differ- 
ences with respect to this. Note, for example, 
that following both success and failure, chil- 
dren with low generalized expectancy for 
success chose slightly (not significantly) more 
larger rewards when the contingency required 
success on a task dissimilar to the initial 
task but without additional delay than when 
it required delay only (Table 1). 

The results demonstrate that generalized 
expectancies for success, presumably based 
on previous reinforcement in similar earlier 
situations, are determinants of choices when 
there is no information in the situation rele- 
vant to success probability. In the absence of 
such information, subjects with low general- 
ized expectancies behave like those with 
similarly low expectancies who actually ob- 
tained failure, whereas subjects with high 
generalized expectancies behave like those 
who succeeded. 

However, when specific expectancies about 
outcomes within the situation are relatively 
clear, as in the success and failure treatments, 
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they tend to minimize the effects of general- 
ized expectancy, particularly when the con- 
tingencies are highly similar to those on 
which situational success or failure was ob- 
tained. There are major individual differences 
in the extent to which subjects discriminate 
between similar and dissimilar contingencies 
and specific contingencies interact both with 
treatment and with generalized expectancy. 
Treatment effects on dissimilar contingencies 
were minimal in this study, with trends only 
for, subjects who had low generalized expect- 
ancies. This is consistent with the discussed 
data on specific contingencies and subjects’ 
discriminations between contingencies, as well 
as the interactions with generalized expect- 
ancy. 

It is also of interest that subjects’ direct 
estimates of generalized expectancy for suc- 
cess, rather than indirect inferences about 
motivational states, provided useful measures 
which predicted delay of gratification be- 
havior. The usefulness of such direct self- 
estimates from subjects supports earlier find- 
ings about the utility of self-predictious in 
a very different assessment context (Mischel, 
1965). 

Generalized expectancies produced an un- 
predicted significant effect in the noncontin- 
gent (delay only) choice condition. In this 
condition subjects with high generalized ex- 
pectancy for success waited for larger re- 
wards more than those with low expectancies. 


This difference was significant after the failure | 


treatment but not after success or no informa- 
tion. Tt seems plausible that generalized 
expectancy, as measured in this study, was 
also correlated with expectancies that the 
promised reward will be forthcoming, even 


when this is independent of the person’s ow? | 


performance and contingent primarily on the 


promise-maker. However, since there were no | 


significant differences between subjects with 
high and low generalized expectancies in 
willingness to wait for larger reward in the 
no-information and success groups, and since 
the difference was significant only in the 
failure group, it appears more reasonable that 
“confirmed failure” for subjects with low 


generalized expectancies decreased their dis: | 


criminations about specific contingencies an 
generalized to choice situations in which at- 
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tainment of a larger reward was independent 
of their own performance. 

Extrapolating to life situations, in the 
absence of new information about outcomes 
relevant to the contingencies on which re- 
wards depend, individuals with low general- 
ized expectancies for success behave as if 
they cannot fulfill the contingency. That is, 
they behave like individuals who already 
failed on tasks similar to those on which 
reward is contingent. Following actual failure 


- on tasks similar to those on which rewasd is 


contingent, they aré less willing to wait for 
larger rewards than subjects with high gen- 


© eralized expectancies for success, even when 


the reward is not contingent on their own 
work and requires only waiting. However, 
when given information indicating that they 
can probably fulfill the relevant contingencies 
(success treatment) their willingness to work 
and wait for larger rewards increases and 
indeed they behave like subjects with high 
generalized expectancies for success. The 
implications for increasing an individual’s 
wilkingness to work and wait for larger re- 
wards are clear: increase the probability that 
he can fulfill the necessary specific contin- 
gencies. While this is hardly surprising, it 
needs to be taken seriously in therapeutic 
programs designed to enhance ego strength 
and suggests that specific training to increase 
the expectancy for success with respect to 
working and waiting for more valued outcomes 
ls a potent means of strengthening this aspect 
of self-control. 
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AS A FUNCTION OF INSTRUCTIONS AND 
TYPE OF TASK* 
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2 tasks were administered to 28 high test-anxious (HA) and 28 low test- 
anxious (LA) 4th-grade boys. è of each group received test instructions and 
4 received game instructions. The Ist hypothesis was that HA Ss perform 
as if being tested regardless of instructions. On a simple gamelike tracing task, 
LA Ss in the test condition made fewer errors than LA Ss in the game 
condition, while HA Ss in the 2 conditions did not differ. The 2nd hypothesis 
was that fear of failure is basic to the arousal of test anxiety. On a 2-part 
drawing task Ss were required to make “a picture” and to draw a “poskon.” 
No anxiety group differences were found for the free drawing; in drawing 
a “poskon”—the failure task—HA Ss made smaller drawings and verbalized 
Jess. In addition, HA Ss tended to switch colors and materials more in the 


test than in the game condition while the reverse was true for LA Ss. 


Research supporting the theory of test 
anxiety developed by the workers at Yale 
(Sarason, Davidson, Lighthall, Waite, & Rue- 
bush, 1960) indicates that test anxiety in 
children interferes with performance on test- 
like tasks (Lighthall, Ruebush, Sarason, & 
Zweibelson, 1959; Sarason et al., 1960; 
Zweibelson, 1956) and on relatively unstruc- 
tured or creative tasks (Fox, Davidson, Light- 
hall, Waite, & Sarason, 1958; Sarason, David- 
son, Lighthall, & Waite, 1958b). However, 
facilitation of performance has been demon- 
strated on gamelike tasks (Lighthall et al., 
1959; Sarason et al., 1960; Zweibelson, 1956) 
and on tasks which reward cautious, depend- 
ent behavior (Ruebush, 1960b; Waite, 1959). 

In the present study, two major questions 
were raised. The first question was whether 
the test-anxious child discriminates between 
formal test and nontest situations or responds 
to all performance situations, in which an 
adult is present, as test situations. According 

1This paper is based on a dissertation submitted 
to the Institute of Child Development, University 
of Minnesota, in partial fulfillment of the require- 
ments of the PhD. The writer wishes to express 
sincere appreciation for help and advice in all phases 
of the study to Britton K. Ruebush, Harold W. 
Stevenson, James W. Barnard, Raymond O. Collier, 
and David S. Palermo. Manuscript preparation was 
partially supported by Research Grant MH-06809 
from the National Institute of Mental Health 
United States Public Health Service. i 

?Now at San Francisco State College. 
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to the theory (Sarason et al., 1960), test 
anxiety has its origins in early evaluative 
situations in the home which for the most part 
antedate the beginning of formal schooling. 
Generally, these situations involve repeated 
devaluation of the child’s performance by the 
parent. The end result is that the child in the 
school test situation has strong needs to avoid 
failure because of the highly threatening con- 
sequences he perceives it to have (loss of 
love, etc.). If test anxiety truly originates in 
early nonschool situations with the parents, 
then all situations which involve performance 
for an observing adult—not only formal test 
situations—should become “test” situations 
for the test-anxious child. 

To investigate this prediction, a simple’ 
gamelike task involving perceptual-motor 
skill (the tracing task) was administered to 
both high test-anxious (HA) and low test- 
anxious (LA) subjects with either test or 
game instructions. It was expected that given 
the presence of an observing adult, the LA 
child would respond to a noncompetitive 
game as a “game” while the HA child would 
respond to a noncompetitive game as a “test.” 
Therefore, it was predicted that although LA 
subjects given test (evaluative) instructions 
would perform significantly better than those 
given game (nonevaluative) instructions, HA 
subjects under test and game instructions 
would perform similarly. Because of the sim- 
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ple, gamelike nature of the task and the con- 
sequent unlikelihood that HA subjects would 
show interference under test instructions 
(Lighthall et al., 1959; Sarason et al., 1960; 
Zweibelson, 1956), it was predicted that 
under test instructions the performance of 
LA and HA subjects would not differ signifi- 
cantly, Under game instructions, however, it 
was predicted that HA subjects would per- 
form at a significantly higher level than LA 
subjects because of their perception of the 
situation as an evaluative one. A 

The second question raised concerns the 
extent to which fear of failure is basic to the 
arousal of the test-anxious reaction, Accord- 
ing to the theory (Sarason et al., 1960), the 
HA child anticipates failure in the test situa- 
tion and views the consequences of failure, in 
terms of the reactions from others, as highly 
threatening, It is this anticipation of failure 
and the fear of its consequences which arouses 
anxiety and results in interference in per- 
formance. Thus, in a situation in which the 
task is structured and “easy”—one in which 
the probability of failure is low—the HA 
child should not evidence interference in per- 
formance due to test anxiety (Lighthall et al., 
1959; Sarason et al., 1960; Zweibelson, 
1956). However, in a performance situation 
in which the HA child feels that failure is in- 
evitable, test anxiety should be aroused and 
should result in marked interference in per- 
formance, 

Because previous studies have not demon- 
Strated a differential effect of failure and suc- 
cess instructions with either HA adults or HA 


‘children (Mandler & Sarason, 1952; Waite, 


Sarason, Lighthall, & Davidson, 1958), a 
different approach to the creation of a failure 
Situation was used in the present study. A 
gamelike task (the drawing task) with two 
Parts was administered along with the tracing 
task discussed above. Each subject received 
the same kind of instructions (test or game) 
with the drawing task as he did with the trac- 
ìng task. In each part of the drawing task 
the subject was asked to make a drawing for 
which paper and an array of art materials 
were provided. In one part the child was 
asked to make a picture (free drawing), while 
im the other part he was asked to make a pic- 
ture of a posxon. Because the child was in no 
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way informed that poskON was a nonsense 
syllable, it was assumed that the majority of 
subjects would anticipate failure on this task. 
The reasoning was that if anticipation of 
failure is basic to the arousal of test anxiety, 
then the posxon subtask would arouse anxi- 
ety in the HA child which would be reflected 
in performance. Although the free drawing 
subtask of the drawing task was somewhat 
unstructured and might have been expected 
to result in impaired performance by the HA 
child (Fox et al., 1958; Sarason et al., 
1958b), it was predicted that HA and LA 
subjects would differ most in performance of 
the poskon subtask. 

Assuming, as in the first major hypothesis, 
that both situations (test or game) would be 
perceived as evaluative by the HA child, it 
was expected that anxiety would be reflected 
in the performance of the HA child on the 
POSKON subtask under both test and game 
instructions. Such anxiety would result in 
constricted behavior (Fox et al., 1958) as 
reflected by smaller drawings, the use of fewer 
types of art materials and with the exception 
of questions, fewer verbalizations. Questions 
were excluded in this prediction because 
previous findings (Sarason et al., 1958a, 
1958b) suggest that the HA child would ask 
more questions than the LA child as a re- 
flection of high dependency needs. 


METHOD 
Subjects and Design 


Four hundred and sixteen fourth-grade boys from 
six elementary schools in a predominantly middle- 
class suburb of Minneapolis, were pretested on the 
Test Anxiety Scale for Children (TASC; Sarason 
et al., 1960) and the Defensiveness Scale for Chil- 
dren (DSC; Ruebush, 1960a). After discarding those 
who had repeated a grade, those for whom a verbal 
score from the Lorge-Thorndike Intelligence Test, 
Form 3A, was not available, and those who were 
left-handed (the tracing task apparatus was designed 
for right-handed persons), 28 HA and 28 LA sub- 
jects were selected from those below the median 
on defensiveness (DSC). The HA subjects had 
scores >17 (eighty-second percentile) and the LA 
subjects had scores <7 (thirty-sixth percentile). 
Defensiveness was controlled because recent evi- 
dence indicates that defensiveness is an important 
variable in investigations of anxiety in children 
(Ruebush, 1960a, 1963). One half of each anxiety 
group was randomly assigned to the test condition, 
the other to the game condition. Within each of the 
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TABLE 1 


CA, LORGE-THORNDIKE VERBAL SCORE, TASC, 
‘AND DSC MEANS AND STANDARD DEVIATIONS 


CA in | Verbal | Verbal | TASC DSC 
Group | months | score IQ score | score 
HA-test 
M 118.5 | 45.4 | 106.9 | 20.2 9.6 
SD 3.6 | 11.0 10.8 3.3 4.0 
HA-game 
M 117.4 | 43.9 | 106.7 | 19.9 9.6 
SD 3.3 13.7 10.6 35. 2.8 
LA-test 
M 120.1 | 53.8 | 112.4 5.1 11.0 
SD 3.7 11.3 10.9 1.8 2.6] 
LA-game Q 
M 119.6 | 53.9 | 112.6 4.0 | 11.5 
SD 3.3 | 10.9 7.5 2.5 2.5 


groups the order of the two experimental tasks was 
counterbalanced; half received the tracing task first 
(order TD) and half received the drawing task first 
(order DT). This resulted in a 2 (anxiety levels) X 
2 (conditions) X 2 (orders) factorial arrangement 
with seven subjects in a cell. 

The characteristics of the four Anxiety X Condi- 
tion groups in the final sample with regard to age, 
verbal scores, and IQs, TASC scores, and DSC scores 
are presented in Table 1. It can be seen that the LA 
and HA subjects differed on mean Lorge-Thorndike 
verbal scores (t = 2.18, df = 54, p< .05). However, 
because the present study involved simple percep- 
tual-motor tasks, a difference at this level was judged 
not crucial. This was substantiated in later analysis. 


Tracing-Task Apparatus 


The apparatus consisted of a sloping wooden 
panel with an elevation of 4.5 inches in the rear and 
2.5 inches at the front. The panel was 18 inches 
wide and 14 inches deep. Centered on its horizontal 
face was a thin translucent white Lucite screen, 10.5 
inches wide and 12.5 inches deep. The screen was 
enclosed by a grooved frame at the bottom and two 
sides which was designed to accommodate 11 X 12 
inch copper-laminated plates, each bearing a picture 
drawn with black oil paint. 

Each picture contained figures of a parent and of 
a boy. A strip of copper fs inch wide was removed 
from around the outline of the boy, exposing the 
phenolic inner core. The outline of the figure was 
illuminated from below with six 6-watt, 110-volt 
pilot lamps placed under the panel and beneath the 
Lucite screen. On each of the copper plates, removal 
of the copper from around the figure of the boy was 
terminated .25 inch from where removal was begun 
to provide a pathway with a beginning and an end. 
The beginning of the pathway on each plate was 
indicated at the foot of the boy by a large yellow 
plastic tape arrow. 

A standard pin jack which served as a receptacle 
for a metal-tipped stylus was placed on the right 
side of the panel. 
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Six pictures, two with neutral content (Pictures 
1 and 2), two with dependency content (Pictures 3 
and 4), and two with aggressive content (Pictures 
5 and 6) were presented to the subject in the ap- 
paratus described above. 


Drawing-Task Materials 


Five different types of art materials were pro- 
vided for the drawing task, crayons of 8 different 
colors, round stick pastels of 12 different colors, 12 
square stick pieces of charcoal, pencils of 12 different 
colors, and felt-top markers of 6 different colors. All 
sets of materials were aligned in a wooden tray and 
presented in a different random order for each sub- 
ject.°Two sheets of 12X18 inch white drawing 
paper were used with each ‘subject. 


Procedure 


All subjects were tested individually in a small 
room at their school by one female experimenter 
who did not know the anxiety level of any of the 
subjects. After seating the subject at the table with 
the apparatus for the first task, the experimenter 
asked three standard questions dealing with base- 
ball, summer vacation, and favorite school subjects 
to establish good rapport. After completion of the 
first task, the experimenter and the subject moved 
to another table where the second task was admin- 
istered. Then the subject returned to his classroom 
with the experimenter. 3 

Each subject received instructions according to 
the condition and the order to which he had been 
assigned. The instructions were as follows: 


[Test condition] My name is Miss and 
I am in the schools today giving some tests. I 
have two tests to give you and I want you to do 
your very best. I will give you a score on each 
test and your scores will be compared with the 
scores of the other boys. It is very important that 
you do your best. These tests are hard, but I’m 
sure you'll do well. 

The first [second] test is the Construction 
[Psychomotor Dexterity] Test. Listen carefully to 
the instructions and then you can begin. As I said 
before, your scores will be compared with those of 
the other boys so it is very important that you 
do your best. 

[Game condition] My name is Miss and 
I have some games to show you. These games are 
games which I have made and I want to find out 
how boys your age play with them and like 
them. They are not tests and have nothing to do 
with how well you're doing in school. They're 
fun games and I think you'll like them. 

The first [second] game is the Drawing [Trac- 
ing] game, First I'll tell you how it goes and then 
ae can play it. It’s a fun game and I think you'll 

e it. 


For the tracing task, the experimenter explained 
that the idea Was to trace the outline of the boy in 
each of the six pictures, to pick up the stylus when 
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the picture was exposed, to start at the yellow 
arrow, and to do it as quickly as possible without 
removing the stylus and without touching the sides 
of the pathway. When the buzzer sounded it meant 
the sides had been touched. The set of six pictures 
was presented once to each subject in random order 
* vith the restrictions that Picture 1 always be first 
and that no two pictures with the same content be 
presented consecutively. Picture 1 served as a prac- 
tice picture although it was not designated as such 
to the subject. 

For each picture the experimenter said, “Ready? 
Begin,” and then removed the cover. When the sub- 
ject had reached the end of the path, the experi- 
menter then removed the picture and asked the 
subject to replace the stylus in the receptacle. 

The drawing task consisted of two parts presented 
in counterbalanced order, The subject was presented 
with paper and the tray of art materials and asked 
to make “a picture” (Subtask A) and to make a 
picture of a posxon (Subtask P). The nonsense 
name, POSKON, was printed on an 8X5 inch card 
which the experimenter presented to the subject as 
the experimenter pronounced the word. In neither 
condition did the experimenter intimate that POSKON 
was not a real word. In each subtask the subject 
was allowed to work 4 minutes from the time he 
began to draw. No mention of a time limit was 
made to the subject and 4 minutes appeared to be 
ample time for the majority of subjects. 

To*maintain the unstructured nature of the task 
all questions from subjects were answered unin- 
formatively but pleasantly with replies such as, “Do 
a you think,” “Draw whatever you think it is,” 

It is up to you to decide,” etc. 


Response Measures 


Tracing task. Two latency measures were used: 
the time in seconds that elapsed between exposure 
of a picture and removal of the stylus from the 
receptacle by the subject and the time in seconds 
that elapsed between removal of the stylus and the 
point at which the subject began to trace. These 


m latencies were obtained in order to test predictions 


about the ‘effects of picture content, As picture con- 
tent will not be discussed in this report, the analysis 
of these data will not be presented but are avail- 
able elsewhere (McCoy, 1962). The number of oc- 
Casions that the subject contacted the copper (er- 
Tors) and the time taken to trace each picture (time 
Score) were recorded automatically. 

Drawing task. The time in seconds between the 
end of each set of instructions and the point at 
Which the subject began to draw was measured by 
à manually operated clock. Because of skewed dis- 
tributions and heterogeneity of variance, the laten- 
“les were changed to log scores [log (X +1)]. 

remarks that the subjects made were recorded 


' by the experimenter and later categorized as ques- 


tions, exclamations, statements of inadequacy to the 
a Self-depreciatory statements, general state- 
ents, and sentence fragments, Questions were de- 


fined as interrogatory sentences; exclamations, as 


task, 
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interjections or emotional ejaculations; statements 
of inadequacy to the task, as expressions of inabil- 
ity to perform as requested; self-depreciatory state- 
ments, as devaluation of own attempts or accomp- 
lishments; general statements, as comments or any 
word or words expressing a thought that was note 
included in the other categories; and sentence frag- 
ments, as interrupted or unfinished sentences. Agree- 
ment was 95.6% when protocols of eight subjects 
were categorized by an independent rater. 

Other measures included number of different 
colors and materials used by each subject as well 
as the area covered by each drawing. Area utilized 
in the drawings was estimated by applying a 12 X 
18 inch grid with 216 1-inch squares to each draw- 
ing. The number of squares which contained a part 
of the drawing constituted an area score for that 
drawing. Because of skewed distributions these 
scores were also transformed to logarithms. 


RESULTS 
Tracing Task 


Analysis of time and error scores, In order 
to investigate the importance of intelligence 
level in performance of the tracing task, time 
and error scores were each plotted against 
Lorge-Thorndike verbal scores. Since there 
was no evidence of nonlinearity, Pearson 7’s 
were computed. Neither the r of .04 between 
errors and verbal scores nor the 7 of — .26 
between time and verbal scores was signifi- 
cant; thus the findings are not attributable 
to the difference between the two groups in 
verbal scores. 

Because each subject was instructed to 
trace as “quickly and carefully” as possible, 
success in the tracing task involved both the 
number of errors made and the time taken in 
making them. Computation of a product- 
moment correlation revealed an r of — .54 
(t = 4.01, p < .01) between total error scores 
and total time scores indicating that subjects 
who traced rapidly tended to make more 
errors than subjects who traced slowly. In 
order to determine differences in performance 
between groups with time held constant, an 
analysis of covariance was performed for 
errors with time as the covariate. The ration- 
ale for this use of analysis of covariance is 
presented by Lindquist (1953). The results 
are presented in Table 2. 

As can be seen in Table 2, the main effect 
of Condition was significant indicating that 
subjects in the game condition (adjusted M 
= 66.0) made more errors than subjects in 
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TABLE 2 


ANALYSIS OF VARIANCE AND ANALYSIS OF COVARIANCE 
or ERROR SCORES WITH TIME AS THE COVARIATE 


Analysis of variance | Analysis of covariance 


Source 


df | MS F | df | MS F 

Anxiety (A) 1 90.0 1 56.6 
Condition (C) | 1 |4131.4| 4.88* | 1 | 5916.0 | 10.67** 
AXC 1 |1794.4| 212 | 1 | 2640.9) 4.77* 
Order (0) 1 795.0 1 529.1 
AXO 1 | 2900.2 1 | 953.5 
c xo 1 920.2 1 49 
AXC XO 1 | 1794.4 1 745.3 
Error (within) | 48 | 846.5 47 | 554.3 

Total 55 54 

*p <.05 

KD < .005. 


the test condition (adjusted M = 45.4). The 
interaction of Anxiety X Condition which 
was predicted was also significant. The ad- 
justed means for this interaction are plotted 
in Figure 1. It can be seen that the prediction 
that the difference between the performance 
of LA subjects in the test condition and the 
game condition would be significantly greater 
than the difference between the performance 
of the HA subjects in the two conditions was 
supported. 

Employing the error variance of the dif- 
ference between two adjusted means (Lind- 
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quist, 1953), ¢ ratios were computed. The fa 
LA subjects made significantly fewer errors © 
in the test condition (adjusted M = 39.5) 4 
than in the game condition (adjusted M= 

74.0) (t = 3.84, p < .01), while the HA sub- ., 
jects did not differ in the two conditions 
(test: adjusted M = 51.2; game: adjusted M 
= 58.1, £ < 1). Although the HA subjects in 
the game condition performed considerably 

better than the LA subjects in the game con- 
dition, this difference only reached borderline 
significance (£= 1.78, p < .10). The differ- 
ence between adjusted.means for HA-T and _ 
LA-T subjects was not significant (¢ = 1.32). 


Drawing Task 


Analysis of verbalizations. Because of the 
low frequency of verbalizations in all but the 
questions category, data for exclamations, 
statements of inadequacy to the task, self- 
depreciatory statements, general statements, — 
and sentence fragments were submitted to chi- — 
square analysis using 2 X 2 contingency ta- | 
bles, The HA and LA subjects were divided 
as to whether each made a particular type of 
verbalization or not. Calculation of chi-square 
using the Yates correction for continuity re- 
vealed one category in which there was a | 
significant difference between anxiety groups. 
Regardless of subtask or condition signifi- 
cantly more LA subjects (20) made general 
statements, that is, “I think PIL use blue,” 
“Maybe I'll draw a dog,” than HA subjects | 
(10) (x? = 7.22, df = 1, p < .01). t 

As a further means of getting at the differ- 1 
ences in verbal behavior between the two * 
anxiety groups, analyses of variance for each ~ 
subtask of the drawing task were performed 
using scores based on the number of verbali- | 
zations or communications a subject made. 
Verbalizations in all categories except sen- 
tence fragments were included in these scores. 
The results of the two analyses revealed 4 | 
significant main effect of condition for Sub- | 
task A (freedrawing) (F = 10.52, df = 1/48; 
~<.001) and for Subtask P (poskon) (F , 
= 18.60, df = 1/48, p < .001), both reflect — 
ing more verbalizations in the game condi- — 
tion (Subtask A: M = 4.7; Subtask P: M= 
6.5) than in the test condition (Subtask A: 
M = 2.1; Subtask P: M = 2.8). The main ef- 
fect of anxiety was significant in the analysis 
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for Subtask P only (F = 4.56, df = 1/48, p 


<.05), indicating that HA subjects made 
significantly fewer verbalizations (M = 3.8) 
than LA subjects (M = 5.6) while drawing a 
poskon. Thus, the prediction that HA sub- 
jects would make fewer verbalizations than 
LA subjects in Subtask P but not in Subtask 
A was supported, 

In order to test the hypothesis concerning 
questions, an analysis of variance for each 
subtask was performed employing the num- 
ber of questions the subject asked during 
each subtask as a scere. The analysis for Sub- 
task P revealed a significant main effect 


_ ef condition, indicating that subjects asked 


ie 


fewer questions in the test (M = 1.4) than 


“in the game condition (M = 2.9) (F = 11.41, 
df= 1/48, p< .005). The prediction that 


HA subjects would ask more questions than 
LA subjects regardless of condition was not 
supported. 

Analysis of drawing behavior. For each 
subtask, analyses of log latencies for elapsed 
time before the subject began to draw re- 
vealed no significant differences. 

As one means of discovering how the 
groups differed in their use of the art ma- 
terials, the number of HA and LA subjects 
using more than one of the five types of art 
Materials was analyzed for each subtask and 
for each condition separately using 2 X 2 
Contingency tables. Subjects were divided 
according to whether they used one, or more 
than one, type of art material. Of the four 
analyses, the only significant difference be- 
tween groups was the difference between the 
LA subjects and the HA subjects in the game 
Condition in the free-drawing situation (Sub- 


fask A) (Fischer exact probability = .028). 


Whereas 10 LA subjects in the game condition 
utilized more than one type of art material, 
only 4 HA subjects in the game condition 
did so, 

In a further analysis of the use of the art 
Materials, a color score was given to each 


Subject for each subtask by totaling the num- 


ber of different colors used in making each 
drawing. Analyses of color scores were per- 


| formed for the two subtasks separately and 


, 


By 


_ Tevealed no significant findings for Subtask 
A. For Subtask P, however, there was a sig- 


nificant main effect of order (F = 7.29, df = 
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1/48, p < .05) indicating that more colors 
were used in order TD (M = 2.46), on the 
average, than in order DT (M = 1.61). The 
Anxiety X Condition interaction was signifi- 
cant for Subtask P (F = 9.92, df = 1/48, p 
< .005) indicating that HA subjects used 
more colors (M = 2.36) in the test condition 
than in the game condition (M = 1.57), 
while LA subjects used more different colors 
in the game condition (M = 1.50) than in 
the test condition (M = 2.71). The Anxiety 
X Condition X Order interaction was also 
significant for Subtask P (F = 4.10, df = 
1/48, p < .05) and indicates that the differ- 
ences reflected by the Anxiety X Condition 
interaction were more pronounced in order 
TD: 

The analysis of switching scores or scores 
obtained by counting the number of times a 
subject switched from one color and/or ma- 
terial to another for each of the subtasks re- 
vealed the same significant effects and inter- 
actions and reflected trends identical to those 
in the analysis of color scores discussed above. 

As a means of analyzing the drawings 
themselves, log-area scores were obtained and 
an analysis of variance of these scores was 
performed for the drawings in each subtask 
separately. In the analysis for Subtask A, the 
main effect of order was significant (F = 
7.01, df= 1/48, p< .05) indicating that 
the drawings tended to cover more area in 
order DT (M log = 2.35) than in order TD 
(M log = 1.74). In the analysis for Subtask 
P, there was a significant main effect of anxi- 
ety (F = 4.73, df = 1/48, p < .05) indicat- 
ing that regardless of condition, HA subjects 
used significantly less area (M log = 1.20) 
in drawing a poskon than did LA subjects 
(M log = 1.42). 


DISCUSSION 


The findings for the tracing task suggest 
that although LA subjects in the test condi- 
tion responded to the experimental situation 
in a manner entirely different from that of 
LA subjects in the game condition, this was 
not true for HA subjects. While both LA and 
HA subjects performed well in the test con- 
dition, the HA subjects, in contrast to the 
LA subjects, performed just as well in the 
game condition, as was predicted. The expla- 
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nation that seems most likely is that although 
HA subjects attended to the instructions, the 
fact that the HA child was required to per- 
form for an observing adult made the game 
situation evaluative and testlike. An alter- 
nate explanation for the findings is that HA 
subjects, in contrast to LA subjects, did not 
attend to external cues, that is, instructions. 
However, it seems unlikely that, if questioned, 
HA subjects in the game condition would 
have stated that they had been asked to take 
a test rather than to play a game, and partic- 
ularly unlikely in light of the simple, game- 
like, perceptual-motor task employed. 

Although these results need substantiation 
in further research, they certainly suggest 
that although the classroom teacher may be 
rewarded for instructing the LA child that he 
is being tested and should do his very best, 
such admonishing is unnecessary in the case 
of the HA child. 

With regard to the second major hypoth- 
esis, it was expected that the HA boys would 
show less freedom and spontaneity than the 
LA boys in the posKon subtask because of 
its failure implications, and that the anxiety 
aroused would be reflected in fewer verbali- 
zations and restricted use of the art ma- 
terials, Analyses of the results for the two 
subtasks in the drawing task revealed no sig- 
nificant effects of anxiety for Subtask A (free 
drawing), but for the posxon task HA boys 
made significantly fewer verbalizations and 
covered significantly less area with their 
drawings than LA boys, as was predicted. 
Contrary to prediction, however, it was found 
that HA boys in the test condition used more 
colors and “switched” back and forth among 
colors and materials more frequently than 
did LA boys, and, unlike LA boys, did not 
exhibit such behavior in the game condition. 
It was in the game condition, where explora- 
tory use of the art materials would seem en- 
tirely appropriate, that the HA boys used 
fewer colors. Also pertinent is the finding that 
the HA boys used fewer different types of art 
materials in the least anxiety-arousing sub- 
task (free drawing) and condition (game)— 
only 4 HA boys as compared with 10 LA 
boys used more than one type of art material 
in Subtask A in the game condition. Such 
findings suggest that in an unstructured game 
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situation in which the HA child cannot de- 
termine what is desired of him, his fear that 
his response will receive negative evaluation 
leads him to avoid performing or to respond 
as little as possible. When the unstructured 
task is presented as a test, the HA boy is 
forced to perform because of the certainty of 
failure if he does not. He is then choosing the 
lesser of two evils. This explanation is con- 
sistent with the theory, but remains, never- 
theless, a post hoc one. 

The finding that only 10 of the 28 HA 
boys made conversational remarks (general 
statements) during performance of the draw- 
ing task, whereas 20 LA boys made such re- 
marks, is an interesting one. It adds further 
support to the hypothesis that the HA child 
experiences anxiety in situations in which he 
must perform for an adult regardless of how 
the adult represents the situation, that is, 
game or test, or whether or not interference in 
performance occurs, 

It must be noted here that the differences 
reflected by the Anxiety X Condition inter- 
action in the analyses of color scores „and 
switching scores were present in order TD 
only, Although the tasks in the present study 
were counterbalanced in keeping with princi- 
ples of good experimental design, it was not 
expected that order of task administration 
would affect the results. Others who have 
investigated test anxiety have also been 
plagued with unexpected order effects (Doris 
& Sarason, 1955; Doris, Sarason, & Berko- 
witz, 1963; Mandler & Sarason, 1952) and 
no adequate explanations have as yet been 
offered which can incorporate all of the find- 
ings. Results involving order in the present 
study suggest the possibility that LA subjects 
may become less interested and motivated 
the longer they are kept in the task situation, 
while the HA subjects become more anxious 
with time, If this were true it would explain 
why significant differences appeared in draw- 
ing performance when the drawing task was 
administered last but not when it was ad- 
ministered first. This and other likely hypoth- 
eses have not, as yet, been adequately in- 
vestigated. : 

The expectation that HA subjects would 
ask more questions than LA subjects as @ 
reflection of high dependency needs was not 
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supported. Further consideration of the ex- 
perimental situation in this and in previous 
studies suggests an explanation of this find- 
ing. Previous studies (Ruebush, 1960b; 
_ Sarason et al., 1958a, 1958b) which have 
‘demonstrated dependency behavior in HA 
children have employed situations in which 
such behaviors were reinforced. In both sub- 
tasks of the drawing task, however, the only 
information provided in reply to questions 
was that the procedure was up to the child. 
Thus, in the drawing-task situation, asking 
questions did not result in task-relevant in- 
formation and served only to emphasize the 
unstructured nature of the task and the fact 
that no further information would be pro- 
vided, It is suggested, therefore, that the HA 
boy will continue to express his dependency 
by question asking only when he is reinforced 
for such behavior. Attempts to investigate 
differences in dependency behavior in HA 
children must utilize experimental situations 
in which reinforcement of such behavior is 


varied systematically, 
o 
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INTERNAL-EXTERNAL CONTROL AS A DETERMINANT 
OF AMOUNT OF SOCIAL INFLUENCE EXERTED * 


E. JERRY PHARES 


Kansas State 


Internally vs. externally controlled Es were used in the attempt to change 


female S’s expressed attitudes on variou: 


had filled out an attitude questionnaire. 


in their attempt to influence Ss. As hypo 


able to induce significantly greater changes in expressed attitudes than were 
externally controlled Es, Control Ss who merely filled out the questionnaire the 


2nd time without any attempted influen 
run by externally controlled Es. 


The induction of attitude change has his- 
torically been a crucial area of study in psy- 
chology and currently still represents a ma- 
jor psychological research effort. Both the 
nature of the individual to be influenced and 
the characteristics of the forces which apply 
the influence have been studied. One factor 
which may affect the amount of influence that 
one person exerts upon another in changing 
the latter’s attitudes is the concept of inter- 
nal versus external control of reinforcement. 
This is a concept which has developed out of 
social learning theory (Rotter, 1954) and re- 
fers to the extent to which an individual feels 
that he has control over the reinforcements 
that occur relative to his behavior. Externals 
feel that forces beyond their control are the 
essential factors in determining the occur- 
rence of reinforcements (such forces might 
include fate, chance, powerful others, the 
complexity of the world or its unpredictabil- 
ity, etc.). Internals, however, tend to feel 
that they control their own destiny and are 
the effective agent in determining the occur- 
rence of reinforcements. All of this repre- 
sents a continuum of individual differences 
that cuts across specific need areas. It is, in 
short, a generalized expectancy relating be- 
havior to reinforcement in a wide variety of 
learning situations. Seeman (1959) has dis- 
cussed this variable in terms of “powerless- 
ness” and its relationship to the sociological 
concept of “alienation.” The related notion of 
“mastery” has also been treated by Strodt- 
beck (1958). 


1This study was supported by Grant NSF GS-112 
from the National Science Foundation. 
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University 


s issues. Prior to the experiment, Ss 
Es read from standard instructions 
thesized, internally controlled Es were 


ce showed changes equal to those Ss 


A number of studies have shown thet} 
expectancies are differentially affected when 
tasks are perceived as dependent upon skill 
as opposed to chance or luck (James, 1951% 
James & Rotter, 1958; Phares, 1957, 1962 
Rotter, Liverant, & Crowne, 1961). Concur 
rent with this research, however, has been 
the development of a scale that would mei 
ure individual differences in the generali 
expectancy for internal-external control 6 
events. Starting with Phares (1957), the s 
was revised by James (1957) and sub: 
quently developed into a forced-choice scale 
(Rotter, Seeman, & Liverant, 1962). The test 
now being used is composed of 23 items andi 
6 filler items (Internal-External—I-E—Com- 
trol Scale). Illustrative items are presented 
elsewhere (Gore & Rotter, 1963). In geni 
the I-E concept has shown relationships 
a variety of behaviors including those in s 
learning situations (Bialer, 1961; Jame 
1957; Phares, 1957), conformity situations 
(Crowne & Liverant, 1963), and risk taking 
(Liverant & Scodel, 1960). It relates to Sp 
cific, logically selected tests (Holden, 19 
Simmons, 1959) and also to differences amoni 
known groups (Battle & Rotter, 1963; Crom 
well, Rosenthal, Shakow, & Zahn, 1961). 

Perhaps more directly related to the presi 
ent research are studies involving I-E ale; 
attitudes, social action behaviors, and expe 
menter influence. For example, Seeman an 
Evans (1962) demonstrated that patients 
a tuberculosis hospital who scored towā 
the internal end of the continuum knew mo! 
about their physical condition, were bett 
informed about the nature of tuberculosis 
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and were seen by hospital personnel as being 
more informed, better patients than were ex- 
ternals. Gore and Rotter (1963) have shown 
that scores on the I-E scale relate directly to 
social action-taking behavior. That is, the 
I-E scale predicted the type and degree of 
commitment behavior manifested by Negro 
college students to effect social change. An- 
other study by Gore (1962) relates to ex- 
perimenter influence. She used three groups: 
one in which the hypothesis was revealed to 
the subject; one in which subtle attempts 

: were made to influence the subject in the di- 
rection of the hypothesis; and a control 

' group. She found no differences between in- 
ternals and externals in her overt-influence 

- and control conditions but in her subtle-influ- 
ence condition the internals showed both less 
influence than the controls and also tended 
to respond more in’ the opposite direction of 
her subtle suggestion than controls, 

i Based upon the presumed construct prop- 
erties of the I-E scale as well as the preceding 
research it was hypothesized that internal 
subjects are able to exert more influence 
upon others than are external subjects. Thus, 
in influencing the attitudes of others, internal 
subjects should be more effective than exter- 
nals. Internals, having the generalized ex- 
pectancy that they are in control of their 
own behavior-reinforcement sequences, should 
thus be more effective agents in the induc- 
tion of change than individuals not having 
Such an expectancy, In effect, this study may 
be construed as a study of experimenter 

fluence where the true subjects are ex- 
perimenters. The prediction is basically that 

Internal experimenters will induce greater 
changes in attitude statements on the part of 
their subjects than will external experimenters. 


METHOD 


aie LE scale was administered to 179 males and 
T es in two large general psychology classes 
s q cansas State University. Scored in the internal 
Ree the means were 16.45 for males and 16.30 
Oi At the same time a so-called College 
‘vey on Survey (COS) was administered. This sur- 
ee of 12 items to which students re- 
7 ised each time on an 1l-point scale from 
le; gly disagree” to “strongly agree.” The survey 
trates die topics of current campus interest as illus- 
i by the following items: 
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1. Fraternities and sororities serve a vital and 
necessary part of college life. 

3. College athletics have gotten out of hand and 
should, therefore, be put in their proper perspec- 
tive. 

11. Class attendance ought to be completely op- 
tional for all students. 


Based upon the data from the foregoing two scales, 
two experimental groups were formed. The first was 
composed of 27 males (internals) with scores on the 
I-E scale ranging from 19 to 23 and the second 
group consisted of 27 males who received I-E scores 
from 0 to 14 (externals). Both groups were matched 
as closely as possible on the basis of COS scores. 
Since the COS consisted of diverse items, there was 
no possibility of deriving a single score that would 
meaningfully describe a student. Therefore, in match- 
ing groups, care was taken to insure that both mem- 
bers of each internal-external pair had the same 
number of COS items that fell into each of five 
designated intervals along the 11-point COS scale. 
This matching procedure accounts for the fact that 
only 50% of students from either I-E extreme was 
utilized. These two groups, representing the extremes 
on the internal-external control variable and matched 
as regards COS scores, served as experimenters for 
this study. They were the ones who attempted to 
induce attitudinal changes on the part of the sub- 
jects to be described below. 

The subjects for this study were two groups of 
females with scores on the I-E scale near the mean 
(16 or 17). Females with such scores were used ex- 
clusively since Gore (1962) reports data indicating 
that internal subjects are more resistant to social 
influence than are externals. The two groups were 
matched on the basis of their responses to the COS 
in the manner described previously for the experi- 
menters. In general, both groups tended toward the 
moderate ranges of opinion. Thus, in summary, 
there were two groups of female subjects with scores 
of either 16 or 17 on the I-E scale and matched on 
the basis of COS responses. One group of females 
served as subjects for the internal males, while the 
other matched group of females served as subjects 
for the external males. There were 27 subject-ex- 
perimenter pairs in each group. 

In addition, 30 females, drawn on a volunteer ba- 
sis from the remainder of the classes, served as a 
quasi-control group. These subjects were simply re- 
administered the COS approximately 10 weeks after 
the initial classroom administration. This procedure 
was designed to provide a rough index of the amount 
of attitude change that might be expected as a func- 
tion of the passage of time. This is only a quasi- 
control group inasmuch as the conditions of read- 
ministration of the COS were not entirely compa- 
rable. Indeed, these control subjects were also slightly 
different from the experimental subjects inasmuch as 
they were subjects with scores on the I-E scale 
other than 16 or 17 and were thus more heterogene- 
ous. However, the major purpose of this study was 
to demonstrate differences between two experimental 
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groups; not whether attitude changes could be ef- 
fected. 

Upon appearance, each subject-experimenter pair 
was checked to insure that they were not acquainted. 
» Following this, a research assistant led the experi- 
menter to a small room where the following instruc- 
tions were read: 


This is a study of social influence. The subjects 
that you will be running have previously taken an 
attitude test and we have their scores on this test. 
In this experiment, you will read a series of argu- 
ments to the subject to see whether or not it is 

` possible to alter her attitudes from what they were 
when she took the test. We do not know whether 
it is possible for experimenters like yourself to in- 
fluence subjects in their stated attitudes about 
these issues. In effect, we want to know whether 
it is or is not possible. Okay? 

Just read the instructions to the subjects. Say 
nothing else to them. If, for any reason, you 
should have to depart from this procedure, make 
a note of it and tell me afterwards, Please do not 
discuss this project with anyone until you are 
told it is alright. 


At this point, the research assistant showed the 
experimenter the instructions he was to read to the 
subject and went over them with him, answering 
any questions that arose. Following this, the experi- 
menter went back to the first room and brought the 
subject with him. The research assistant then left 
and the experimenter was on his own. The instruc- 
tions which the experimenter read to the subject are 
as follows: 


I am going to read you a series of statements 
that reflect positions on various attitudes that stu- 
dents have. It is possible that you may have filled 
out this questionnaire earlier in your General Psy- 
chology class—I believe it was used in some of 
them. 

Before you indicate whether you agree or dis- 
agree with the statement, however, I will read you 
some of the arguments that have been used re- 
garding the statement. In short, then, I will read 
you a statement and then give you some of the 
arguments that support a particular point of view. 
You will then rate your degree of agreement or 
disagreement with that statement using this scale 
as a guide. Okay? (No further explanation may 

» be given.) 


} The scale mentioned was an 11-point scale rang- 
ing from “strongly agree” (+5) to “strongly dis- 
agree” (—5). The attitude statements read were the 
same 12 items that appeared on the COS, 3 of which 
were illustrated earlier. Following each statement, 
the experimenter read the arguments, whereupon the 
subject indicated her degree of agreement-disagree- 
ment. An illustration is as follows: 


1.-Fraternities and sororities serve a vital and 
_ necessary part of college life. 
+ The arguments are: 
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a. Fraternities and sororities interfere with edu- 
cational goals. 

b. They too often foster discrimination and 
snobbishness. 

c. They emphasize superficial values. 

d. They often psychologically damage those per- 
sons who aren’t selected as members. 


What scale number indicates your degree of agree- 
ment or disagreement with the statement, “Fra- 
ternities and sororities serve a vital and necessary 
part of college life”? 


This general format was followed for each of the 
12 items. 


RESULTS AND DISCUSSION 


The essential data for this experiment ate 
the differences between the scale values as- 
signed to the 12 items prior to the exertion of 
social influence and those assigned subsequent | 
to that influence. It was hypothesized that 
internally controlled experimenters would | 
exert greater influence than externally con- 
trolled experimenters. In testing this hy- 
pothesis, each subject was given a score | 
which consisted of the sum of magnitudes 
of scale change for all items which were in 
the direction of the influence (arguments) 
presented by the experimenter. From this 
value was subtracted the sum of the mag 
nitudes of change which were in the direc- 
tion opposite to that of the influence. These 
data are presented in Table 1. The data 
indicate that internal experimenters were 
able to induce significantly more attitudinal 
change than were their external countet- 
parts. Indeed, the amount of influence 6%- 
erted by externals was essentially no different 
than the amount of change which occurred 
simply as a function of taking the COS fo! 
the second time (control group). Thus, thé] 
amount of change showed by the control 
group may be considered in part a reflection 
of the unreliability of the COS and may 
serve also as a base line against which t0 
compare changes which occurred in the tw? 
experimental groups. The variance in the 
three groups is rather high. The fact that it 
is high in all three groups may indicate that 
much of it is likely a function of the cos | 
or the general procedure. In testing for homo | 
geneity of variance between the internal 4” 
external control groups, an F value of border- 
line significance was obtained. Accepting t" 
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TABLE 1 


MEANS AND SIGMAS FOR SUBJECTS RUN BY INTERNAL 
AND EXTERNAL EXPERIMENTERS AND 
FOR CONTROL GROUP 


M N t 

27 |I versus E, 1.9* 
E versus C, ns 
30 |C versus I, 2.6** 


Internal experimenters | 9.7 
External experimenters | 3.1 
Control group 2.8 


Smo | a 
Ron 

x 

S 


*p < .03, one-tailed. 

*p < .01, one-tailed. 
variance as heterogeneous does not, however, 
‘significantly alter the confidence levels shown 
in Table 1. Leaving aside, for the moment, the 
COS or questions of its reliability, the fact 


a that significant differences between externals 


and internals were obtained is quite striking 
when one considers that this variable was the 
only one considered in selecting experimenters 
or subjects. Thus, no attempt was made to 
control such variables as personal appearance, 
skill in reading orally, social skill, or other 
organismic variables. Lack of control of such 
factors may also account for some of the high 
variance. 

Siffce data other than those of I-E and 
COS were not used in constituting the groups 
it may be argued that other variables might 
have accounted for the significant differences 
in experimenter influence. However, based on 
test scores available in the Kansas State Uni- 
versity Counseling Center, it was found that 
such an obvious factor as intelligence was not 
Seemingly operative. For example, the mean 
predicted grade-point average (based on ACT 
Scores and high school rank) for high I-E ex- 

: perimenters was 1.74 and for low I-E experi- 
Menters, 1.79, Similar scores for subjects run 
by high experimenters and low experimenters 
Were 2.27 and 2.22, respectively. Also avail- 
able were scores on the Marlowe-Crowne So- 
Cial Desirability (M-C SD) scale (Crowne & 
Marlowe, 1960). Such scores are presumably 


645 


indicative of a need for social approval so 
that females who score high on this scale 
might be expected to be more susceptible to 
social influence than those scoring low. In our 
sample, subjects run by low I-E experimenters’ 
had mean M-C SD scores of 15.50 while sub- 
jects run by high I-E experimenters achieved 
mean M-C SD scores of 14.96. Analogous 
scores for high and low experimenters were 
16.77 and 16.00, respectively. Thus, it is ap- 
parent that neither intelligence nor need for 
social approval, as measured by the above 
scales, contributed to the obtained differences 
in social influence. 

As further evidence for the hypothesis, fre- 
quency counts were made of the number of 
times each subject changed the scale value of 
an item. Separate counts were made of the 
instances when the change was in the pre- 
dicted direction (direction of the arguments) 
as opposed to change in the nonpredicted di- 
rection, These data are presented in Table 2. 
In testing for the significance of the differ- 
ence (.14; SE of difference = .045) between 
the external and internal proportions a z of 
3.1 was obtained which is significant beyond 
the .01 level. The same difference between in- 
ternals and controls (.13; SE of difference = 
045) yielded a z of 2.9, significant at better 
than the .01 level. 

Thus, regardless of whether the measure of 
influence is based upon the magnitude of 
change or upon the frequency of change, the 
indications are the same—internally con- 
trolled experimenters exert greater influence 
than do externally controlled experimenters. 
In fact, the influence exerted by externals 
(regardless of the measure) is no greater 
than those changes exhibited by subjects who 
merely retake the COS without any attempt 
at influencing them. 

The major hypothesis of this study was 


TABLE 2 


FREQUENCY OF PREDICTED AND UNPREDICTED (CHANGES IN 
ATTITUDE STATEMENTS FOR THREE GROUPS 


Proportion of predicted 
Predicted changes Unpredicted changes Total ee to 
Internal experi arr cea 244 .65 
` xperimenters 158 86 
xtemal experimenters 135 128 263 :51 
ontrol group 131 119 250 152 
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thus confirmed. These results strongly sug- 
gest that a major variable in the study of so- 
cial influence situations is the internal-exter- 
nal control dimension. A previous study has 
‘demonstrated that susceptibility to social in- 
fluence is mediated by I-E dimension (Gore, 
1962). The present study complements this 
finding by indicating that the I-E dimension 
likewise operates with those who would exert 
the influence. Indeed, the present findings 
were demonstrated under rather restrictive 
conditions, That is, the experimenters read 
rather routine, nonexpansive types of argu- 
ments and were not permitted to add any- 
thing to the printed statements. Also, subject 
and experimenter selection was based solely 
upon I-E scores and COS responses; other or- 
ganismic characteristics were not considered. 

These results seem striking. However, in es- 
sence, they serve to outline a future course of 
study. Several questions are left unanswered 
by this research. For example, the specific 
techniques by which internals are better able 
to exert influence remains for further re- 
search to elucidate. What are the specific 
mechanics (tone, gesture, facial expression, 
etc.) by which the influence is better exerted? 
Some of the experimenter bias research by 
Rosenthal and Fode (1963) suggests the im- 
portance of visual cues. Furthermore, should 
such differences be identified, what are the 
previous conditions which have led to the su- 
perior development or use of such skills on 
the part of internals as compared to exter- 
nals? Also left unanswered is the question of 
the interaction between internal-external con- 
trol and the strength of attitudes held. Thus, 
would the same results obtain when the sub- 
ject held very strong attitudes about various 
issues? 

An alternative hypothesis to account for 
the obtained results revolves around the de- 
mand characteristics of the situation. In view 
of the work of Orne (1959) and Rosenthal 
(1963), it could be argued that internal ex- 
perimenters were subtly influenced by the re- 
search assistant to perform in a manner dif- 
ferently from that of external experimenters. 
Such an eventuality is possible although it 
was reduced by the fact that the research 
assistant worked from lists of subjects and 
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experimenters and did not usually know 
which people belonged to which groups. At, 
any rate, systematic studies involving all the 
preceding points are either planned or under- 
way and should shed even further light on ¢ 
the construct validity of the I-E scale. 
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PATTERN RECOGNITION IN GROUPS: 
LABORATORY AND SIMULATION STUDIES* 


T. B. ROBY axo C. R. BUDROSE 
Tufts University 


The topic of investigation is the performance of small groups on a task re- 
quiring the identification of a certain pattern from discrete information items. 
2 factors hypothesized to affect performance were the complexity of patterns, 
defined by the number of pattern elements, and the number of group members 
who could identify the pattern directly. 4-person groups were run on a series 
of pattern-detection problems in which the factors were varied systematically. 
The results are compared with the results of a computer simulation program 
in which the same factors are varied. Discrepancies between the 2 sets of data 
are interpreted in terms of specific procedural differences which can be further 


investigated. 


Pattern identification may be very gener- 
ally defined as the process of establishing a 
correspondence between a set of observed 
stimulus elements and a specific (typically 
ordered) criterial set or “template” (Roby, 
1960). Thus the criterial set for the pattern 
Robin includes: thin yellow bill, slate colored 
back, reddish breast, hopping locomotion on 
the ground, level direct flight, caroling song. 
If an observed bird displays a sufficient num- 
ber of these features the bird is recognized 
as a robin (Collins, 1959), 

As this example indicates, the correspond- 
ence relationship between observed stimuli 
and the elements of the template set may it- 
self involve a secondary pattern recognition 
problem. “Hopping,” for example, is far from 
an elementary stimulus impression. In the 
simplest case, however, and in the present 
studies, this correspondence is one of direct 
identity. That is, the criterion for pattern oc- 
currence is simply the presence of a particu- 
lar set of stimulus elements, 

Even this comparatively rudimentary pat- 
tern-detection problem of ascertaining that a 
certain subset of elements is included in a set 
of observed stimuli may become quite difficult 
in the group situation. Unless there are effec- 
tive procedures for dissemination or routing 
of information, it may easily happen that the 
group members collectively have sufficient in- 
formation to detect patterns, yet no one in- 
dividual is in a position to put it all together. 
The purpose of the following studies was to 


1This research was supported under Contract 
Nonr 494(15) with the Office of Naval Research. 
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investigate this phenomenon and, more spe- 
cifically, to evaluate the extent to which it 
depends upon certain factors in the pattern- 
recognition task. | 

The particular factors here examined are, 
first, the number of elements that must be 
known before the pattern can be identified; 
and, second, the number of group members 
who are in a position to identify the pattern 
if they are given sufficient information. The 
first factor is an index of the complexity of 
a pattern, the a priori expectation being that 
patterns consisting of more distinct elements 
would be more difficult to detect. The second 
factor describes the central processing capa- 
bility of the group, and the assumption was 
that pattern identification would improve 
with an increase in the number of persons 
who could “put the information together.” 

The investigation here described includes 
both a conventional experimental study using 
live groups, and a “replication” of this study 
using a Monte Carlo simulation program. 
Comparison of the results for these two ap- 
proaches helps to clarify the substantive 
situation and affords several clear leads fot 
subsequent research. Thus a secondary pu- 
pose of this report is to illustrate the comple- 
mentary functions of laboratory and com- 
puter investigation of the type of problem 
here considered. 


LABORATORY STUDY 


Procedure 


The experimental task used in this study required i 
subjects working in four-man groups to detect the | 
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TABLE 1 
DISTRIBUTION OF TEMPLATE ĪNFORMATION (T) AND PATTERN ELEMENTS (x) OVER Group MEMBERS 


One element Two element 
. 
Subject Subject 
Pattern No. Pattern No. 

A B c D A B c D 
33 T x 41 Tiz x 
34 S x T 42 x Tix 
35 T ý Tx 43 Par ha Tx 
36 E T x T 4 E T, x x 
37 T,x T T 45 rT Tex T% 
38 T H T Tix 46 Tix Tx T 
39 T E < T E 47 PiX: Ta T T 
40 T T Tjk T 48 T, T AN Dex 

Three element Four element 
Subject Subject 
Pattern No. — — Pattern No. 

AE abe v D A B c D 
49 x x TX 57 x x Tix: x 
50 WS x x 58 x Dkr x x 
51 x Iy Tx x 59 T,x x x T,x 
52 Tax x x T 60 x T,x Tx x 
53 x T3 T a E S 61 T,x E T,x Tae 
54 T,x T LIES 5 x 62 Tx T,x x TE 
$5 T, x Tes EB T,x 63 x Tx Dex Tix 
56 i TA dK Tig 64 T Ta T,x Ty x 


Presence of certain sets of two-digit numbers which 
might occur on their collective displays. The four 
group members occupied isolated soundproof booths 
and were permitted to communicate by means of an 
intercom system that allowed only one two-way con- 
Nection at any time. Each booth contained a dis- 
Play panel upon which the two-digit numbers were 
shown, and the four display panels each contained 
different numbers on any trial. In addition each sub- 
ject had a sheet containing a list of 20 pattern tem- 
‘Mates subsets of 1, 2, 3, or 4 numbers that were 
i itrarily defined as patterns. The criterion for pat- 
i mm presence was that each of the numbers in the 
eee subsets must be displayed in one of the 
die ooths. The total list of pattern templates was 
i erent for each subject but there was some over- 

pe the particular templates as described below. 
aie booth was equipped with a knife switch 
D aa subjects used to record their responses. At 
Reggae curing a trial any subject could register a 
ae a is present” or a response “is absent.” How- 
en e trial „was not complete until all four sub- 
shear aN unanimously agreed upon the correct judg- 
Eatin ll responses were recorded on a 10-pen 

tline-Angus operations recorder. 

oe ee ma session consisted of 64 trials, 
digit rial comprising a distinct set of four two- 
s ets: On 32 of these trials some subset of 
for ae displayed numbers constituted a pattern 
plate. 2 One or more of the subjects had a tem- 
- in order to detect the pattern it was neces- 


sary for subjects to exchange information concern- 
ing the individual displays until one of the subjects 
who had the template could ascertain that all the 
numbers were present. On the other 32 trials, the 
display consisted of four numbers that did not ful- 
fill the requirements for a complete pattern even 
though the numbers might individually belong to 
various patterns. 

The 32 trials on which a pattern was in fact pres- 
ent represented two replicates of 16 pattern condi- 
tions. These 16 conditions were the factorial com- 
binations of four levels of pattern complexity and 
four levels of template distribution. Pattern com- 
plexity ranged from the simplest condition, in which 
a single two-digit number defined a pattern, to four- 
element patterns; similarly, from one to four group 
members might have the template for a given pat- 
tern on their typescript lists. 

The actual distributions of template information 
and pattern elements are shown in Table 1. A T in 
this table indicates that a given subject had the 
template for the particular pattern, and an x indi- 
cates that one element of the pattern appeared on 
his display. To illustrate, Pattern 43 is a two-ele- 
ment, two-template condition. The actual pattern is 
(07, 17) which appears in the template lists for Sub- 
jects B and D. The displayed numbers are: A, 07; 
B, 65; C, 03; and D, 17. Thus A has a relevant pat- 
tern element but no template; B has the template 
but an irrelevant number; C has neither a template 
nor a pattern element; and D has both. 
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TABLE 2 
VARIANCE ANALYSIS OF TRANSFORMED TIME SCORES 


Source af MS F 
Groups 11 148.71 4,.34** 
Pattern conditions 15 191.79 5.60** 

Complexity (A) 3 330.10 3.97* 

Templates (B) 3 379.20 4.56* 

AXB 9 83.20 2.43* 
Order 1 749.56 21.89** 
Pooled interaction | 356 34.24 

* 05. 

“p Sio 


Subjects were Tufts University undergraduates 
who were paid at an hourly rate for participation 
and also were offered incentive prizes for best group 
and individual performance. Twelve teams of four 
subjects took part in the experiment, and all groups 
were given the 64 mixed-pattern and nonpattern 
trials in the same random order. Thus all of the ex- 
perimental comparisons made below are on a within- 
team basis. 

Before beginning the experimental session proper, 
subjects were read a set of orienting instructions and 
were shown a mock-up of the displays and associ- 
ated apparatus with a few representative pattern and 
nonpattern presentations. They were then assigned 
to their isolation booths and, after familiarization 
with the intercom procedure, were given 10 sample 
trials under actual experimental conditions. Follow- 
ing this, they were given the 64 experimental trials. 
The time required for orientation and the experi- 
mental session proper ranged from 2 to 3 hours and 
was completed in a single session. 


Results 


The chief performance scores are the times 
to unanimous correct response on each trial. 
Visual inspection indicated that these scores 
were much skewed to the right, and an initial 
transformation was applied to each score. The 
transformation selected was defined by x’ = 
1/(x + k) where the constant, k, was deter- 
mined in such a way as to equate the median 


TABLE 3 
SPEED Score TOTALS FOR VARIOUS PATTERN TYPES 
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and midrange of the new distribution.? The 
result of this transformation is therefore a 
modified speed score which appears to have 
a satisfactorily normal distribution. 

Table 2 summarizes the analysis of vari- 
ance of the speed scores, All interactions, ex- 
cept that for pattern complexity by templates, 
are nearly identical in magnitude and have 
been pooled. With this error term, the main — 
effects are all highly significant. Thus there” 
are significant differences among performance | 
means for groups across pattern conditions © 
and among the 16 pattérn conditions. Order | 
effects were tested by comparing the first and — 
second occurrences of each of the 16 pattern 
conditions. As expected, performance is bet- 
ter on the second occurrence of most of the 
pattern conditions. 

Table 3 shows totals for the 12 groups of 
the speed scores obtained under various com- 
binations of pattern complexity and template 
possession. There is a steady improvement in 
performance as the number of templates in- | 
creases but the results for pattern complexity 
are not as clear. The best performance overall 
is obtained on the simplest (one-element) pat- 
terns and the next best on the two-element 
patterns. Poorest performance, however, is 
obtained on the three-element, rather than 
the four-element patterns. A second evidently = 
deviant result is that the poorest condition is 
that of a one-element pattern with one tem- 
plate.* This last result evidently accounts for 


ab + be — 2a 

a+c—2b ’ 
a, b, and c are, respectively, the lowest, median, ande 
highest scores of the original distribution. l 

3 There is a slight experimental artifact contained 
in this result. For all conditions other than the 1-1 
condition, at least one person who had the template” 
had at least one pattern element. This was not per- 


*The formula for k is k= where — 


Number of pattern 


Number of templates 


elements 
1 2 3 4 M 
1 338.3 545.2 532.9 535. 487.9 
388.9 456.7 495.3 1602 450.2 
3 362.7 333.7 387.5 442.8 381.7 
is 383.0 423.7 446.2 452.7 426.4 
368.2 439.8 465.5 472.7 436.6 


xf 
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TABLE 4 


Time DURING Wuicu Majority (3 or 4 Group Mrm- 
BERS) MAINTAINED AN INCORRECT RESPONSE AS A 
FUNCTION OF PATTERN CONDITIONS 


‘Number of templates 
Pattern 
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TABLE 5 


AN ILLUSTRATIVE Communication Process RESULT- 
ING IN PATTERN DETECTION 


Group members 


Epoch | Message 


size A B c D 
1 2 3 4 Total 
0 | Initial | T,x y Gh w 
1 612 | 101 10 0 723 1 | A>B | T,x x,y T: w 
2 300 113 0 27 440 2 DSC AY. T,w | w 
3 277 46 43 37 403 3) |-C= Balers w, x,» | T w| w 
4 438 44 0 0 482 4 D—C | T,= w,x,y | T,w| w 
Total 1627 304 53 64 2048 5 B>A |T, w,x,y | wx, y | T,w | w 
th . ifi e l . oe 
e signi! cant template complexity inter- COMPUTER SIMULATION STUDY * 
action in Table 2. 
Procedure 


One further analysis entailed a comparison 
of the mean performance scores for groups in 
the 32 trials in which there was a pattern with 
the mean performance on the 32 nonpattern 
trials. This comparison shows that for all 12 
groups the mean time to solution was less 
when a pattern was present. The natural im- 
plication is that groups in general established 
the absence of a pattern by a process of 
elimination, only after ruling out all sus- 
pecfèd patterns. 

The final set of empirical data, in Table 4, 

reflects the amount of time spent under each 
pattern condition on the false hypothesis of 
no pattern.” Specifically, the scores are the 
total number of 5-second intervals during 
which three or four of the group members in- 
Correctly agreed that there was no pattern 
present. These results, even more strikingly 
than the speed scores, emphasize the impor- 
tance of the template distribution and, in ad- 
«dition, the results suggest the explanation for 
the apparent anomaly in the one-element, one- 
template case. The very large number of false 
Negatives here indicates that the poor time 
Scores under that condition are attributable 
to instances in which subjects completely 
Overlooked the single element pattern and did 
not become aware of it until after they had 
failed to receive a “success” signal from the 
experimenter, 

Further discussion of these results will be 
reserved until after a description of the com- 
Panion simulation study. 


a in the 1-1 condition for obvious reasons. 
wath effect of this variable is shown in connection 
the simulation results, 


Following the laboratory study, the simulation 
study was based on a hypothetical group of four 
persons, each with a unique item of information, 
described as x, y, z, and w. On a given trial a cer- 
tain subset of one, two, three, or four of these items 
was defined as constituting a pattern. The template 
for this pattern might be placed with one, two, 
three, or four of the group members. Pattern identi- 
fication required that all of the elements in the 
pattern be collected by a group member with a 
template. 

Communication was assumed to be random and 
unidirectional. During each time interval one of the 
four group members sends a message to one of the 
other three group members. This message contains 
all of the former’s current information about the 
events w, x, y, 3 but no template information. In- 
formation is cumulative for each group member: 
they retain all of their original information plus all 
information sent to them. 

Table 5 illustrates the general process. A, B, C, D 
are the four group members; the pattern has three 
elements w, x, y; and these are initially known to 
D, A, and B, respectively. A and C have Template 
T telling them that w, x, y constitute a pattern. The 
first communication is from A to B; A advises B 
on Item x, but does not communicate the template. 
Next D transmits w to C; then C transmits w to B. 
At that point (Epoch 3) B has all of the elements 
in the pattern but does not know it to be a pattern. 
The next message, from D to C, accomplishes noth- 
ing as D has already transmitted all his informa- 
tion to C. Finally, however, B relays Items w and 
y to A, who recognizes the pattern. 

It will be noted that the simulation study, unlike 
the laboratory study, provides a pattern on each 
trial and does not allow for “not present” votes. 
Incorporating that feature of the earlier study would 


4This work was done in part at the Computation 
Center, Massachusetts Institute of Technology. 
Florence Gray provided the computer program for 
this study. 
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TABLE 6 
INNUMBER OF TRIALS BEFORE SOLUTION CLASSIFIED: BY EXPERIMENTAL PARAMETERS 
Pattern size 
ith 
» template | | Overlap 5 A 5 7: 
1 1 0.00 (4 7.20 (12) 9.56 (12) 11.11 (4) 
0 7.54 Ki 9.44 (12) 10.62 (4) =) 0) 
2 2 — (0) 4.53 (6) 6.99 (12) 8.08 (6) 
1 0.00 (12) 5.91 (24) 7.67 (12) — (0) 
0 4.88 (12) 7.28 (6) se — (0) 
3 3 — — 5.67 (4) 7.00 (4) 
2 ee 3.90 (12) 6.28 (12) | 
1 0.00 (12) 4.86 (12) sD ‘ a 
0 3.85 (4) = ae 
4 4 — — — 6.80 (1) 
3 = 5.28 (4) 
2 =. 3.47 (6) Su T 
1 0.00 (1) = y z 


Note.—Number of distinct distributions for each generic combination of parameters appears in parentheses. 


have required the addition of a more or less arbi- 
trary “giving up” rule in the computer program. 

Simulated runs of this kind were performed for all 
numbers of pattern elements from one to four, and 
for various conditions of template and initial in- 
formation distribution. With k elements in the 
pattern, there were () possible initial distribu- 
tions of those elements: 4, 6, 4, and 1 for k=1, 2, 
3, 4, respectively. Similarly, the assignment of the 
pattern template to n» persons could be done in 4, 6, 
4, and 1 different ways, Each of the resulting com- 
binations of information and template distributions 
was run for a total of 50 simulation trials. The 
mean and standard deviation of the trials were ob- 
tained for each combination. 


Results 


The data concerning mean number of trials 
before solution for various conditions are 
summarized in Table 6, The data are cross- 


classified in terms of pattern size and number 
of persons with the template. As will be seen, 
however, a further breakdown of the data is 
based on the number of persons having the 
template who also had one of the patteri, e ele- 
ments at the start of the problem. The distri- 
bution of such overlap conditions is of course 
a function of the number of patterns and 
templates, and the number of distinct distri- 
butions for each generic combination of 
parameters is given in parentheses. 

To summarize the results shown in Table 
6: 

1. Other things being equal, the greater 
the number of elements in the pattern, the 
more trials required to solution. This is indi- 
cated by the i increasing means in the left to 
right direction in each row of the table. 


TABLE 7 


STANDARDIZED MEAN DETECTION TIME SCORES UNDER COMPARABLE CONDITIONS 
IN THE LABORATORY AND SIMULATION STUDIES 


Pattern size 
ý i 
ASS 1 2 3 4 
Laboratory | Simulation | Laboratory Simulation | Laboratory | Simulation | Laboratory | Simulation 
1 2.77 54 90 81 68 1.32 73 1.76 
2 —76 | 1.18 00 00 7 AT — 33 A 
3 —.90 —1.66 —.95 54 65 07 =.54 
4 1.20 —2.00 =i —81 — Al —.20 —.62 


| 
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2. Other things equal, the more group 
members in possession of the pattern tem- 
plate the fewer trials required to solution. 
This is shown by the decreasing means in 
each column compared at equal overlap val- 
ues, 

3. Other things equal, the greater the over- 
lap the fewer trials required to solution. The 
relevant data are the steadily increasing 
means in the short columns associated with 
each pattern-template combination. 

These results are of course all expected on 
‘the basis of commop-sense reasoning and 
mathematical analysis, to the extent that the 
latter is possible. They also agree with the 
fesults of the parallel empirical study with 
the one glaring exception of the one-element, 
one-template condition. 


Comparison of the Results 


Table 7 shows the performance measures 
under comparable conditions in the two 
studies, standardized in order to facilitate 
comparison. The laboratory data are raw 
time seores rather than the transformed scores 
used in the analyses of those data. Simula- 
tion scores are taken from the overlap condi- 
tions matching those used in the laboratory 
study, 

As this table indicates, actual groups do 
relatively better in the larger patterns (their 
Scores are lower) and much more poorly on 
the one-element patterns. The most pro- 
nounced difference lies in the one-element, 
one-template condition. The product-moment 
Corrélation of these score means is .64, but 
the rank correlation, which is less affected by 
the extreme scores, is .87. 

Tn order to account for discrepancies be- 
tween the laboratory and computer results, 
‘three procedural differences may be adduced. 
The first difference has already been pointed 
Out; the fact that the computer was not per- 
mitted to give up and respond with pattern 


“ot present.” Thus, although the mean de- 


tection time for the one-element, one-template 

Condition is quite long for the computer, it 

S not inflated by false negative trials as in 
e live group study. 


653 


The second procedural difference is that 
the computer simulation performance was 
scored on the time required for basic infor- 
mation transmission only. The live groups, on 
the other hand, needed additional time for. 
the cognitive process of perception and for 
the process of obtaining concurrence among 
all group members. The effect of this pro- 
cedural difference shows up most clearly 
in the one-element, four-template condition 
which results in zero detection time for the 
simulated condition while it requires a small 
but appreciable time for the group. 

A final point of difference lies in the ran- 
dom communication procedure used in the 
computer program as compared with possible 
priority schemes used by most groups. Al- 
though there is no explicit evidence on the 


‘nature of the priority schemes used in this 


task, it seems reasonable to suppose that 
group members who had templates, or tem- 
plates and some of the pattern elements, 
would be more likely to initiate communica- 
tions than other group members. It appears 
that this factor may in part account for the 
relatively good performance of live groups 
on the three- and four-element patterns in 
which communications would be quickly ini- 
tiated. 

These hypotheses as to the differences be- 
tween the laboratory and simulation results 
are all quite testable. The next step therefore 
is to translate them into modifications in the 
laboratory or computer procedures and to 
determine whether closer correspondence is 
obtained. In any case, the general plan of 
parallel simulation and laboratory studies 
here illustrated appears to offer real promise 
as an approach to complex problems in group 
performance. 
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INEFFECTIVENESS OF “OVERHEARD” COUNTERPROPAGANDA*™ 


TIMOTHY C. BROCK 
Ohio State University 


90 college women heard an E take a rigged phone call in which he advocated 
an event which was either neutral, unacceptable, or acceptable to the listeners. 
For 4 the listeners, E took the call in their presence and, for the others, he 
ran around to his office saying “I can’t talk about that here.” Dependent 
variable consisted of opinions on compulsory ROTC, increasing, and. de- 
creasing tuition. Data corroborated Walster and Festinger: overheard com- 
munications were especially effective for involved listeners when the advocated 
position was acceptable. However, overhearing was not more effective for the 
unacceptable propaganda, Results were ordered to a model in which informa- 
tion receptivity and communicator credibility combine multiplicatively to 


determine propaganda effectiveness. 


Walster and Festinger (1962) have clari- 
fied the conditions under which a regular 
communicator is less effective than an over- 
heard communicator, that is, a speaker per- 
ceived to be unaware of his listeners: 


when the position advocated by the persuasive 
communication is one which the involved subjects 
would like to accept, overheard communications 
are more effective for involved subjects and not 
particularly more effective for subjects not directly 
involved in the issue [p. 400]. 


Communications advocating positions which 
the listeners would not like to accept, were 
not employed, but Walster and Festinger 
suspected that 


if the communication urges the listeners in a direc- 
tion in which they do not want to move, quite 
different results might be obtained [p. 401]. 


In the present version of the Walster-Fest- 
inger experiment, conditions were included in 
which the communicator advocated a position 
contrary to that held by the listeners, Em- 
pirical knowledge about the potency of over- 
heard counterpropaganda will facilitate theo- 
retical understanding of the effectiveness of 
overheard communications, 


METHOD 


An after-only design with appropriate con- 
trols was employed. Subjects participated in a 


1 Supported by Grant GS-343 to the first writer 
from the National Science Foundation. We thank 
Elaine Walster for her comments. The assistance of 
James Berns is gratefully acknowledged. The senior 
author was at Iowa State University during the data 
collection. 
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AND 


LEE ALAN BECKER 


Towa State University 


“fingertip-sight” experiment during which one 
of the experimenters received a rigged phone | 
call. The first independent variable was the | 
experimenter’s contribution to the “phone 
conversation” in which he strongly advocated | 
a course of action which was either unat 
ceptable, acceptable, or a matter of indiffer- 
ence to the listening subjects. The second in- 
dependent variable was the listening condi- 
tion. For half the subjects, the experiiaentet 
took the call in the same room and, for the 
other half, he “ran around” to his office after 
saying “I can’t talk about that here.” In 
both conditions the conversation was per- 
fectly audible to the subjects. The dependent 
variable consisted of opinion scores on three 
issues: compulsory ROTC, increasing tuition, 
and reducing tuition. 


Subjects 


The subjects were 90 experimentally naive under- 
class women who paid a quarterly fee of $99. 


Procedure 


_ Fingertip-sight experiment. The subjects were Tui 
in groups of five by two experimenters. The sub- 
jects began by reading the following instructions. 


There are two parts in today’s project. Part onè 
is an experiment on tactual color perception. Part 
two is a survey of student opinion. Please follow 
the instructions attentively and quietly. It has 
recently been discovered by psychologists in the 
United States and Russia that some persons can 
discern the color of an unseen object by touching 
that object. We are testing for the ability Wy 
judge color by touch at Iowa State. l 

After you have finished reading these instruc | 
tions, the procedure will be as follows. You W 
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place black goggles over your eyes so that you 
cannot see anything; three colored squares of 
paper, covered with thin glass, will be placed in 
front of you; two of the squares will be of the 
same color and the third will be of a different 
color. You judge which color is different from 
the other two by touching and feeling the squares. 
Place one color square aside from the other two 
to indicate your decision and then place your 
hands in your lap. When everyone has finished a 
set of colors a new set will be presented. The 
experimenter will say “Ready Start” each time 
you are to begin touching a new set of three 
colored squares. A report on tactual color per- 
ception research will be mailed to you. No talk- 
ing is permitted once he experiment has begin. 


After the subjects had indicated their goggles 
Were securely placed, the experiment began. A trial 
was completed when all five subjects indicated that 
they had made their decision about the odd color 
by placing their hands in their laps. The average 
time per trial was about 2.5 minutes. There were 
eight trials in the entire experiment. After the eighth 
trial the subjects were told to remove their goggles 
and fill out an opinionnaire. On the last page of the 
opinionnaire they signed a “voluntary agreement” 
to refrain from discussing the experiment for 2 
weeks, Since the entire experiment was run in 1 
day, contamination was precluded. A mimeographed 
repor, on fingertip sighting (Plumb, 1964) was 
mailed to each subject.2 

Manipulation of overheard versus regular listening 
conditions, A phone was on view to all subjects when 
they entered the experimental room; it was placed 
on a sill extended into an adjoining office. The sill 
is about 3.5 feet above the floor and the opening 
above it is about 14 X 14 inches. A square panel 
on hinges closes the opening, This arrangement is 
common in many so-called “temporary” buildings 
Constructed on campuses and elsewhere during World 
War II; it enables the occupants of both rooms to 
share the same phone. 

At the end of the second trial, the first experi- 
menter, the speaker, electrically signaled an ac- 
còmplice in another part of the building. The ac- 
complice immediately dialed the number of the 
Phone in the experimental room. The speaker an- 
Swered and said he was “busy,” “cannot talk to any- 
one right now” and quickly replaced the receiver 
after saying “PIL call you back,” “I’m sorry I have 
to hang up.” At the end of the fourth trial the 
Speaker signaled his accomplice and again the phone 
rang. The speaker insisted that he “cannot talk 
now,” “Pm all tied up right now” but his ostensible 
caller was adamant and the speaker was “forced” to 
engage in a conversation. (There were three stand- 
ardized Conversations as described in a later section.) 

© Speaker read his part of the conversation from 


` ê Script while holding his thumb over the earphone 
——————— 


ae detailed description of procedures and results 
it the fingertip-sight experiment is available from 
© second writer. 
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of the receiver. The accomplice played a tape record- 
ing of the script into the telephone. When the speaker 
lifted his thumb, for “pause” in the script, the 
subjects heard the metallic “squawking” of the 
voice of the ostensible caller. The words and phrases 
from the other end were mainly incomprehensible 
to the subjects but those that did come through 
were, because of the tape recording, germane to the 
speaker’s message. During the second phone con- 
versation the speaker made two “unsuccessful” at- 
tempts to break it off. In sum, the impression of 
an authentic phone conversation was conveyed. 
The effect was well established in extensive pilot 
research. 

At the beginning of the seventh trial, the speaker 
elicited a third phone call from the accomplice and 
terminated it quickly and irritatedly. The first and 
third phone calls contributed to the overall impression 
of an experimenter harassed by unwanted interrup- 
tions. 

In the regular condition, the speaker, upon “fail- 
ing” to dismiss his caller, simply took the call in 
the experimental room. In the overheard condition, 
the speaker said “I’m sorry I can’t talk about that 
in here. I’m going to run around to my office. 
Please hold on.” As the speaker rushed out, the 
second experimenter said “Should we wait?” and 
the speaker called from the corridor “Go ahead, 
TIl be right back.” The second experimenter insured 
that the speaker’s conversation was completely 
audible to all the subjects by adjusting the afore- 
mentioned panel covering the opening between the 
experimental room and the adjoining office. 

Three communication treatments. There were three 
persuasive messages. The first, compulsory ROTC, 
advocated a resumption of the ROTC obligation for 
all physically fit male students. The second, reduce 
tuition, presented reasons for reducing the quarterly 
fee from $99 to $33. The third, triple tuition, 
presented reasons for increasing the quarterly fee 
from $99 to $297. It was assumed that coeds were 
less involved in the ROTC issue than they were 
in the amount of tuition they paid. It was further 
assumed that reducing tuition is an acceptable course 
of action for virtually all students while tripling 
tuition is an unacceptable event for all students. 
Hence, the three messages made it possible to com- 
pare regular versus overheard treatments for in- 
volving and uninvolving messages and for messages 
advocating acceptable and unacceptable events, 

Each of the three scripts, including interpolated 
pauses, required about 2 minutes to read. To begin 
the speaker said the “faculty board” had met “more 
than a dozen times this year” and its “conclusions 
throughout have always been unanimous on this 
problem.” In the compulsory ROTC treatment the 
speaker forcefully presented the “faculty’s” argu- 
ments for resumption of obligatory ROTC. Principal 
themes were: voluntary system had failed; ad- 
vantages of being an officer; services get 85% of 
officers from ROTC; West Point, Annapolis can- 
not do job alone; new freshmen don’t know the 
score and should be required to give ROTC and 
the university a “full chance.” 
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In the reduce tuition treatment, the speaker 
forcefully presented the reasons behind the “faculty’s 
recommendation” to “cut the quarterly registration 
fee by two thirds—to around $33 per quarter for 
all students.” Principal themes were: “grim fact” 
of students in good standing dropping out because 
of financial difficulties; popular misconception that 
working during college is valuable; in space-race 
America cannot afford not to educate a single talented 
person; high tuition inimical to land-grant ideals; 
faculty disgusted that wealth rather than ability 
determines acquisition of an education; fees can never 
be reduced too much, 

In the triple tuition treatment, the speaker force- 
fully presented the reasons behind the “faculty’s 
recommendation” that the “quarterly registration 
fee should be tripled . . . right, tripled . . . to just 
less than $300 per quarter for all students.” Princi- 
pal themes were: “grim fact” of 15% loss of gradu- 
ate faculty owing to inadequate salary; young en- 
thusiastic professors leave first, old drones are left 
to drone on; attenuation of excellence limits gradu- 
ates’ choices in job market; “well-known fact” that 
the average salary of our graduates is only $5,000 
compared with $7,000 for graduates of east and west 
coast universities; university had to turn away one 
million dollars in federal research grants for lack of 
appropriate facilities; “temporary” buildings are 
“eyesores,” library, a “jungle”; faculty and students 
disgusted with third-rate status; investment in 
education can never be too great. 


Dependent Variable 


Three items, keyed to the three treatments, were 
included in an opinionnaire containing many items 
dealing with current campus issues. For the com- 
pulsory ROTC treatment the key item was “As 
regards making participation in ROTC compulsory 
at ISU for physically qualified male students, I am . 
...” For the reduce tuition treatment, the key item 
was “As regards reducing the quarterly registration 
fee at ISU, from $99 to $33 per quarter, Iam... .” 
For the triple tuition treatment, the key item was 
“As regards increasing the quarterly registration fee 


TABLE 1 
Mean OPINION Scores on Tuition AnD ROTC Issuzs 
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at ISU, from $99 to $297 per quarter, I am . 
Answers were indicated along a 75-point scale, 
fifteenth point identified by labels running 
strongly in favor (75.0) to strongly opposed | 
Many of the dummy items in the opinionnaire 3 
very specific, for example, one concerned the 
of time that should be devoted to movies 
introductory psychology course and another 
with the exact amount of traffic penalties 
which no report should be made in the student 
paper. a 
All subjects answered the same opinionnaire 

cept that the key item for a communication tı 
ment appeared before the items which were 
to the other two treatments. To illustrate, the o1 
of*appearance of key items in the triple tuii 
treatment was increase tuition item, compi 

ROTC item, and reduce tuition item. This order 
the opinionnaire was reversed in the reduce tui 
treatment. 


Summary of Design 


Ninety subjects were randomly assigned to 
groups of five. These groups were randomly as 
to six treatments resulting from the two lis 
conditions, overheard and regular, and the 
communication treatments, compulsory ROTC, 
duce tuition, and triple tuition. The dependent vi 
ble consisted of measures of opinions about Ti 
ing ROTC, reducing tuition, and increasing #il 
There were 14 instead of 15 subjects in three of 
six treatments because 3 subjects did not keep 
appointments. 


RESULTS 


Corroboration of Walster and Festing 
Findings 
Walster and Festinger (1962) found 
overheard persuasive communications V 
not more effective than regular commun 
tions when the issue was not personally 


Opinion issues 


Communication treatment 


Increase tuition Require ROTC 
N M SD M SD 
Regular, compulsory ROTC 15 5.47 5.69 
Overheard, compulsory ROTC | 15 5.93 6.82 Sher 2030 
Regular, reduce tuition 15 4,93 5.48 22.27 16.21 
Overheard, reduce tuition 14 5.93 8.77 | 21.79 12.79 
Regular, triple tuition 14 1457 | 1361 | 2707 | 21.40 
Overheard, triple tuition 14 16.43 16.29 28.07 19.53 


Note.—The higher the score, the more favorable the opinion, 
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volving for the listeners. When the event 


advocated by the communication was both 
involving and acceptable to the listeners, the 
“ overheard communication was more effective 
than the regular communication, The present 
''yesults, shown in Table 1, corroborated the 
Walster-Festinger findings. 
Favorableness toward compulsory ROTC 
was generally greater in the compulsory 
ROTC treatment than in the control treat- 
ments, reduce tuition and triple tuition. Of 
the eight possible comparisons between the 
mean opinion scores in the compulsory ROTC 
‘treatments and the mean scores in the other 
treatments, six were statistically significant 
sat the .05 level, or better, by £ tests* and 
the other two comparisons were in the same 
‘direction. Although the compulsory ROTC 
treatment was shown to be effective, there 
was no evidence that, within that treatment, 
overheard was more powerful than regular. 
The reduce tuition communication was also 
_ effective. Favorableness toward reducing tui- 
tion was generally greater in the reduce tui- 
tion treatments than in the control treatments, 
compifisory ROTC and triple tuition. Mean 
opinion scores under reduce tuition were sta- 
tistically larger: all comparisons yielded ż- 
test p values of .05 or less. Within reduce 
tuition, the mean opinion score for the over- 
heard treatment was more favorable than for 
the regular treatment: t= 1.93, p < .06. 


Ineffectiveness of Overheard Counterpropa- 
ganda 


„The triple tuition communication was gen- 
erally effective: favorableness toward increas- 
_ Mg tuition was greater in the triple tuition 
treatments than in the control treatments, 
compulsory ROTC and reduce tuition. Of 
the eight possible comparisons between the 
: Mean scores under triple tuition and those in 
ine other treatments, five were statistically 
‘nificant by ż test at the .05 level and the 
Other three, at the .06 level. The mean score 
| for the overheard treatment was slightly 
ae than that for the regular treatment, 
hut this effect must be attributed to chance 


3 i 

i aa £ tests were two-tailed. Where variances 
t a heterogeneous an approximation by Edwards 
9; p. 168) was employed. 
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for the ¢ did not even approach reliability. In 
sum, the triple tuition treatment was effective, 
but there was no evidence that the overheard 
treatment was more powerful than the regular 
treatment. 


Correlations between Opinion Scores 


Correlations between scores on the three 
key items were computed for each of the six 
treatment combinations. Correlations involv- 
ing the compulsory ROTC item hovered close 
to zero in all treatments. The relationship 
between favoring an increase and favoring a 
reduction in tuition was reliably inverse in 
each of the treatment combinations. The 7’s 
ranged from — .45 to — .50. 

Since favoring an increase and favoring a 
reduction in tuition are opposed responses, 
one may ask why scores on the increase tui- 
tion item were not lower under reduce tuition 
than under compulsory ROTC. Similarly, 
those subjects who heard the triple tuition 
communication should have been different in 
their responses to the reduce tuition item 
than those who heard the compulsory ROTC 
communication. Both of these apparent in- 
consistencies were attributed to the strong 
opposition to increasing tuition. Scores on 
the increase tuition item were at one extreme 
and this ceiling effect may account for the 
failure of the reduce tuition treatment to 
have measurable influence on opposition to 
tuition increase. 


DISCUSSION 


The two main findings were a reproduction 
of the Walster-Festinger (1962) results and 
a failure to show an advantage for overheard 
counterpropaganda. Walster and Festinger 
found that overheard communications were 
more effective for involved listeners when the 
communications urged the listeners toward a 
position they would like to accept. The pres- 
ent overheard treatment was more effective 
than the regular treatment for the involving, 
and acceptable, position (reduce tuition) but 
not for the uninvolving recommendation 
(compulsory ROTC). It is worth emphasiz- 
ing that the present study replicated Walster 
and Festinger even though many aspects of 
the two methodologies were radically dissimi- 
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lar. The most noteworthy differences pertained 
to the speaker’s identification of the listeners 
prior to the treatment and his withdrawal 
from the presence of his listeners in the over- 
heard treatment. In the Walster-Festinger 
studies the speaker did not interact at all 
with his listeners and the choice of overheard 
or regular was perceived as up to the experi- 
menter, not the speaker. 

Within the same context in which the 
Walster-Festinger results were replicated, 
additional findings suggested that communica- 
tions advocating unacceptable events are not 
more powerful when they are overheard. The 
overheard effect can now be placed in sharper 
perspective and gains in theoretical under- 
standing are possible. 

Walster and Festinger considered three fac- 
tors which might make overheard communi- 
cations more effective. The overheard speaker 
aroused less defensiveness than the regular 
speaker; listeners were especially influenced 
by communications they were not supposed to 
hear; and ulterior motives were imputed to a 
greater extent to the regular than to the 
overheard speaker. Each of these factors, by 
itself, is insufficient to explain either the 
Walster-Festinger data or the present results. 

The listener’s defenses are likely to be in- 
stigated to the extent he perceives a direct 
attempt to influence. However, it surely 
seemed improbable that the speaker, who had 
tried to dismiss his caller, and who was try- 
ing to break off a phone conversation, was, at 
the same time, engaged in a direct attempt to 
influence the subjects. The speaker appeared 
to be no more prepared for the unwanted 
interruption than were his listeners. In both 
the regular and overheard treatments the 
subjects were occupied judging colors and 
therefore both treatments were, to some de- 
gree, overheard and not construable as a di- 
rect attempt to influence. Furthermore, the 
term “defenses,” as Walster and Festinger 
noted, is inapplicable when an acceptable 
position is advocated by the communication. 
as in the reduce tuition treatment. : 

The second factor, hearing things one is not 
supposed to hear, does not explain the lack of 
effect for the uninvolving issue, nor can it 
adequately account for the lack of effect 
when the persuasive communication was in- 
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volving but counter to the listener’s position, 
The third possibility, differential imputation | 
of ulterior motives, hardly fits the’ behavior 
of the present speaker who was at pains to| 
cut off the communication. There is the) 
further question, noted by Walster and Fest- 
inger, as to why involved listeners would 
impute ulterior motives to a speaker recom:| 
mending a desirable event. , 
A satisfactory account must explain why 
overheard persuasive communications are not 
more powerful when the issue is uninvolving 
and when an unacceptable position is advo- 
cated. As part of a recent theory of persuasion 
(Brock, 1964) two factors are identified! 
which relate multiplicatively in determining 
the effectiveness of propaganda: communi- 
cator credibility and receptivity (eagerness to 
expose oneself to information). As receptiv- 
ity approaches zero, differences in credibility, 
however great, have decreasing influence on 
propaganda effectiveness. This simple model 
clarifies the first experiment reported by Wal- 
ster and Festinger in which the effectiveness 
of the overheard speaker was limited, to the 
involved listeners, namely, the smokers. Both 
smokers and nonsmokers rated the overheard 
speaker as more honest, more sincere, and 
therefore more credible, than the regulat 
speaker. However, differences in speakers! 
credibility did not affect listeners’ opinions 
unless the listeners were involved in the com 
munication issue, smoking and lung cancel: 
Three recent studies (Brock, 1965; Feathety 
1962, 1963) have clearly shown that involved 
persons, smokers, are more receptive to M 
formation about smoking and cancer thal 
nonsmokers, Walster and Festinger’s smokes 
were undoubtedly more receptive to the com 
munications than the nonsmokers. Hence, the 
theorized multiplicative effect of receptivill 
and credibility explains why overheard com 
munications are more powerful for involv 
but not for uninvolved recipients. The expl# 
nation depends in part on the assumpti0?) 
supported by data from the first Walste™ 
Festinger experiment, that overheard com 
municators are considered more trustworthy 
than regular communicators. $ 
The credibility x receptivity formulatio 
integrates Walster and Festinger’s data, 
present replications of their conditions, aS wel 
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as earlier data on source credibility.* In the 
present study some additional concept is nec- 
essary to explain why the combination of 
high receptivity and high credibility did not 
“jncrease opinion change under the overheard 
counterpropaganda treatment. 
The believableness of the overheard speaker 
was undoubtedly augmented by the present 
‘technique in which he withdrew from the 
presence of his listeners. The listeners may 
well have regarded the speaker’s subsequent 
words as the unvarnished truth, free of inhi- 
_ bition and distortion.* Under these circum- 
stances, learning that the faculty board had 
recommended tripling the tuition probably 
mobilized more defensiveness than in the 
, regular treatment. For there is a tendency in 
informal social communication to withhold 
bad news from the appropriate recipients. 
When the bad news is really true, the likeli- 
hood that it will be withheld or not communi- 
cated in a straightforward fashion is in- 
' creased. Hence, if a communicator has at- 
tempted to conceal some adverse information, 
the trath of the information may appear es- 
pecially compelling. If, in the triple tuition 
treatment, the overheard speaker mobilized 
more defensiveness and fear than the regular 
Speaker, that arousal of concern and resist- 
ance could have countervailed any favorable 


‘The theorized combination of receptivity and 
credibility may resolve a long-standing discrepancy 
} appearing in the classical studies of source credibil- 

ity by Hovland and his associates. Walster and 

Festinger (1962, p. 395) and Hovland, Janis, and 

Kelley (1953, p. 35) have noted that in two studies 

(Hovland & Weiss, 1951; Kelman & Hovland, 1953) 
_ large differences in source credibility reliably influ- 

enced opinions while in a third (Hovland & Mandell, 

1952), the effect of source credibility was quite 

Weak. Reexamination of the three studies disclosed 
| that the opinion issues of the first study (antihista- 

ee atomic submarines, steel shortage) and of the 

i a study (treatment of delinquents) were timely 
a interesting for their audience. In contrast, the 

ate of the third study (Hovland & Mandell, 1952) 
Pi complicated and abstruse. The opinion issue 

= eae currency devaluation and the authors 
ES mmended that subsequent replications employ 
de kes issues, “particularly ones with a higher 
pS €e of ego-involvements [p. 587].” The inferred 
Mange of receptivity in the Hovland and 
differ ell study explains why the large manipulated 

BG, ence in communicator trustworthiness had only 
] eak effect on opinions. 
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effects of high receptivity and heightened 
credibility. 

Subsequent research requires direct tests of 
the emergent hypothesis: adverse information 
elicits belief and discomfort to the extent it © 
appears the communicator has expended effort 
to conceal or distort the information. The re- 
lationship of the communicator to the recipi- 
ent, whether friendly, neutral, or antago- 
nistic, would, of course, interact with the 
effort-to-conceal variable. The psychology of 
secrets is still unexplored terrain. 

A final question concerns the general sig- 
nificance of the effectiveness of overheard 
persuasive communications. Professional prop- 
agandists, opinion leaders, educators, par- 
ents, etc., are mainly concerned with urging 
their listeners in a direction the listeners 
would not like to move, at least initially, In- 
formal social communication is directed to- 
ward perceived deviates (Festinger, 1950). 
On the other hand, there exists a welter of 
inducements which can make a person adopt 
a more extreme position in his current values 
and opinions. The present results suggested 
that the power of overheard propaganda is 
restricted to moving persons in a direction 
they want to go anyway. The present analysis 
has attempted to place the “overheard” effect 
in a theoretical perspective in which com- 
municator credibility, together with other 
factors, predicts propaganda effectiveness. 
The overheard versus regular variation may 
be added to the list of factors which affect 
communicator credibility. 
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. Logically, perceptual defense (PD-as defined by Brown, a systematic relation- 
ship betwéen stimulus emotionality and the edse of recognition of stimuli) 
might be due to verbal response bias or to variables which inevitably cause 

& verbal response bias. In conflict with this interpretation, significant PD was 
~ found, in both males and females, in the absence of response bias in any of 
the measures used. Alternative interpretations are suggested. Some PD is 

readily created by verbal conditioning, but eliminated by a complex correction 

for several response tendencies. Another form of PD (emotionally biased 

recognition) is neither readily created by verbal conditioning nor eliminated 

by the corrections, Results have implications for PD research and for verbal 


conditioning studies. 


Perceptual Defense 


Several hundred psychological studies have 
suggested that recognition thresholds are dif- 
ferent for stimuli which arouse emotion than 
for those which do not (e.g., see reviews by 
Brown, 1961; Bruner, 1957; Goldiamond, 
1958; Inglis, 1961; Minard, 1963, 1965). In 
his monograph, Brown advocated that any 
systematic relationship between the emotion- 
arousing properties of a stimulus and recogni- 
tion threshold be descriptively referred to as 
perceptual defense. Although some have main- 
tained that the term be restricted to elevated 
thresholds to emotion-arousing words (Post- 
man, 1953, p. 71), such a restriction is not 
Suggested either by the results of previous 
Tesearch, in which emotion-arousing stimuli 
have had either elevated or lowered thresholds, 
or by other uses of the term defense, which 
do not restrict it to a failure to recognize 
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external stimuli (e.g., Fenichel, 1945; Freud, 
1937; Lazarus, 1954). 


Response Bias 


However perceptual defense is defined, 
many recent studies and reviews indicate that 
it may be due only to verbal response tend- 
encies of the sort shown by responses to blank 
slides (Goldiamond, 1958; Goldiamond & 
Hawkins, 1958; Goldstein, 1962; Goldstein, 
Himmelfarb, & Feder, 1962; Nothman, 1962).? 
If a simple response bias hypothesis should 
prove sufficient to account for all perceptual 
defense, the methodology of perceptual de- 
fense research and the conclusions typically 
drawn from it might be considerably simpli- 
fied by eliminating the stimuli altogether and 
directing attention only to verbal behavior. 

Unfortunately, studies supporting a re- 
sponse bias interpretation have typically not 
allowed the simultaneous measurement of 
tendencies present both in recognition and in 
verbal responses (Minard, 1965). 


Simultaneous Measurement 


Because of differences in perceptual defense 
and response bias which may occur from one 


2 From the response bias point of view, the most 
complete description of the relative importance of 
stimulus and response characteristics was probably 
given by Goldiamond (1960). 
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experimental situation or one subject group to 
another, perceptual defense and response bias 
should be measured in the same subjects in 
the same experimental situation, and as nearly 
“simultaneously as possible, in order for useful 
comparisons to be made between the two 
variables. This paper describes a variety of 
response bias measures which may be ob- 
tained during a perceptual defense study. 

Since different frequencies of prior experi- 
ence with the stimulus are reflected by re- 
sponse bias measures (Goldiamond & Hawk- 
ins, 1958; Spence, 1963), the simultaneous 
measurement of perceptual defense and re- 
sponse bias provides a check on the success the 
experimenter has had in the important task 
of equating emotion-arousing and neutral 
stimuli for frequency (Howes & Solomon, 
1950, 1951). 


Sources of Perceptual Defense 


It has proved useful, for practical and 
theoretical purposes, to divide the immediate 
sources of perceptual defense results into 
three categories: response biases which may 
affect the probability of chance accuracy in a 
way which may be demonstrated by any form 
of guessing, even when stimuli are not pre- 
sented at all; the guessing strategy subjects 
may develop during an experiment, a strategy 
completely shown only by responses to stim- 
uli; emotionally biased recognition in which 
the detection and interpretation of stimuli 
vary with the emotion-arousing properties 
of the stimulus in a way not accounted for by 
response bias. In completely accounting for 
response tendencies, it is necessary to con- 
sider guessing strategy. For example, if the 
subject guesses often and is very shrewd 
when guessing, any response bias which is 
present may have a great effect on the prob- 
abilities of chance accuracy for different 
stimuli. 

In addition to testing a rather simple form 
of the response bias interpretation of per- 
ceptual defense, this paper introduces one 
experimental design and method of statistical 
analysis which should result in measures of 
emotionally biased recognition relatively un- 
contaminated by the effects of purely verbal 
response biases. 
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Verbal Conditioning 


Conclusions based on the statistical p 
cedure will be checked experimentally 
means of verbal conditioning. All other thir 
being equal, it is known that verbal respo 
bias can be created by the experiment 
approval (Krasner, 1958; Salzinger, 19 
Sarason & Minard, 1963). Most interpret 
tions of psychological defenses suggest 
it should be difficult to create them by a i 
minutes of verbal approval. 

Therefore, before concluding that w 
seems to be perceptual defense is due to ve 
response bias, it should be demonstrai 
first, that the result is reduced to an insign 
cant level by correction for response teni 
encies, second, that a similar result can be 
created by means of verbal approval and, 
finally, that the created result is reduced to 
an insignificant level by correction for te 
sponse tendencies. 

Before concluding that a perceptual 
fense result is not due to verbal respo 
tendencies, it should be demonstrated 
the result resists both statistical correction 
for response tendencies and verbal condition- 
ing effective in changing these tendencies. 

The proposed use of verbal conditioni 
represents not only a technique of validatiot 
but also an extension of verbal conditio 
into problems having to do, not simply 
a response category based on the content € 
grammatical classification of a response, 
with a category based on the relation of th 
response to a prior stimulus. This structut ul 
characteristic of responses seems of consi 
able importance, but has been largely ign 
in verbal conditioning studies (Kra 
1958; Salzinger, 1959). 


Previous Studies 


Previously, Mathews and Wertheimeét 
(1958) used a technique for simultaneous 
measuring response bias and perceptual 
fense. They found “Ss scoring high on 
Hy [Hysteria] scale of the MMPI de 
strate a significant perceptual defense eHg 
over and above [p. 375]” their measu 
response bias. They concluded that 4 
ceptual defense effects cannot be accou 
for entirely on the basis of response 
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pression [p. 375],” a conclusion rather di- 
rectly „in conflict with that suggested by 
Goldiamond (1958), Goldiamond and Haw- 
kins (1958), Goldstein (1962), Goldstein 

"set al. (1962), and Nothman (1962). The 
experiment to be described provides a 
replication and extension of Mathews and 
Wertheimer’s procedure. 

In contrast with their results for the Hy 
scale, Mathews and Wertheimer did not find 
significant perceptual defense over and above 
response bias for subjects scoring high on the 
MMPI Psychasthenia (Pt) scale. This is one 
of the findings indicating that perceptual de- 

r ofense is affected by the subjects’ personal- 

| ities. The Hy and Pt scales were originally 

developed to detect patients with conversion- 
hysteria and psychasthenia, respectively. A 
high Hy score is achieved by reporting somatic 
complaints, sadness, and dissatisfaction, and, 
paradoxically, by frequently reporting those 
characteristics which suggest superior adjust- 
ment. A high Pt score is achieved by people 
suffering from worries, fears, preoccupations, 
and *obsessive behavior (Marks & Seeman, 
1963; Welsh & Dahlstrom, 1956). 

Brown (1961) believed that recognition 
threshold had a curvilinear relationship with 
stimulus emotionality, such that subjects will 
be unusually likely to recognize stimuli which 
are highly emotion arousing for them, for 
example, they will show perceptual vigilance. 
He found this conclusion supported by the 
main trend within previous research (e.g., 
Bruner & Postman, 1947; DeLucia & Stagner, 
1953). A similar formulation has recently been 
Suggested by Inglis (1961). When words were 
used as stimuli, Brown’s female subjects 
showed the predicted curvilinear relationship, 
but the males showed only the ascending, first 
Portion of the curve. He assumed he had not 
obtained words which were sufficiently emo- 
tion arousing to show the whole range of 

» Tesults for males in the particular experimental 
situation he was using, According to his inter- 

ı Pretation, both the characteristics of the sub- 
Ject’s Personality and of the experimental 

. Situation can serve to reduce the degree to 
Which stimuli arouse emotion and, hence, 

7 reduce the probability of obtaining perceptual 
vigilance. 

The perceptual vigilance which he demon- 
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strated for his female subjects is clearly 
crucial to the hypothesis of curvilinear rela- 
tionship between recognition threshold and 
stimulus emotionality. Without it there woulq 
be no curve and the relationship would 
simply be linear. 

The procedure to be reported was designed 
to select stimuli which were highly emotion 
arousing according to criteria quite similar 
to those used by Brown. 


A METHOD 
Subjects 


The 52 subjects were students in an introductory 
psychology course. Postexperimental interviews and 
the course instructor indicated that each subject 
was naive regarding perceptual defense and verbal 
conditioning. The MMPI Hy and Pt scales (Marks 
& Seeman, 1963) were administered in the classroom 
to groups of students (312 in all); those who were 
selected had Hy scale scores ranking in the upper 
quartile and Pt scores below the median for the 
group. In all, 26 males and 26 females were 
selected. Except for 4 subjects who dropped school 
and were replaced by subjects with identical Hy 
and similar Pt scores, all those selected served as 
paid volunteers. Median Hy scores of the male and 
female groups were identical (26); Pt medians dif- 
fered by 4 points (male median = 13, female = 9). 


Procedure 


In the individual stimulus selection procedure 
four emotion-arousing words were selected for each 
subject by objective criteria applied to a word- 
association test. For each emotion-arousing word a 
relatively neutral word was chosen to be structurally 
similar and equal in frequency in the English lan- 
guage. It may be important that, in the selection of 
emotion-arousing stimuli, unfamiliar and socially 
unacceptable words were avoided by the experimenter 
and a careful attempt was made to select stimuli 
which were emotion arousing for individual reasons. 
Chosen were common words for which the subject’s 
associative reaction time was at least long enough 
to be within the upper 8% of his times, a time 
long also with respect to other subjects’ times for 
that word; the response to the word almost always 
showed such other indications of personal significance 
and emotional arousal as an unusual association, 
stuttering, explanations and justifications of the 
response, blushing or perspiring. Among the selected 
emotion-arousing words were ACCEPT, ADJUST, BIBLE, 
CHILD, DUMB, DEPENDENT, DIVORCE, FIGHT, MAKE, 
SELF, and SIN. 

In the individual experimental session (held about 
a week after stimulus selection) the subjects at- 
tempted to identify stimuli which were tachistoscop- 
ically flashed in random order under conditions 
individually selected to result in a slightly, but sig- 
nificantly, better-than-chance frequency of accurate 
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stimulus recognition. No stimulus was unexpected 
since each subject held a card listing the selected 
stimuli. The stimulus presentation session consisted 
of four repetitions of the procedure used by 
‘Mathews and Wertheimer (1958) (128 stimulus 
presentations in all). This was extended by using 
additional techniques for measuring response bias 
and by conditioning during the last two repetitions 
of the procedure, using the experimenter’s approval 
of either neutral or emotional word responses. The 
contingency for approval depended on the experi- 
mental group to which the subjects were randomly 
assigned. In order to prevent fatigue or boredom, 
a 5-minute break was introduced between the first 
and last half of the stimulus presentation session. 
During this break the subjects completed a rating 
scale. 


Measures of Variables 


Response bias was measured by determining the 
probabilities of neutral and emotional responses 
within various groups of responses, all of which 
were assumed to be guesses because of their lack of 
correspondence to a discriminative stimulus; per- 
ceptual defense was measured by determining prob- 
abilities of neutral and emotional responses within 
the subset consisting of accurate responses, most of 
which were clearly not simply guesses, because of 
their correspondence to the stimulus. 

This conditional probability formulation yields a 
perceptual defense measure with several advantages. 
(a) It is conceptually comparable to the recognition 
thresholds used in many studies (since these thresh- 
olds are the result of the response probabilities 
obtained under a given set of physical conditions). 
Nevertheless, its validity is not dependent on the 
existence of a psychophysical threshold for complex 
stimuli. (b) It is directly comparable to the measures 
of response bias. (c) Stated in terms of equivalent 
percentages, conditional probabilities are more 
readily understood than thresholds. 

Response bias, A variety of different response bias 
measures were obtained. In each of the subsets 
used to obtain a measure, the percentage (P) of 
responses which was neutral showed whether or 
not response bias was present. Since half the pos- 
sible responses were neutral, a P (neutral response) 
greater than 50 indicated a bias favoring neutral 
responses; a P (neutral response) less than 50 
indicated a bias favoring emotional responses. (a) 
Smudged blank slides were randomly presented 
throughout the experiment (adding 32 presentations). 
Pilot studies and interviews revealed no difference 
between the way subjects regarded these stimuli and 
the way they regarded difficult-to-see words. Statis- 
tical comparisons (using student’s t) showed no 
significant difference between the probability of a 
neutral response to a blank slide and that to an 
unrecognized word. Since subjects were required to 
respond on each stimulus presentation, blanks pro- 
vided a sample of responses made in the absence of 
discriminative information. (b) Unknown to the 
subject only half the neutral and half the emotional 


J. G. Mmwam, D. E. BAILEY, AND M. WERTHEIMER 


_that it will be emotional, recognition thresholds 


words on his card were presented during a 
block of 40 presentations, Unpresented words 
so unlike the presented words that responses 
them were gross errors. Such errors providi 
basis of the Mathews and Wertheimer meas 
response bias. (c) Wholly inaccurate res 
(which were neither accurate, nor the nearly 
curate word paired with the presented word, 
vided a sample which was divided into errors 
when neutral words were presented and errors 
when emotional words were presented. Eacl 
vided a measure of response bias. Any dii 
in the response bias to the different kinds of 
would be evidence of a sort of stimulus diserin 
tion, in the absence of stimulus recognition. - 
Some responses would be expected to be act 
by chance, affecting any response bias measure 
on all wholly inaccurate responses. Therefore, 
of the response bias measures in c was CO) 
for the effects of chance accuracy and used J 
additional measure. (e) Immediately before 
after the verbal conditioning session, the sub; 
was asked to “name one word from each cart 
a deck of cards, on each of which was typed | 
of his neutral words and one of his emotio 
words. Every possible pairing and order were a 
cluded in the deck. Choices showed the subje 
preference for saying neutral or emotional words 
all, there were over a half dozen measures 0 
sponse bias, each obtained two or more times d irin 
the experiment (Minard, 1963). y 
Perceptual defense. Accurate responses $i 
stimulus perception, although they may occasion 
be only chance accuracies, As Goldiamond 
pointed out, when the probability that an accut 
response will be neutral differs from the prob: 


A, 


the two types of stimuli will differ. Hence, percep 
defense may well be assumed present. 

The probability of the neutral response Wi 
the subset of accurate responses was sufficient 
show either a raised or lowered threshold to m 
or emotional stimuli, Since half the stimuli and] 
sible responses were neutral and half were emotii 
a P (neutral response) greater than 50 Si 
that emotional stimuli were the most diffi 
recognize; if less than 50, neutral stimuli wet 
most difficult to recognize. Both for males @ 
females, the variance of P (neutral response 
accurate responses was insignificantly differen! 
that for blank slides, when variances were con 
by an F ratio. 

Corrected perceptual defense and response 
The perceptual defense measure and several 
response bias measures can be affected by act 
which occur merely by chance. For example, 
dominance of accurate responses might be 
due to chance accuracy and a bias in favor of 
guesses. Since all the stimuli and respo 
sibilities are known and response bias measu 
readily available, corrections for such chance 
curacies can be made by applying the 
probability theory. In applying these rules, 
not sufficient simply to consider respons! 
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Rather it was necessary to consider and include in 
the correction formulas two other tendencies which 
may prove of psychological importance in their own 
right: 

1, The tendency to use particularly appropriate 
responses in guesses. In this experiment a measure of 
this tendency was provided by the proportion of 
guesses to blank slides which named words in the 
group being presented during a given block of 
40 trials. An increase in this proportion would occur 
if careful inferences were made by the subject during 
the course of the experiment and might be related 
to a more general ability to infer what is appropriate 
and hence to guess more accurately. 

2. The tendency tg guess more frequentiy in 
response to certain barely recognizable stimuli, 
depending on the degree to which the stimuli afouse 


< e emotion. Particularly wild responses to emotion- 


} 


arousing stimuli could be interpreted to support the 
conception that anxiety is characterized by interfering 
responses (Child, 1954; Goldstein, 1961; Sarason, 
1959), In this experiment the frequency with which 
each subject’s responses were wholly inaccurate 
could be readily and objectively determined for 
both neutral and emotional words by noticing the 
frequencies with which subjects’ responses wholly 
failed to correspond to the presented stimuli and 
correcting these frequencies for the effects of chance 
accuracies, 

Tp short, it was necessary not only to consider 
the simple response bias shown by P (neutral re- 
sponse) to blank slides, but also to consider the 
guessing strategy” which affects the probability 
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of chance accuracy in more subtle ways. In fact, the 
analysis demonstrated that correcting for simple 
response bias is not of great importance to this 
study, whereas correction for subjects’ guessing 
strategy is crucial, The complete correction før 
response bias will not be described in detail here, 
but has been described by Minard (1963). 

Wherever chance accuracies could affect a response 
bias or perceptual defense measure, corrected measures 
were computed. 


RESULTS 
Response Bias 


The dozen or more response bias measures 
revealed insignificant findings. There was no 
evidence of a preference for either neutral or 
emotional responses nor were there significant 
differences between different response bias 
measures. Neither were there response bias 
differences between males and females. The 
slight, statistically insignificant trends which 
were present suggested that males were biased 
in favor of emotion-arousing words and fe- 
males were biased in favor of neutral words. 
Some of these trends are shown in Figure 1. 


Perceptual Defense 


According to a verbal response bias inter- 
pretation, perceptual defense should not have 


VARIABLE 


PERCEPTUAL DEFENSE 
RESPONSE BIAS A 
RESPONSE BIAS B 
RESPONSE BIAS C 


MALE 
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EL 
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Fic. 1. Students’ ¢ values suggesting failure of response bias to account for perceptual defense. 


(Positive t values, 


to the right of center, suggest an elevated threshold to emotional words and a 


lower response probability for them. Since a ¢ of 1.708 was required for significance at .05, 


“perceptual defense” is significant 
curate responses, “Response Bias A” 


Bias B” was based on responses naming unpresented words, 


etrors to presented emotional words.) 


but response bias is not. Perceptual defense was based on ac- 
was based on responses to smudged blank slides, “Response 


“Response Bias C” was based on 
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TABLE 1 


PERCENTAGE OF NEUTRAL RESPONSE WITHIN THE SUB- 
SET OF AccurATE NEUTRAL AND EMOTIONAL RE- 
SPONSES CORRECTED FOR EFFECTS OF “GUESSING 


STRATEGY” 
2 


M t 
Males 52.78 1.460 
Females 46.63 2.140* 
Sex difference 6.15 2.491* 


* p < .05, two-tailed test. 


been present. However, as Figure 1 shows, it 
was; the probability of an accurate response 
clearly differed with the emotion-arousing 
properties of the stimulus as well as with the 
sex of subject and did so in a way not to be 
accounted for by response bias. Males ac- 
curately recognized 59.62% of the neutral 
words; females recognized 41.33%. The dif- 
ference of 18.29 yields a student’s ¢ of 2.982 
(p < .01). 


Corrected Perceptual Defense 


Although a verbal response bias hypothesis 
had nothing to contribute to the explanation 
of male or female perceptual defense, the 
more complete correction procedure did. Cor- 
rections for subjects’ “guessing strategy” did 
account for the predominance of neutral re- 
sponses among the accurate stimulus identifi- 
cations by male subjects. 

However, the predominance of emotional 
word responses among accurate stimulus 
identifications by female subjects remained 
statistically significant after correction, and 


TABLE 2 


PERCENTAGE OF NEUTRAL RESPONSE WITHIN THE 
SUBSET OF ACCURATE RESPONSES WITH AND WITHOUT 
CORRECTION FOR THE EFFECT OF “GUESSING 

STRATEGY” 


Group approved for: 


Measure 
Emotional responses | Neutral responses 

Uncorrected 

M 47.758 57.064 

t 540 2.103* 
Corrected 

uM 48.640 51.520 

t +893 1,197 

* p}< .05. 
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because of this, males and females continued 
to differ on the corrected measure (Table 1). 

The measurement of response bias in 
perceptual defense was extended to more than 


twice the length of the original Mathews and , 


Wertheimer study because it was expected 
that the subject would serve as his own 
evaluative listener and that perceptual de- 
fense might change. Significant change was not 
observed, although there was a trend reducing 
the deviation of all values from 50%, that is, 


a trend toward a matching of the response : 


probability with the siimulus presentation 
probability. 


Verbal Conditioning 


The early verbal conditioning trials were 
uninformative. The final conditioning results 
obtained with uncorrected and corrected per- 
ceptual defense measures supported conclu- 
sions suggested by the statistical corrections. 
Conditioning created a 9.31% group dif- 
ference in perceptual defense (ż = 2.505, $ 
< .01), a difference reduced by the corrections 
to a statistically insignificant 2.88%. Table 2 
shows that a predominance of accurate neutral 
responses (which corrections suggested were 
due to response tendencies) was created by 
verbal approval. As should be the case, cor- 
rections for response tendencies removed all 
evidence of this predominance of accurate, 
neutral stimulus identifications by the experi- 
mental group. 

A predominance of accurate emotional re- 


sponses (which corrections suggested were due, 


to stimulus recognition) was not created by 
verbal approval. Thus both correction and 
conditioning indicated that this predominance 
of accurate identifications of emotional stimuli 
was not due to verbal response tendencies, 
but to a more stable process of visual recogn! 
tion. 

Results suggested that the experimental 
sample included subjects whose predominancé 
of accurate neutral responses was due to col 
ditionable response tendencies, and also sub- 
jects whose predominance of accurate em? 
tional responses was not affected by the 


experimenter’s approval. As suggested, 4 51 


nificant group difference after conditioning was 
found in the accurate neutral responses ° 
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those subjects whose responses were pre- 
dominantly neutral prior to conditioning (¢ = 
1.850, p < .05). No such difference was found 
in the accurate emotional responses of those 
subjects whose responses were predominantly 
emotional prior to conditioning. In fact, the 
very slight group difference in these subjects 
was in a direction opposite to that which 
would have been produced by conditioning. 


DISCUSSION 


In many respects, results of this experi- 
ment support and eextend those of previous 
research. In addition they point toward resolu- 
- tions of current conflicts between the inter- 
pretations of perceptual defense studies and, 
also, toward new lines of research. 


Immediate Determinants of Perceptual De- 
jense Results 


As Mathews and Wertheimer (1958) found, 
verbal response bias did not account for 
perceptual defense results; such bias was 
absent in this sample. It may well be that 
response bias was so unimportant in their 
study because their emotion-arousing stimuli 
were familiar and socially acceptable, and 
therefore did not elicit marked response sup- 
pression. 

With familiar, socially acceptable stimuli, 
verbal response bias is not necessary for the 
demonstration of perceptual defense. For 
example, in this study, perceptual defense was 
present, despite the absence of significant re- 
sponse bias. Trends in the male and female 

_ Tesponse bias were such that they failed to 
‘predict the kind of perceptual defense found 
in either sex. Clearly, all perceptual defense 
cannot be explained by any hypothesis which 
considers only response bias or the variables 
which cause it. 

However, subjects’ guessing strategy did 
account for some perceptual defense results. 
This should be considered a stimulus-response 
bias rather than the purely verbal bias which 
would have been demonstrated in responses 
to blank slides. Presumably, with socially un- 
acceptable stimuli, purely verbal response 
bias would have been important in the results 
obtained with subjects reluctant to say the 

dirty words” (Zajonc, 1962). 
Perhaps most important, there were results 
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which resisted a rather complete correction 
for response tendencies, and also resisted the 
effects of reinforcement sufficient to change 
verbal response tendencies, that is, there is 
evidence of emotionally biased recognitions 

A number of conflicts in the interpretation 
of perceptual defense might be resolved if it 
were accepted that these results have several 
determinants. Even crude attempts to measure 
these determinants seem preferable to the 
implicit assumption that perceptual defense 
has only one of them as its source. 

The importance of each of the determinants 
of perceptual defense results may vary with 
the characteristics of stimuli and subjects. 
Stimuli may be socially unacceptable and elicit 
response bias, or they may be socially ac- 
ceptable (but personally emotion arousing) 
and elicit perceptual defense not accounted 
for by response bias. Subjects who are high 
on the MMPI Hy scale and low on the Pt 
scale have been found to be capable of per- 
ceptual defense in the absence of response 
bias. Such a result was suggested by previous 
research (e.g., Mathews & Wertheimer, 1958) 
and seemed interesting because of the inter- 
pretations applied to such MMPI scores, for 
example, that high Hy scorers may convert 
emotional problems to somatic symptoms, 
which may include either an increased or 
reduced sensitivity to certain stimuli (e.g., 
Welsh & Dahlstrom, 1956). The detailed 
results of this study cannot be safely gen- 
eralized beyond the limits of a college popula- 
tion with high Hy and low P# scores. How- 
ever, in view of the frequency with which 
perceptual defense has been found in other 
samples (Brown, 1961) it would seem fruit- 
ful to explore the hypothesis that perceptual 
defense will be found rather generally but 
that its determinants will vary with the 
personality of the subject. 


Conditioning 

The evidence for verbal conditioning of 
perceptual defense is similar to that found 
by other investigators (Ullmann, Weiss, & 
Krasner, 1961). Perceptual defense shows a 
modifiability which raises questions regarding 
its perceptual nature (e.g., Postman, Bronson, 
& Gropper, 1953). However, emotionally 
biased recognition did mot show convincing 
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evidence of conditioning in this study, and 
may not have been involved in the changes 
demonstrated by others. 
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SOURCE CREDIBILITY AND BEHAVIORAL COM- 
PLIANCE AS DETERMINANTS OF ATTITUDE 
CHANGE * 


FREDRIC A. POWELL? 


Boston University 


The experiment explored the influence of behavioral compliance upon the at- 
titude change effects of high and low source credibility. Hypotheses derived 
from dissonance theory were tested, using a 3 X 2 factorial design, in a sample 
of university upperclassmen. The results suggest that (a) high credibility 
sources effect greater positive attitude change in cases of noncompliance, (b) 
low credjpility sources effect greater positive, attitude change in cases of 
voluntary compliance, and (c) there is no difference in the attitude change 
effects of high and low credibility in cases of involuntary compliance. It was 
concluded that high credibility sources are not, in all instances, the most 
effective attitude change agents; a low credibility source, at least in the case 
of voluntary compliance, may prove more effective in inducing positive at- 


titude change. 


In recent years, a variety of hypotheses in 
the study of attitude-change phenomena have 
been generated by Festinger’s (1957) theory 
of cognitive dissonance and subjected to em- 
pirical test. One such hypothesis asserts that if 
an individual engages in overt behavior dis- 
crepant with an attitude which he holds, 
he will experience cognitive dissonance. Fes- 
tinger’s theory suggests that if other means 
of obviating that dissonance (e.g., denying 
that he had engaged in the discrepant be- 
havior or deciding that he had little or no 
choice in doing so) are not readily available 
to him, the individual may be impelled to 
change one or more of his existing attitudes 
in the direction of the discrepant behavior. 
Investigations reported by Aronson and Mills 
(1959), Brehm (1959, 1960), Cohen, Brehm, 
and Fleming (1958), and Festinger and Carl- 
smith (1959), among others, have demon- 
strated this to be the case. 


‘This article is taken from a dissertation sub- 
mitted to the Department of Communication, 
Michigan State University, in partial fulfillment of 
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pi The author is indebted to Hideya Kumata, 
Tee K. Berlo, Malcolm S. MacLean, Jr. and 
poig „C. Troldahl of the Department of Com- 
Sean: Frederick Waisanen of the Sociology 

epartment; and Milton Rokeach of the Psychology 

partment at Michigan State University, for their 
eaae and encouragement in planning and con- 
ucting the study. 


One determinant of the magnitude of at- 
titude change effected when an individual 
has engaged in attitude discrepant behavior 
is his freedom or lack of freedom in choosing 
to engage in that behavior; that is, whether 
or not this discrepant behavior is voluntary 
or involuntary. Dissonance theory suggests 
that 


under those conditions where the individual experi- 
ences least external pressue to engage in discrepant 
behavior or consider a piece of counter-propaganda 
he will become most positive toward the induced 
position [Cohen, 1960, p. 317]. 


In those cases in which the individual experi- 
ences the greatest subjective choice in whether 
or not to engage in the discrepant behavior, 
the dissonance effects of having done so are 
maximized. Consequently, he can be expected 
to change his attitude in the direction of the 
discrepant behavior to a greater degree than 
a situation in which the behavior is not 
voluntary, but forced, compliance. The at- 
titude-change effects of choice in engaging in 
discrepant behavior are demonstrated in the 
findings of Brehm (1959), Brehm and Cohen 
(1959), Brock (1962), and Davis and Jones 
(1960). 
Cohen (1960) further suggests that: 


It also follows from the theory that if a com- 
municator or inducing agent is viewed positively, 
complying with his request should produce less dis- 
sonance and consequently less attitude change 
toward the event than under conditions where a 
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person complies with the requests of a negative or 
disliked communicator (p. 319). 


Zimbardo (1960) has also suggested that a 
negative communicator may be a more effec- 
tive attitude-change agent than a positive 
communicator in compliance situations. He 
bases this expectation on the premise that 
there would be less justification in the mind 
of the individual for his compliance with the 
wishes of a negative communicator, hence 
more dissonance when discrepant behavior is 
induced, and consequently greater positive 
attitude change. s 

A series of experiments reported by Smith 
(1961) tend to confirm the prediction of 
Cohen and Zimbardo. Although the evidence 
provided by Smith is not unequivocal, it sug- 
gests that for persons complying with a 
request for behavior discrepant with their 
established attitudes, the more negative the 
communicator, the greater is the consequent 
dissonance and attitude change. Having once 
complied, such individuals can reduce their 
dissonance by modifying their attitudes to be 
more consistent with the compliant behavior. 

Clearly, the attitude-change effects postu- 
lated by Cohen (1960) and Zimbardo (1960), 
and the findings of Smith (1961), are not 
wholly in accord with the conclusions generally 
accepted in the source credibility literature. 
The many studies of source credibility, includ- 
ing those of Hovland and Weiss (1952), 
Kelman and Hovland (1953), and Weiss 
(1963), all emphasize one underlying prin- 
ciple: the content of a message is made more 
believable and acceptable by its link with a 
“credible?” source. More specifically, it is 
asserted that a high credibility source is 
significantly more effective in inducing posi- 
tive attitude change than is a low credibility 
source when the same message is attributed 
to the two sources. 

However, a condition in which a low credi- 
bility source may be more effective than one 
of high credibility in producing positive at- 
titude change is clearly implied by dissonance 
theory. Specifically, dissonance theory sug- 
gests that if the receiver complies, either 
voluntarily or involuntarily, with the recom- 
mendation of a source, a low credibility 
source will effect greater attitude change 
than will a high credibility source. 
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It may be expected that in cases of be- 
havioral compliance with the source’s position, 
low credibility sources will effect greater posi- 


ability to muster sufficient justification for! 
his compliance. The most relevant cognition, 
that of source credibility, is, in fact, antago- 


case, the receiver experiences greater cogni- 
tive dissonance and is more likely to reduce 
that dissonance by attitude change in the 
direction of the compliant behavior. In the! 

case of compliance with the position of a high | 
credibility source, that source’s credibili 
provides justification for having complying; 
dissonance is not heightened, and attitude 
change is less than when compliance is in- 
duced by a low credibility source. 

Further, it may be expected that in cases of 
behavioral compliance, greater positive at 
titude change will be effected when that 
compliance is voluntary than when it is im] 
voluntary. In the latter instance, the fact that 
the receiver had no choice in the decision to 
comply with the source’s position provides him 
with additional justification for his compl 
ance. Able to reduce the dissonance created 
by compliance, at least partially, in he 
knowledge that he was forced to comply, the 
attitude-change effects of that compliand 
should be minimized. Thus, the lower @ 
credibility of the source inducing complia 
and the greater the choice the receiver has if | 
complying, the greater are the expected posh | 
tive attitude-change effects as a result of 
compliance. y 

The purpose of this study was to explore 
within the dissonance-theory framework, 
implications of behavioral compliance 
choice or lack of choice in complying, for 
explanation and prediction of the attitude 
change effects of high and low source credi 
bility. 

Three hypotheses were tested: 

1. Given behavioral compliance on the 
of a receiver, low credibility sources are mø 
effective in inducing positive attitude chanig 
than are high credibility sources. j 

2. Given compliance, both high and low 
credibility sources are more effective in # 
ducing positive attitude change when Waa 


j 


SOURCE CREDIBILITY, COMPLIANCE, AND ATTITUDE 


compliance is voluntary than when it is in- 
voluntary. 

3. dn the absence of compliance, high 
credibility sources are more effective in in- 
ducing positive attitude change than are low 
credibility sources. 

These hypotheses, then, posit an interac- 
tion in the attitude-change effects of source 
credibility and behavioral compliance or 
noncompliance, and in the attitude-change 
effects of source credibility and choice in 
complying. 


METHOD 
Subjects 


% Seven Michigan State University summer school 
classes, composed of 177 junior and senior students, 
were used as subjects. Five classes from the College 
of Business and two from the College of Social 
Science were utilized. Although not representative 
of all university level classes, or of all university 
upperclassmen, the seven classes were not signifi- 
cantly different in terms of any relevant and pos- 
sibly biasing variable—sex, age, academic class, 
prior experience with the attitude object (the Red 
Cross blood donor and research program), or ex- 
pressed interest in the attitude object. 


o 
Experimental Variables 


The two experimental variables—source credi- 
bility and behavioral compliance (including the 
implicit variable of choice in complying)—were 
directly manipulated. 

A pilot study was conducted to determine the 
high and low credibility sources used in the experi- 
ment. Using a credibility rating scale devised by 
Lemert and Berlo (1963) 29 Michigan State Uni- 
versity upperclassmen, comparable to subjects in 
the primary study, evaluated 13 potential sources 
of American Red Cross messages in terms of their 
„Competence or expertise, their trustworthiness or 
safety, and their dynamism. Of the 13 potential 
sources, the one accorded highest credibility by pilot 
study subjects was “Public Relations Counsel, 
American National Red Cross,” and the source 
attributed lowest credibility was “Volunteer Worker, 
Ingham County Red Cross Chapter.” The experi- 
menter, in later orally presenting the experimental 
message, assumed one or the other of these identities 
in the appropriate treatment groups. 

Three behavioral compliance conditions were ef- 
fected—voluntary compliance, involuntary compli- 
ance, and noncompliance (message-only condi- 
tion). In the voluntary compliance conditions, 
subjects were asked either by the high or low 
credibility source to donate a small portion of 
their blood to the Red Cross blood program for 
Tesearch purposes. They were then asked to make 
an appointment for the purpose of taking the 
blood sample, thus overtly manifesting their 
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voluntary compliance. In an effort to create in 
these subjects an active commitment to give 
blood at a specified time, the appointments were 
made for that afternoon. (Those subjects who did 
not comply were excluded from the analyses that 
follow. Of 31 subjects in the high credibility, 
voluntary compliance treatment group, 3 failed to 
make an appointment; of 38 subjects in the low 
credibility, voluntary compliance treatment group, 7 
failed to make the commitment.) Subjects in the in- 
voluntary compliance conditions were told that they 
had no choice but to permit taking of the blood 
sample, and that the sample would be taken in the 
next few minutes. It was merely suggested to sub- 
jects in the noncompliance (message-only) condi- 
tions hat they participate in the blood research 
program if and when the opportunity presented 
itself. The attitude-change effects in the noncom- 
pliance conditions were wholly dependent upon 
the persuasive character of the basic experimental 
message attributed either to the high or low credi- 
bility source. 


Procedure 


A before-and-after experimental design was em- 
ployed. In the pretest phase of the experiment, 
subjects’ initial attitudes toward the presumably 
neutral attitude object (the American Red Cross 
blood program) were measured. (Analysis of pretest 
attitude scores revealed that the blood program 
was, for most subjects, a neutral attitude object.) 

Approximately 4 weeks later, the experimental 
treatments were administered in six of the seven 
groups of subjects; the seventh group serving as an 
“unexposed” control group. The experimental manipu- 
lations entailed exposing the classes, as intact 
groups, to the persuasive message under one of the 
treatment conditions: 


High credibility source, voluntary compliance 

High credibility source, involuntary compliance 

High credibility source, noncompliance (message- 
only) 

Low credibility source, voluntary compliance 

Low credibility source, involuntary compliance 

Low credibility source, noncompliance (message- 
only) 


Three versions of the basic message, designed to 
induce attitude change in the direction of increased 
favor of the Red Cross blood program, were pre- 
pared. The basic content of all three message versions 
was the same, describing the scope and goals of the 
Red Cross blood program. Each version concluded 
with an appeal for subjects’ participation in the 
blood research program, variations in this con- 
cluding appeal constituting the compliance manipu- 
lations. 

The experimenter, posing as the high or low credi- 
bility source, was introduced to the subjects by the 
class instructors, using a prepared introduction 
designed to emphasize the high or low credibility of 
the message source. The introductions stressed the 
point that the departmental administration and 
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class instructors were cooperating with the Red 
Cross in permitting the source to address the classes 
and take blood samples. Thus, subjects, in the 
voluntary compliance condition were given the im- 
pression that, while the purposes of the source had 
the approval and support of the departmental 
administration, they were free to comply or not 
comply as they wished. Those in the involuntary 
compliance condition, however, were given to under- 
stand that they had been committed to compliance 
by authorities other than themselves, thereby en- 
hancing the cognition of forced compliance. 

The control group was not exposed to the 
experimental message, functioning as a base-line 
group against which attitude changes noted in the 
experimental groups were compared. This „group 
also served as a control on the attitude-change 
effects of extraneous, otherwise uncontrollable 
events bearing on the criterion measure. 

Immediately following administration of the ex- 
perimental treatments, subjects’ attitudes toward 
the critical attitude object were reassessed, per- 
mitting comparison of pre- and posttest attitude 
scores and providing a measure of attitude change 
accruing from the experimental manipulations. 


Pre- and Posttest Measures 


The pretest instrument administered to all sub- 
jects consisted of 25 Likert-type items assessing at- 
titudes on a variety of health and medical topics. 
Ten of these items, forming a Likert-type attitude 
scale and interspersed randomly among 15 “mask- 
ing” items, constituted the measure of subjects’ at- 
titudes toward the critical attitude object. The 10 
items comprising the attitude measure were developed 
in a second pilot study purportedly measuring the 
effects upon public opinion of discussions and 
criticisms of the Red Cross blood program carried 
by the national news media. (Illustrative of the 
items included in the attitude measure is the 
statement, “Everyone’s life is in the hands of God— 
therefore, I can’t support any blood donor pro- 
gram.”) Subjects indicated the extent of their agree- 
ment or disagreement with all 25 items; only the 
10 items concerned with the critical attitude ob- 
ject were scored and evaluated. 

Administered immediately following the experi- 
mental manipulations, the posttest instrument did 
not incorporate the masking features of the pretest 
instrument. The masking items were not used again 
inasmuch as the subjects were now aware of the 
experimenter’s interest in their evaluations of the 
Red Cross blood program. This questionnaire in- 
cluded only the 10 relevant attitude items; the 
attitude change effected by the various experimental 
treatments being indexed by pretest-posttest differ- 
ences in the subjects’ total attitude score (summed 
over 10 items). 

The posttest instrument also contained credibility 
rating scales (those used in the credibility pilot 
study) for the subjects’ evaluations of the high or 
low credibility source from whom they had heard 
the persuasive message, together with the following 
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classificatory items: age, sex, and academic class of 
the subject; whether the subject had ever donated 
or received blood through the Red Cross; whether 
close friends or relatives had ever donated or ‘eceived 
blood; and whether the subject was interested in 
learning more about the Red Cross blood donor 
and research programs. 

The pretest and posttest instruments were ad- 
ministered to the subjects by the class instructor, 


RESULTS 
Initial Attitude 


The mean attitude scores obtained in the 
pretest phase of the experiment revealed that 
the seven groups were nòt entirely homogene- 
ous in their initial attitudes toward the Red 
Cross blood program. Analysis of differences in 
mean pretest attitude measures, using the 
procedure suggested by Dunnett (1955) for 
comparing a number of treatment groups with 
a control, indicated that the control and low 
credibility, noncompliance groups differed sig- 
nificantly in mean attitude score (¢ = 209 
p< .05). All other comparisons of mean at- 
titude scores between groups were not sig- 
nificant (p > .05). j 

Despite the one significant difference neted, 
the seven groups were considered homogeneous 
in their mean initial attitude toward the crit- 
ical attitude object. This assumption was 
given added credence by the fact that initial 
attitude score variances among the seven 
a were homogeneous (x? = 12.30, $> 
05). 


Credibility Manipulations 


In order to test the success of the credibility 
manipulations, the mean credibility ratings 
given the two sources by the subjects were 
compared. Subjects in the high credibility | 
treatment groups gave the source a meal | 
rating of 53.0 (of maximum 70); those in the 
low credibility groups gave the source a mean | 
rating of 42.5. The difference in mean credi- 
bility ratings of the two sources was signifi- 
cant (t = 6.98, p <.01). An examination 0 
the ratings on each of the three credibility 
dimensions comprising the total credibility | 
scale showed that the credibility attributed 
the high credibility: source was chiefly due t0 
his perceived competence, while he was 10 ey 
seen as particularly trustworthy or dynamit. 
The low credibility source, on the other han, | 
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was perceived as relatively trustworthy, but 
neither competent nor dynamic. 


Attitude Change 
Prior to testing the hypotheses of this 


study, it was first determined, using Dunnett’s 
procedure, whether or not the experimental 
manipulations had, in fact, produced any 
significant attitude change. The mean at- 
titude-change scores and standard deviations 
of the experimental and control groups are 
presented in Table 1. 
» Comparison of the mean attitude change 
scores of the experiméntal groups with that 
of the control group indicated that all were 
significantly different from the control group 
with the exception of the high credibility, 
involuntary compliance treatment group ($ 
< .05). The absence of mean attitude change 
significantly different from that of the con- 
trol group in this particular treatment group 
seems in line with dissonance-theory expecta- 
tions. That is, in this treatment group, subjects 
could be expected to reduce dissonance, not 
simply by changing their attitudes in the 
direction advocated by the source, but by 
justifying or rationalizing their compliance in 
the knowledge that they had been forced to 
do so by a high credibility source. Con- 
sequently, the attitude change effected in the 
high credibility, forced compliance treatment 
group could be expected not to differ sig- 
nificantly from that of the control group and 
to be less than that effected in the high 
credibility, noncompliance treatment group. 
The data were also tested for heterogeneity 
of variance before tests were made of the 
| Wpotheses, Application of Bartlett’s test of 
_ lomogeneity indicated that the variances of 
_ ‘he experimental and control groups were, in 
"fact, homogeneous (z = 1.83, p > .05). 
Empirical evidence supporting or failing to 
Support the attitude-change hypotheses was 
obtained by testing the significance of the 
differences between mean attitude change 
Scores of the six experimental groups.* The 
| lnweighted averages method for the analysis 
x differences in group means (Snedecor, 
. 1956) was employed for this purpose. 
The results of the analysis of differences 


8 se s, 
ge minimum .05 level of significance was used in 
i g each of the hypotheses. 
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TABLE 1 
ArtiTuDE-CHANGE SCORES, MEANS, 
AND STANDARD DEVIATIONS 
Compliance 
Ei Voluntary | Involuntary No AU cong 
M |SD| M|SD| M | SD| M | SD 
High | 3.00 | 1.83 | 1.29 | 1.98 | 1.62] 1.42 | 2.00 | 2.29 
(28)* (24) (29) (81) 
Low 5.55 | 1.76 | 1.84 | 2.75 |-2.71| 2.12 | 2.04 |4.49 
(31) (25) (24) (80) 
Both | 4,51 | 3.56] 1.57 | 2.06 |- .34] 2.79 | 
| (59) (49) (53) 
Control .25 | 1.44 
(16) 


a Ns appear in parentheses. 


in mean attitude-change scores among all six 
experimental groups indicated that for this 
sample of groups and subjects: 

1. The mean attitude change effected in 
the high credibility conditions was not sig- 
nificantly different from that effected in the 
low credibility conditions (F = .009, df= 
1/55, p > .05). 

2. Significantly different mean attitude 
change was effected between the three be- 
havioral compliance conditions (F = 53.31, 
df = 2/155, p < .01). 

3. There was a significant interaction in 
the mean attitude-change effects of the source 
credibility and behavioral compliance con- 
ditions (F = 29.49, df= 2/155, p < .01). 

Interpretation of the results of the overall 
analysis of variance rested upon closer analy- 
sis of the significant interaction in attitude- 
change effects of the credibility and com- 
pliance manipulations. For this reason, ¢ tests 
were used to test the significance of differ- 
ences in the mean attitude-change scores 
effected by high and low credibility at each 
of the three levels of behavioral compliance. 
The results of these supplemental ¢ tests 
showed that: 

1. In the voluntary compliance conditions, 
significantly greater positive attitude change 
was effected in the low credibility treatment 
group than in the high credibility treatment- 
group (¢ = 2.98, p < .01).. 

2. In the involuntary compliance conditions, 
the mean attitude change effected in the low 
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credibility and high credibility treatment 
groups was not significantly different (t= 
92, p > .05). 

3. In the noncompliance (message-only ) 
conditions, significantly greater positive at- 
titude change was effected in the high credi- 
bility treatment group than in the low credi- 
bility treatment group (t = 8.87, p < .01). 

On the strength of these results, it was 
determined that of the three attitude-change 
hypotheses posited, the evidence tended to sup- 
port one and to partially support a second. 
The hypothesis that, in the absence of be- 
havioral compliance, a high credibility source 
would effect greater positive attitude change 
than would a low credibility source was con- 
sidered confirmed. Only partial support was 
seen, however, for the hypothesis that, given 
behavioral compliance, a low credibility 
source would effect greater positive attitude 
change than would a high credibility source. 

In order to further test this and the remain- 
ing hypothesis, a second analysis of variance 
was performed. In this analysis, the non- 
compliance or message-only treatment groups 
were excluded. 

The results of this analysis, again using 
Snedecor’s method of unweighted averages, 
indicated that, given behavioral compliance: 

1. Significantly greater positive attitude 
change was effected in the low credibility than 
in the high credibility treatment groups (F 
= 11.54, df= 1/104, p < .01). 

2. Significantly greater positive attitude 
change was effected in the voluntary than in 
the involuntary compliance treatment groups 
(F = 29.47, df = 1/104, p < .01). 

3. There was a significant interaction in the 
attitude-change effects of the credibility and 
compliance manipulations (F = 4.04, df= 
1/104, p < .05). 

Again, a significant interaction effect com- 
plicated interpretation of the results of this 
analysis. The results, however, tend to support 
one and partially support the second of the 
remaining two hypotheses tested. It was ap- 
parent from this analysis, and from the ¢ 
tests reported above, that the significant 
interaction obtained was attributable to the 
fact that the low credibility source effected 
greater positive attitude change in the case 
of voluntary compliance, while neither the 
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high nor the low credibility source was more 
effective in inducing positive attitude change [ 
in the involuntary compliance condition, Thus, 
the prediction that, given behavioral compli- 3 
ance, low credibility sources effect greater (i 
positive attitude change than do high credi- | 
bility sources held only when compliance was 
voluntary, and the hypothesis was only par- 
tially supported. 

The hypothesis that voluntary compliance 
would effect greater positive attitude change 
than would involuntary compliance with the 
source’s position was confirmed. This finding, 
however, could have been the result of a self- 
selection bias. That is, the greater positive 
attitude change in the voluntary compliance 
condition may have been the cause, rather 
than the effect, of the compliance. Subjects 
who voluntarily complied with the source's 
recommendation may have been those who 
had already been convinced by the message. 
If this is the case, the mean attitude-chang? 
scores of subjects who failed to comply 
should not differ from those of subjects who 
voluntarily complied, and should differ from 
the attitude-change scores obtained in the 
noncompliance condition. Testing this argi- 
ment, the mean attitude-change scores of the 
10 subjects who did not voluntarily comply 
were compared with those of subjects in the 
voluntary compliance and message-only Col 
ditions. The mean attitude-change scores 0 
subjects not complying differed significant) 
from those in the voluntary compliance coh 
dition (¢ = 7.79, p < .01), but did not differ’ 
significantly from the mean attitude-chang 
scores in the noncompliance condition (t= 
48, p > .05). Thus, it would seem that volun 
tary compliance was the cause, rather than 
the effect, of the attitude change observed it 
that condition. 
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Discussion 


The results of this study suggest that, m 
the absence of behavioral compliance with tie 
request or position of the source, high credi 
bility sources are more effective in inducing 
positive attitude change than are low credi 
bility sources. If, however, the receiver 
complied with the position of the source, 
results suggest that low credibility sour 
are more effective, or at least equally © 
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five, in inducing positive attitude change. On 
ie basis of this study’s results, it appears 
that low credibility sources are more effective 
an are high credibility sources in effecting 
ositive attitude change in cases of voluntary 
“compliance, while neither high nor low credi- 
bility sources are more effective than are 
‘high credibility sources in effecting positive 
ttitude change in cases of voluntary com- 
liance, while neither high nor low credi- 
bility sources are significantly more effective 
in cases of forced compliance. It was further 
noted that greater positive attitude change 
‘follows from voluntary compliance with the 
7Miurce’s position than follows from forced 
t involuntary compliance. 
T Festinger’s theory of cognitive dissonance 
“was the genesis of the attitude-change hy- 
‘potheses posited. The attitude-change effects 
of high and low source credibility observed in 
‘this study may be examined in the light of 
“that theoretical orientation. In contrast to re- 
‘Ceivers exposed only to the persuasive mes- 
“Sage (message-only condition), receivers who 
complied with the position of the low credi- 
bility source, whose compliance was dis- 
Crepant with their initial attitudes, or who 
“experienced choice in complying or not com- 
‘lying, should have been able to muster fewer 
“Cognitions in justification of their compliance 
behavior. These receivers therefore would be 
“expected to experience greater cognitive dis- 
Sonance (felt drive to defend and justify 
one’s actions and decision) and to demon- 
“strate greater attitude change in the direction 
; “Advocated by the source. 
Thus, when subjects had behaviorally 
Complied with the source’s position, voluntary 
‘compliance effected greater positive attitude 
: change than did involuntary compliance. 
Similarly, the low credibility source effected 
8teater positive attitude change than did the 
high credibility source. However, the at- 
tude change effected by low credibility was 
‘Significantly greater only in the case of volun- 
fary compliance; in the forced compliance 
“Condition, the attitude-change effects of high 
i ao low source credibility were, although in 
oa tight direction, not significantly different. 
te his latter finding was somewhat unexpected, 
. Masmuch as subjects in the low credibility, 
forced compliance treatment group were €x- 
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pected to exhibit greater attitude change than 
those in the high credibility, forced compli- 
ance group. Two alternative explanations 
might be offered to account for this result. 
First, the high credibility source may have 
effected greater positive attitude change than 
was anticipated. This alternative does not 
seem feasible, however, since the least amount 
of positive attitude change was observed in 
the high credibility, forced compliance group; 
less change, in fact, than that effected in the 
high credibility, noncompliance group. Second, 
subjects in the low credibility treatment group 
may not have experienced a sufficiently high 
level of dissonance to be impelled to greater 
positive attitude change. That is, subjects 
would be less able to justify compliance oc- 
casioned by a low credibility source, and that 
positive attitude change would be their most 
readily employed dissonance reduction meas- 
ure. Consequently, they were expected to 
manifest greater positive attitude change 
than subjects in the high credibility, forced 
compliance treatment group. Since no measure 
of felt dissonance was employed, it can only 
be speculated that subjects in the low credi- 
bility, involuntary compliance treatment 
group did not experience the expected mag- 
nitude of dissonance. It is this explanation, 
however, which seems most feasible in the 
light of the study’s rationale. 

As expected, the high credibility source, in 
the involuntary compliance condition, effected 
less positive attitude change than did the 
high credibility source in the noncompliance 
condition. This seems consistent with dis- 
sonance theory, in that subjects in the forced 
compliance treatment groups could “rational- 
ize” their complaint behavior in the knowl- 
edge that they had no choice in whether or not 
to comply. Thus, they could reduce dissonance 
in this way, rather than by changing their 
attitude in direction of the source’s position. 

On the other hand, subjects in the noncom- 
pliance treatment groups were, in effect, 
“spared” the dissonance occasioned by be- 
havioral compliance with the position of the 
source. Dissonance may have been aroused, 
however, by perceived inconsistencies between 
subjects’ evaluations of the source and their 
attitudes toward the position ‘advocated by 
that source, Source credibility is, in part, a 
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function of what the source knows and be- 
lieves—his expertness and trustworthiness. 
Thus, it is cognitively inconsistent for the 
receiver to adhere to a position or attitude 
' which is in conflict with that expressed by a 
source for whom he holds a high regard. Con- 
versely, it is cognitively inconsistent for the 
receiver to hold an attitude which is espoused 
by a low credibility source—one of whom the 
receiver holds a low opinion. Therefore, sub- 
jects, perceiving a discrepancy between their 
own position with respect to the attitude ob- 
ject and that advocated by the source, are 
faced with a dilemma. They can either change 
their evaluation of the source or change their 
attitude toward the attitude object—the Red 
Cross blood program. The second alternative 
apparently was selected by the majority of 
subjects in the noncompliance conditions. 
Thus, greater positive attitude change was 
effected by the high credibility source, while 
the low credibility source effected “negative” 
attitude change. 
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AN EXPECTANCY INTERPRETATION OF 
z MANIFEST ANXIETY 


JOSEPH F. RYCHLAK ann JEROME J. LERNER k 


Saint Louis University 


This study examined the hypothesis that manifestly anxious Ss generalize 
expectancies (GEs) from recent past experience. The anxious S was seen as 
more constrained by the pushes and pulls of his most recent successes and 
failures. 80 Ss (40 males, 40 females) were divided into equal groups on the 
basis of high and low Manifest Anxiety scale scores. Following an initial esti- 
° mate of GE, Ss were administered 4 tests of manual dexterity, with the value 
of positive or negative reinforcement controlled by E. At Test 5, a 2nd GE was 
requested, which was then followed by a change in the level of reinforcement 


(from positive to negative and vice versa). A Test 6 GE was then requested, and 


E change in GE was assessed through a D score measure between the 2nd and 


3rd GE estimates. The findings support 
question about whether or not anxious 


the general hypothesis. There is some 


Ss lower their GEs more absolutely 


following negative reinforcement on Test 5 than do nonanxious Ss. They clearly 
raise their level of GE following a positive reinforcement on Test 5 more 


absolutely than do the nonanxious Ss. 


The anxiety construct has been widely 
studied in recent years, thanks to the valuable 
introduction of the Manifest Anxiety (MA) 
scale to the literature (Taylor, 1953). This 
scale was devised to advance theoretical 
speculations of a drive-reduction nature, and 
the work as reported and interpreted to date 
(eg., Spence, 1958; Spence, 1963; Taylor, 
1956) has focused on the “construct valid- 
ity” of the anxiety concept (Spence, 1963, p. 
22). Even so, interpretations of anxiety con- 
tinue to vex the investigator, and range from 
overt and acute physical reactions by an or- 
ganism in a stressed condition (as in the 
clinical symptoms of palpitation, vertigo, etc.; 
Hodgins, 1951), through a chronic state of 
semicovert heightened drive (the original MA 
scale view; Spence, 1963, p. 22), to the view 
that anxiety is a psychological characteristic 
of humans, involving their personalities and 
projected values (May, 1950, p. 191). Some 
even reject the concept entirely, as does the 
expectancy theorist, Rotter (1954): 

Since we have rejected a drive-reduction theory in 
favor of an empirical law of effect, it is not neces- 
sary to assume or postulate a source of drive or 


energy to account for behavior following noxious 
stimulation [p, 236]. 


It is the confounding of the physiological 


(drive) with the psychological or behavioral 
which Rotter seeks to correct by rejecting 


anxiety as a worthwhile construct. In its 
place, he proposes to use an index of the sub- 
ject’s mean expectancy for success (gratifica- 
tion), or the subject’s reciprocal mean expec- 
tancy for failure (punishment), as general- 
ized from past.experience to an upcoming task 
or life situation. Thus, if a subject is said to 
be experiencing “test anxiety” (Mandler & 
Sarason, 1953), for Rotter this would imply 
that the subject has little expectancy for suc- 
cess in such (identical or related) achieve- 
ment situations. To call this low level of the 
subject’s expected success (or, the converse, 
high expectancy for failure) “anxiety,” ac- 
cording to Rotter (1954, p. 194), is to add 
little to our description of the behavior under 
observation. In his social learning theory 
(SLT), “anxiety behavior” is seen as being 
determined by the entirely psychological con- 
structs of expectancy and reinforcement value. 

Rotter has also developed formulae to dem- 
onstrate that any given expectancy (Æ) ac- 
tually has two components: expectancy spe- 
cific to the task at hand (Z’), and generalized 
expectancy (GE) from related tasks: E= 
E' &GE. The SLT research suggests that as 
the subject comes into a novel situation, his 
initial Æ is dependent entirely upon GE, but 
that as the subject gains experience (e.g, 
massed trials on the same task), GE loses its 
influence to E’ in the determination of a given 
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E. After repeated experience, E may come to 
be identical with E’, in the now thoroughly 
familiar situation facing the subject. Consider- 
ing the quality of apprehensiveness about the 
‘unknown future attributed to anxiety, it 
seemed plausible to the writers to think of GE 
as having a more direct relation to “anxious 
behavior” than Æ would have. 

The significance of anxiety for expectancy 
theory has also been underscored in the risk- 
taking model proposed by Atkinson (1958), a 
model which is in many respects very similar 
to SLT. Atkinson’s theory of motivatéd be- 
havior (or motivation, as he terms it) rests on 
three basic constructs: Expectancy: 

a cognitive anticipation ... that performance of 


some act will be followed by a particular conse- 
quence [p. 323]. 


Rotter’s definition is very similar, with E 
viewed as the subjectively held probability by 
the subject that his behavior “X” will lead to 
(positive or negative) reinforcement “Y” 
(Rotter, 1954, p. 107). Rotter has also used 
the refinement of Z’ and GE already dis- 
cussed. Incentive: 

the relative attractiveness of a specific goal that is 
offered in a situation or the relative unattractiveness 


of an event that might occur as a consequence of 
some act [Atkinson, 1958, p. 323]. 


This is similar to Rotter’s concept of rein- 
forcement value (RV), which is defined as 
the degree of preference for any given rein- 
forcement to occur over all others possible, 
assuming that the probabilities for all rein- 
forcements were identical (Rotter, 1954, p. 
107). Noxious reinforcement stimuli would 
be seen as having a negative RV, analogical 
to the negative incentive of Atkinson. Motive: 


A motive is conceived as a disposition to strive for a 
certain kind of satisfaction in the attainment of a 
certain class of incentives. ... The general aim of 
one class of motives, usually referred to as appetites 
or approach tendencies, is to maximize satisfaction of 
some kind. The achievement motive is considered a 
disposition to approach success....The aim ‘of 
another class of motives is to minimize pain. These 
have been called aversions or avoidant tendencies, 
... The motive to avoid failure is considered a 
disposition to avoid failure and/or a capacity for 
experiencing shame and humiliation as a conse- 
quence of failure [Atkinson, 1958, p. 324]. 


One is reminded here of Murphy’s (1947, 
p. 170) reference to a “system of anticipa- 


“prized goal by many or all; it is also difficult 
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tions” in discussing how canalizations arise, | 
Atkinson’s appetitive motives are reminiscent 
of Murphy’s canalization, But in any case, 
whereas Rotter would claim that Behavior 
Potential (BP; i.e., motivated behavior) is aif 
function of Æ (including E’ & GE) and RV, 
Atkinson’s formula for motivated behavior 
includes motive, expectancy, and incentive, Is 
there a hidden similarity in these two concep- 
tualizations? We think so, and would assert 
that GE plays a role similar to that of motive 
in the Atkinson model. As he has clearly, 
stated, a motive is not an energizing drive | 
(Atkinson, 1958, p. 324), and therefore pre- 
sumably Atkinson’s model is not based upon: 
the drive-reduction thesis. Motives appear to 
be learned phenomena based upon the organ- 
ism’s history and “capacity for satisfaction” 
(Atkinson, 1958, p. 324). In like fashion, the 
concept of GE serves as a kind of “tie” be- 
tween past and present learned situations. If 
a subject has been “successful” (rewarded) 
in a desired (high RV) achievement situation 
in the past, this tendency to expect success in 
related achievement situations “generalizes”; 
failure expectations would generalize in the 
same fashion (see, e.g., Crandall, 1951). One's 
appetites and/or aversions are merely pat 
terned expectancy-setting phenomena, nul 
tured by past experience and now generalized 
to more and more situations in the present. 
The point of departure from SLT for the 
Atkinson model is in the assumption that 
expectancy and incentive are inversely related 
(combining into unity, so that if one score 1$ | 
90 the other is .10, or .50 and .50, etc.; Ab 
kinson, 1958, p. 326). Rotter (1954, pp. 161- 
162) argues that E and RV are frequently te- | 
lated, but that basically the two constructs | 
are independent of each other. It is difficult 
to earn money in the stock market, a highly 


to climb a. mountain, a highly prized. goal by 
only a few. But in any case, through a multi 
plication of expectancy and incentive scores) 
Atkinson (1958, p. 327) is able to work ott 
approach and avoidance values for tasks 0 | 
various levels of difficulty. As can be se 
tasks of very high or very low expectancy 
(.90 or .10 probability for success) will havé 
low values (.09 for these extremes); 2 
tasks in the intermediate ranges of .50 will 
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have the strongest attraction (or repulsion if 
the incentive is negative), because their mul- 
tiplicand (incentive) is presumed to be .50 
and this gives a value of .25. Hence, if one is 
motivated to achieve, he will get the greatest 
sense of achievement by choosing tasks of an 
intermediate level of difficulty. And, if one is 
motivated to avoid a feeling of failure, he 
will choose either very difficult tasks (where 
failing is to be expected) or very easy tasks 
(where success is assured). In this fashion, 
, Atkinson ingeniously explains the well-docu- 
mented fact that arfxious subjects (presum- 
ably fearful of failure) set either very high or 
ssvery low aspiration levels on upcoming tasks, 
whereas nonanxious subjects are more likely 
to set intermediate aspiration levels (Lewin, 
Dembo, Festinger, & Sears, 1944). 

The present study will incorporate these 
findings in a somewhat different fashion, and 
from an SLT perspective. Further, there will 
be an economy of explanation in that no dis- 
tinction will be made between expectancy, 
motive (in our terms, GE), and level of aspira- 
tionsetting behavior (approach and avoidance 
gradients). Although based upon an empirical 
law of effect, SLT must still account for be- 
havior typically called anxious. This study 
hopes to provide additional understanding of 
such behavior. Earlier (Rychlak & Eacker, 
1962), tentative evidence was presented sup- 
porting the view that subjects high in manifest 
anxiety were less stable in GE than were 
nonanxious subjects. The anxious subject 
seemed more prone to raise his GE for success 
ina series of different tasks (Task 1, Task 2, 
etc.) following positive reinforcement, and to 
lower this GE following negative reinforce- 
Ment, than was a nonanxious subject. If we 
use the Atkinson model to explain this tend- 
ency, we would presumably hypothesize that, 
following success on Task 1, the anxious in- 
dividual who is highly motivated to avoid 
failure chooses a very high expectancy level 
for Task 2. This permits him to shrug off any 

» Subsequent failure at Task 2 because of its 
high level of difficulty. He would seek a 
defensible failure (Task 2) in the face of 
Success (Task 1). After his failure experience 

| 0n Task 2, he greatly lowers his sights in 
Order to assure some kind of minimal success 
on Task 3. Thus, even if the Task 2 failure 
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were defensible, he would now seek to gain 
a minimal reinforcement at Task 3, after 
having priced himself out of the market for a 
reasonable success experience at Task 2. The 
nonanxious individual, who is presumably les$ 
fearful of experiencing failure and more mo- 
tivated to gain success, stays within the middle 
ranges of expectancy over all tasks, and there- 
fore appears more stable in the series of trials. 

In contrast to this view, the hypothesis 
under test in the present research has it that 
anxious subjects are more responsive to their 
recent past experience than are nonanxious 
subjects. They are more prone to redefine their 
performance level following an as yet unknown 
but low number of (positive or negative) 
reinforcements in any given area of ability— 
that is, generalize from recent past experience. 
Nonanxious subjects would be seen from this 
view as more likely to transcend immediate 
past experience, and consequently to be more 
stable in GE. They would have a broader 
base from which to generalize, a larger fre- 
quency of experiential events, leading to in- 
creased stability of GE in the present (fol- 
lowing Rotter’s, 1954, Ch. 5, explanation). 
Therefore, in a sequence of behavioral events, 
the anxious subject may be expected to con- 
tinually fluctuate in GE because he is seeking 
an anchor point from which to project a 
reasonable performance level. Unfortunately, 
he finds it impossible to transcend recent past 
experience and continually over- and under- 
estimates his ability level for the “next” task, 
depending upon his most recent successes 
(positive reinforcement) or failures (negative 
reinforcement) in the area of ability under 
consideration. Thus, he brings about a “vicious 
circle” of setting himself up for failure by 
shooting too high, and then, following a set- 
back, drops off too quickly to guarantee 
further positive reinforcement, which only re- 
peats again the rapid increase in GE with 
its concurrent increase in the chances for 
another failure. 


METHOD 
Tasks 


The experimental tasks, concerning which GE 
had to be formed, were six performance tests of 
manual dexterity. They were numbered in the order 
of their experimental appearance, and each test 
was adapted to a 1-minute trial procedure (after 
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Rychlak, 1958). The better-known tests and their 
numbers were: Minnesota Rate of Manipulation, 1; 
Pennsylvania Bi-Manual Worksample, 3; Purdue 
Pegboard, 4; and O’Connor Tweezer Dexterity Test, 
5 (Buros, 1949, 1953, 1959). Test 2 was a two-part 
pegboard requiring the subject to put several brass 
washers on capped pegs and then to insert this 
combination in a series of holes. Test 6 was a 
hand-steadiness test, which involved placing a stylus 
in holes of various sizes without touching the sides. 


Subjects 


The subjects were chosen from introductory psy- 
chology classes, and ranged in grade from freshmen 
through primarily sophomores to a few juniors in 
college attendance. Before selecting experimental 
subjects, 273 introductory psychology students (191 
males, 82 females) were administered the MA scale 
(Taylor, 1953), Following the MA scale testing, 
males and females were stratified separately, and 
extreme groups were selected of 20 subjects each 
from both distributions. Thus, two groups of 40 
subjects (20 males, 20 females) were designated as 
either anxious or nonanxious on the basis of their 
MA scale scores (total N=80). The selection 
method was as follows: subjects were randomly 
assigned to experimental conditions and then con- 
tacted individually by telephone and asked to par- 
ticipate (for which they received course recognition). 
Approximately 3% of those subjects initially con- 
tacted failed to cooperate, and when this occurred 
an appropriate replacement was randomly chosen 
from the extreme quartiles (for male subjects) or 
extreme thirds (for female subjects) of the respec- 
tive distribution. 


Procedure 


The study was designed to have both anxious 
and nonanxious subjects state a GE, at the outset, 
and then to provide them with four interpolated 
positive or negative reinforcements (on the perform- 
ance tasks), before requesting a second statement 
of GE», The GE, measure was necessary to insure 
that all subjects had roughly the same expectancy 
for success in tasks of this nature at the outset. 
The interpolated reinforcements were prearranged 
according to experimental condition. Thus, the 
experimenter controlled all “scores.” Following their 
statement of GE. subjects were given a single rein- 
forcement out of proportion to their earlier experi- 
ence, and then a final GE; was requested. It was 
therefore possible to examine change in GE through 
the use of difference scores (D) between the second 
and third GE statements. The crucial test would 
come from the D measure, and the prediction was 
that anxious subjects would have a significantly 
greater D than nonanxious subjects. 

The specific instructions were uniform for each 
experimental condition. Subjects were told that the 
series of tests they were about to take had been 
administered to thousands of college students all 


over the country. They were instructed that each 
test was a little different from the other, but that 
all tests measured roughly the same thing: “the 
ability to manipulate your hands.” As an additional 
source of motivation, all subjects were informed: 
“although we are not certain of the relationship, | 
we feel these tests are, in some manner, related to 
such things as scholastic achievement, intelligence, 
and personality characteristics.” Flicking through a 
booklet of tables, the experimenter further explained 
that all scores would be based upon how well other it 
college students of a comparable age and educational 
level had done. Later these pseudonorms were 
obviously referred to in “deriving” each subject's 
score. q 

Previous research (Chapman & Volkmann, 1939; | 
Rotter, 1954, p. 177) has demonstrated that groups 
of subjects having no knowledge of a task on which?” 
the possible scores are reported to range from 0 to 
50 points, will state a mean GE of approximately 
28 on their first attempt. This general finding was 
used to establish an anchor point defining a positive 
or negative reinforcement in the present study. That f 
is, subjects under negative reinforcement were given 
(four interpolated) scores averaging 11 points below, 
and subjects under positive reinforcement 11 points 
above this anchor point of 28. At the outset of the 
experiment and prior to actually seeing the first 
task, each subject was requested to write a covert 
statement of expectancy for Test 1 on a slip of 
paper, based upon the 0-50 range of score possibil- 
ity, and then to drop it into a box placed within 
reach, This procedure was justified as an attempt to 
discern what “most people” think they might earm 
on such tests before actually having seen what was 
specifically involved. Subjects were led to believe 
that these estimates would not be identifiable. How- 
ever, the covert estimates were recovered following 
ne experimental session and taken as the subject's 

a 

After giving their initial estimates (GH); the 
subjects took Tests 1-4 consecutively without any 
request being made. Following each test trial, the 
experimenter paged noticeably through the book of 
tables and stated the appropriate “score” aloud % | 
he also recorded it on the subject’s performance 
sheet. Positively reinforced experimental subjects 
were given scores of 36, 38, 37, and 39 on the 
interpolated Tests 1-4. Negatively reinforced sub- 
jects were given a sequence of 16, 18, 17, and 
on the first four tasks. Note that these scote 
average either 11 points above or below the ancho 
point of 28. Immediately preceding Test 5, the 
subject was asked to state an (overt) GE: “no 
that you have some experience with tests of they 
sort.” The score on Test 5 (controlled by © 
experimenter) was 11 points above (for subject 
who had been given negative interpolated reinfolt | 
ment) or 11 points below (for the positively | 
forced subjects) this Test 5 estimate. The ae 
(overt) estimate (GE,) was then requested fo 
Test 6. On the last trial of the experimental session 


each subject was reinforced at least 7 points abov | 
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his GE; hence, everyone ended the experiment with 
a success experience. 

It $ important to emphasize that the subject 
knew nothing about the specific task he was about 
to take when he made his expectancy statements. 
The tests were kept out of sight until a given 
administration was in order, and the subjects could 
not see their next test until actually in the process 
of taking it. It is in this sense that all three esti- 
mates can be viewed as statements of GE. The 
subject had absolutely nothing to go on except past 
experience in related tasks; he was, essentially, asked 
to state his GE for success in tasks of this nature. 


RESULTS . 
Four subgroups of 20 subjects each (10 


gpmale, 10 female) were required to fulfill 


experimental conditions. There were posi- 
tively and negatively interpolated reinforce- 
ment conditions for both anxious and non- 
anxious subgroups. Tests 5 and 6 were the 
trials on the basis of which D scores were 
calculated for each subject. Groups under 
positive interpolated reinforcement (Tests 
1-4) which received a negative reinforcement 
at Test 5 could naturally be expected to 
degline in GE, so that such D scores would 
reflect a negative change. The opposite direc- 
tion was to be expected in D scores for 
groups under negative interpolated reinforce- 
ment which were then positively reinforced 
at Test 5. However, there was no account 
taken of the direction of change in treating 
these data because the experimenters were 
interested only in comparing all experimental 
subgroups on the extent of change over Tests 
5 and 6. In one sense, the GE measures may 


, be thought of as the raw scores of the study, 
‘and the D scores as the derived measures. 


The first point of interest is to note that, 
considered as two large groups of 40 subjects 
each, there was a sex difference in GE, with 
males stating a higher mean value (F = 7.74, 
’<.01). This finding has been cited else- 
where (Rychlak & Eacker, 1962) and it is 
doubtless due to the essentially masculine 
nature of this experimental task. However, 
within-sex differences were not found be- 
tween anxious and nonanxious subgroups. 
The GE, means (and standard deviations) 
broken down by sex and anxiety condition 
were as follows: anxious males, 37.35 (2.68) ; 
Nonanxious males, 37.30 (4.58); anxious 
females, 33.85 (5.48); and nonanxious 
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TABLE 1 


FACTORIAL ANALYSIS or D Score, Invotvinc SEX 
-ANXIETY, AND INTERPOLATED REINFORCEMENT 


Source af MS P 

Sex (A) 1 | 662] .39 
Anxiety (B) 1 | 19.02 | 1.13 
Interpolated reinforcement (C) | 1 | 94.62 | 5.64* 
AXB 1 | 49.60 | 2.96 
AXC 1 | 59.50 | 3.55 
BXC 1 | 86.10 | 5.13* 
AXBXC 1 | 23.13 | 1.38 
Error 72 | 16.78 

Total 79 

. 
*p <.05. 


females, 34.45 (6.15). These are unusually 
high values which do not fit the pattern of 
earlier researches using the 0-50 range (see 
Procedure). 

In order to test the experimental hypothe- 
sis that anxious subjects would generalize 
expectancies from recent past experience, the 
D measures were submitted to factorial 
analysis. Table 1 gives the results of this 
analysis between sex, anxiety, and inter- 
polated reinforcement. It is important to keep 
in mind that the D score data of Table 1 
reflect absolute differences between GE» and 
GEs, with the direction of change ignored, 

Table 1 reflects two significant F tests, 
There is a significant effect due to inter- 
polated reinforcement (F = 5.64) which 
cross-validates an earlier finding in the use 
of this experimental procedure (Rychlak & 
Eacker, 1962). That is, for groups combining 
both sexes, D scores are greater following 
interpolated positive reinforcement and a 
Test 5 negative reinforcement (M = 6.25, SD 
= 4,79) than they are following interpolated 
negative reinforcement and a Test 5 positive 
reinforcement (M = 4.07, SD = 3.65). Peo- 
ple raise their GEs more slowly following 
positive reinforcement in an area of poor 
ability than they lower their Gs following 
negative reinforcement in an area of good 
ability. This conservatism in the raising of 
GE and liberalism in lowering GE ensures the 
likelihood of positive reinforcement on the 
“next” trial. The experimental hypothesis 
states that the anxious individual violates 
this general principle, and that he will be 
just as likely to reflect a large D score under 
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either interpolated reinforcement condition. 
The other significant finding of Table 1 
brings us to a test of this hypothesis, for it 
suggests that there is a significant inter- 
attion between anxiety and the experimental 
manipulation of interpolated reinforcement 
(F = 5.13). 

In order to test the significance of mean 
differences between subgroups of subjects 
combining the two factors of interpolated re- 
inforcement and anxiety, the D scores for all 
such groups were subjected to a Duncan new 
multiple range test (Edwards, 1960, pp.?136— 
140). Table 2 gives the GE, GE», and D 
scores (difference between GE2 and GE) for 
the four subgroups which combine both anxi- 
ety and interpolated reinforcement, and also 
the results of the Duncan test. There were no 
differences across subgroups on the GE; and 
GE, measures, but Table 2 is arrayed ac- 
cording to the significant findings on the D 
measure, The subgroups are arrayed from 
highest to lowest on the size of the D score, 
and it can be seen that only the nonanxious, 
negatively reinforced subjects differ signifi- 
cantly from the other conditions in change of 
GE over Tests 5 and 6. 

The Table 2 findings are in support of the 
experimental hypothesis, for they suggest 
that nonanxious subjects behave very much 
like our general principle; they drop in GE 
when negative follows positive reinforcement 
(6.80) to a significantly greater degree than 
they climb in GE when positive follows nega- 
tive reinforcement (2.55, p < .01). The anx- 
ious subjects fail to perform true to the princi- 


TABLE 2 
Covert GE Statement (GE), First Overt GE Statement (GE:), AND D SCORE 
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ple and show little difference in change 0 
across positive and negative reinforce 
conditions, even though their rank orde 
in line with the expected performance (5 
and 5.60, respectively). The experimen 
hypothesis is also supported by the fact tl 
anxious subjects under negative reinfor 
ment who are positively reinforced spurt mi 
in GE for the “next” trial (5.60) than 
nonanxious subjects under the same experi 
mental procedure (2.55, p < .05). K 

There is one finding in Table 2 which 
to support, or even to go along with the 
perimental hypothesis. If anxious sul 
generalize from recent past experience, 
it would follow that the greatest change (1 
largest D score) should have been fo 
in the anxious, positively reinforced gro 
Though there was no statistically significant! 
difference between anxious and nonanxiou 
subjects under positive interpolated reinfor 
ment, the latter group did rank highest 
change of GE, and this fails to support 
reasoning advanced. In an attempt to $ 
further light on this problem, a second s 
was run, making use of what few subj 
were remaining in the anxious and nonanxiou 
pools. This necessitated limiting subjects 
males, and using only one experimental tr 
ment; that is, positive interpolated reinforce 
ment (which was the doubtful case). El 
anxious and 11 nonanxious males were 
lected and put through a slightly mod 
procedure. 

In this minor exploratory study, the al 
was to see what would happen if the subj 


Experimental condition 
MA scale, interpol: 
ces erpa aien GEì (covert) GE: (overt) D (GE: & GEI) 
M SD M SD M 
Ponve reinforcement 
onanxious 36.55 5.05 36.75 2.05 6.80** 
Anxious 34.80 4.35 36.65 1.88 5.70* 
Negrtve reinforcement 
nxious 36.40 4.68 
Nonanxious 35.20 6.04 2.6 12 235. 


* Significance greater than the nonanxious, negative reinforcem: 
eane arent thanl he IOA aerative intorermenteratne pee or 
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was merely administered the first four tests 
under positive reinforcement (scores of 36, 
38, 37, and 39), and then given one negative 
reinforcement (score of 17) on Test 5. What 
level of GE would he then proffer for Test 6? 


“ Thus, the subject made only one overt GE 


statement, just preceding Test 6. If the anx- 
ious subject generalized more from recent past 
experience, then we should expect the Test 
5 score of 17 to pull his Test 6 GE further 
down from his earlier level of performance 
than it would pull down the Test 6 GE of 


* a nonanxious subject. We would expett to 


find a lower Test 6 GE for anxious than for 


pey- nonanxious subjects. This is what was in fact 


found, but once again the difference between 
means failed to reach significance. The mean 
Test 6 GEs of the two groups were as follows: 
anxious, M = 32.27, SD = 3.35; nonanxious, 
M = 33.64, SD=2.29 (t= 1.13, ns). Inci- 
dentally, Table 2 has a finding not unlike 
this hidden within the negative reinforcement 
condition. Note that the mean GE, measure 
of the anxious, negatively reinforced group is 
lower (18.85) than the comparable measure 
for the nonanxious, negatively reinforced 
group (22.65). Here again, though there 
is the suggestion that anxious subjects are 
pulled down more by their recent past experi- 
ence (via the interpolated reinforcement), 
one cannot cite a significant statistical level 
of difference between means (due primarily to 
the SD of the nonanxious group). Similarly, 
D scores worked out between GE, and GE 
with direction of change ignored, failed to 
show an interaction effect between anxiety 

*.and reinforcement condition, when subjected 
to factorial analysis. 

Finally, the reader may wonder whether 
there was a relationship between the sub- 
ject’s actual performance on the six experi- 
mental tests and his level of anxiety. Did the 
anxious subjects perform more adequately or 
More poorly on the performance tests than 
the nonanxious subjects? Although not done 
as a formal aspect of the study, the authors 
worked out a scoring scheme for each test, 
adapted to the 1-minute trial procedure, and 
factorial analyses were run on these data. 
Since the procedure was not standardized and 
no attempt was made to check on the reli- 


683 


ability of measures, etc., one could hardly 
lay claim to highly valid measures over such 
a brief testing period. However, it is interest- 
ing to note that though a sex difference 
emerged (males doing better on Test 1 than 
females) and an interpolated reinforcement 
difference emerged (negatively reinforced 
subjects doing better on Test 5 than posi- 
tively reinforced subjects), there were no sig- 
nificant differences on any test for level of 
anxiety. 


e DISCUSSION 


It would appear that the experimental hy- 
pothesis suggesting that manifestly anxious 
individuals generalize expectancies from re- 
cent past experience has been supported in 
its general phrasing, but that specific diffi- 
culties remain. Compared against their own 
performances in the raising and lowering of 
GE, anxious individuals are prone to follow 
recent experience in setting their aspirations 
for the future. They lower their expectancies 
about as much as they raise expectancies, 
depending upon how things are going in some 
area of ability or performance. The non- 
anxious individual, on the other hand, once 
having abruptly lowered his sights due to 
what might be termed a failure experience in 
an area of relative competency, raises his 
expectancy rather slowly in order to ensure 
success on the upcoming turn of events, This 
general finding is entirely in support of the 
theoretical position advanced in the Intro- 
duction, 

However, difficulties arise when we compare 
specific aspects of the raising and lowering of 
expectancies across groups defined as anxious 
and nonanxious, each of which is stating ab- 
solute expectancies according to a common 
metric. In this instance, we can still say that 
anxious individuals who have been experienc- 
ing a low level of success, and who then 
achieve a higher level of success, are more 
prone to raise their level of expectation than 
are the nonanxious individuals. Quantitatively 
speaking, the anxious individual spurts more 
in expectancy for the future than does his 
nonanxious peer. On the other hand, though 
some support exists for this view, we cannot 
state with acceptable evidence that anxious 
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individuals lower their expectancy for success 
any more absolutely on a common metric 
than do nonanxious individuals after having 
been negatively reinforced in an area of abil- 
ity in which they have been achieving some 
success, Presumably, everyone “ducks” com- 
parably when fate swings a blow his way. If 
anxious individuals find themselves in our 
vicious circle (see Introduction), then their 
difficulty lies mainly on the side of their 
easily stimulated “great expectations.” 

It is interesting to note (Table 2) that the 
GE, measure failed to distinguish between 
levels of anxiety. Anxious subjects did not 
project a higher or lower covert level of GE 
than the nonanxious at the outset of the 
study, though we might have anticipated this 
development in reasoning from the Atkinson 
model, Even more cogently, the SD of the 
GE, measure is greatest for the nonanxious 
subjects. If anxious subjects project either 
high or low levels of aspiration due to their 
fear of failure, then we should expect to find 
an anxious group with the largest variance at 
each GE measurement. Actually, the reverse 
is the case. Finding that anxious subjects 
perform no better on the experimental tasks 
than nonanxious subjects is not surprising, 
since the MA scale has admittedly been used 
to assess performance on far more simple 
behaviors than were under study here 
(Spence, 1963, pp. 22-23). 

In conclusion, it might be said that the 
anxious individual is one who is caught up in 
current necessities; he cannot easily transcend 
immediate experience, even though he might 
prefer to do so. This need not always be an 
unhealthy situation, for in one sense to be 
anxious is to be concerned, invested, and com- 
mitted; and such behaviors are in themselves 
often meritorious, Finally, it should be noted 
that nonanxious individuals may literally 
maximize the likelihood of obtaining a self- 
gratifying positive reinforcement through 
their conservatism in the raising of expect- 
ancies. By not pricing themselves out of the 
market following past successes, there is al- 
ways a good chance for large achievement 
returns in the future. 


$ 
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PERFORMANCE EXPECTANCY AS A DETERMINANT 
OF ACTUAL PERFORMANCE: 


A REPLICATION WITH A NEW DESIGN * 
Š NICKOLAS B. COTTRELL ? 


Research Center for Group Dynamics, University of Michigan 


Previous studies of the effects of discrepancies between expected performance 
(EP) and obtained performance (OP) yielded conflicting findings. The present 
study used a split-plot design to examine the effects of EP-OP discrepancy and 
the anticipation of an audience upon performance changes. The present study 
found that persons with an EP-OP discrepancy rejected the inconsistent 
performanse by changing their resħonses. Persons with no EP-OP discrepancy 
changed relatively few responses. Furthermore, an inconsistent performance did 
not influence Ss’ stated expectancies for future performance. Additional evidence 
made it unlikely that these findings were the result of Ss’ desire to do E a favor. 


a 


Aronson and Carlsmith (1962) extended 
dissonance theory (Festinger, 1957) to include 
the effects of discrepancies between expected 
and obtained performance. They found that 
persons whose expected and obtained per- 
formances were inconsistent or dissonant re- 
duce the dissonance by revoking the incon- 
sistent performance and replacing it with one 
consistent with expectation. In terms of their 
experiment, persons who expected good per- 
formance but who performed poorly and per- 
sons who expected poor performance but who 
performed well change more of their responses 
than persons whose performance confirmed 
their expectations. 

Ward and Sandvold (1963), however, failed 
to obtain Aronson and Carlsmith’s disso- 
hance-reduction finding. They did find, as 
did Aronson and Carlsmith, that persons who 
Performed poorly revoke their poor perform- 
ance and replace it with a better one. 
lt is far from clear why dissonance-reduc- 
tion effects were obtained in one study but 
not in the other. Although the performance 
a both studies was the same—score on a 
Psychological” test—there were two pro- 


1 This study was conducted during the tenure of 
Postdoctoral Fellowship F2, MH-14, 434-01 granted 
ı to the author by the National Institute of Mental 
Health, United States Public Health Service. 
The author wishes to thank E. Burnstein, D. 
` Cartwright, and R. B. Zajonc for their suggestions 
and E. Aronson and R. Baron for their helpful 
criticisms of the manuscript. 
Now at Kent State University. 


cedural differences. Ward and Sandvold’s 
subjects are likely to have expected that their 
classmates would learn their scores since their 
psychology instructor told them that partici- 
pation in the research would form the basis of 
future class discussions of psychological test- 
ing. Aronson and Carlsmith’s procedure does 
not seem to have led their subjects to expect 
that an audience of peers would subsequently 
learn their scores, Previous research (Bergum 
& Lehr, 1963; Travis, 1925) has shown 
that the presence or anticipation of an audi- 
ence enhances performance and suggests that 
performing well is more rewarding when an 
audience is expected to view the results than 
when no audience is expected. The anticipa- 
tion of an audience in Ward and Sandvold’s 
study may have heightened the concern with 
performing well to such an extent that disso- 
nance between expectancy and performance 
was a relatively small price to pay for a high 
performance. 

Ward and Sandvold told their subjects that 
the test was of questionable validity while 
Aronson and Carlsmith described the test as 
a highly valid one. Ward and Sandvold feel 
that this change in procedure removed the 
“experimenter demands” (Orne, 1962) for 
performing consistently which they feel ac- 
count for Aronson and Carlsmith’s dissonance- 
reduction findings. However, expressing little 
confidence in the test may have produced 
weaker expectancies in Ward and Sandvold’s 
study and therefore a smaller magnitude of 
dissonance between expectancy and perform- 


685 


686 


ance and less pressure to revoke an incon- 
sistent performance, 

Actually, the findings of both studies are 
open to experimenter-demands interpretations 

ə because their experimental designs confounded 
the particularities of a single experimental 
session with the combination of experimental 
variables administered during that session. In 
the previous studies a single subject was run 
at a time and he was led to expect that he 
would perform at a certain level on a person- 
ality test. The subject’s subsequent perform- 
ance either confirmed or disconfirmed, his ex- 
pectation, The experimenter then announced 
he was compelled to throw out the score be- 
cause he had forgotten to perform necessary 
latency measurements and therefore the sub- 
ject would have to take the test again, The 
dependent variable was the number of re- 
sponses the subject changed while retaking 
the test. The relationship between the experi- 
menter and the subject was complex and 
there are many ways in which the experi- 
menter could have unwittingly shaped and 
influenced the subject’s behavior independ- 
ently of the experimental treatments during 
that session. For instance, the manner in 
which the experimenter requests the subject 
to retake the test can carry overtones urging 
either that the subject change his responses 
or that he not change them. Also, the experi- 
menter can vary in the plausibility with which 
he states that the score from the first take 
on the test will not count, Therefore, subject- 
to-subject variations in the perceived revok- 
ability of the first score on the test or in ex- 
perimenter-communicated norms concerning 
changing performance, rather than the experi- 
mental variables, may account for the results 
of the previous studies, 

The present study’s first aim was to de- 
termine the effects upon performance change 
of dissonance between performance and ex- 
pectancy, unconfounded with subject-to-sub- 
ject variations in procedure. The study’s 
second purpose was to determine the effects 
of the anticipation of an audience upon the 
reduction of expectancy-performance disso- 
nance, It is hypothesized that discrepancies 
between expectancy and performance produce 
dissonance-reducing changes in performance 


when no audience is anticipated and that per- 
s 
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formance changes are not related to expect- 
ancy-performance discrepancy when it is an- 
ticipated that an audience of peers will view 
the results of performance. 


PROCEDURE i 


Much of the procedure was similar to that 
of Aronson and Carlsmith (1962). 


Subjects and Task 


The subjects were 32 female introductory psy- 
chology students at the University of Michigan 
Participation in research projects was a course J 
quirement. The subjects were told that the aim of 
the research was to find paper-and-pencil tests of 
personality which would predict the results of int 
views. The subjects were to be interviewed immedi- 
ately after the tests. Actually there were no inter 
views and only two personality tests: a warm-up 
self-rating scale and the task of interest, a bogus 
social sensitivity test. The subjects were told that 
the social sensitivity test was a very good m 
of an individual's sensitivity to others, that is, thei 
understanding and insight. Aronson and Carlsmith 
(1962, p. 179) instructions were used. q 

The test consisted of 50 cards; on each card ther 
were pictures of three men, The subject’s task k 
to judge which of the three men was schizophretlt 
Actually, there were no correct answers; to the 
investigator’s knowledge, none of the pictured ind: 
viduals, all Michigan State University seniors, wa 
schizophrenic. The subjects were also told that pal 
scores had ranged from 90% correct (these individ: 
uals were very socially sensitive) to 20% correi 
(these individuals were deficient in social sensitivity) 
and that it was quite difficult for people to JU 
their own performance on the test. ; 

The test was divided into five sections of M 
cards each. Each card was presented for no molt 
than 10 seconds. After each section of 10 carts 
there was a 3-minute rest period in which the & 
perimenter purported to score the judgments of MH 
Previous section and report to the subjects 
number of correct judgments they had made. 


Experimental Conditions 


There were three experimental variables: public 
private, expectancy, and performance, There 
two levels of each variable. Four subjects W 
randomly assigned to each of the eight combinatio 
of experimental treatments. The randomization 
cedure is described below. 

In the public condition the subjects were led 
expect that their scores on the social sensitivity 
would be revealed to their classmates. Before 
warm-up test the public subjects were told: 


We are conducting our research this seme 
on female students enrolled in the introducti 
Psychology courses. A number of girls from 
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ous sections of these courses have already taken 
the tests and been interviewed. In three weeks 
we should be through with our study. At that 
time the instructor of each section will receive a 
list of his students who participated in the 
study and their scores on the tests. This list of 

* names and scores will be passed around the class 
and then posted so you can see what your scores 
on the tests were. 


The subjects in the private condition were led to 
expect that their scores would be kept confidential 
and were told: 


Your test scores and interview materials will be 
e kept confidential. No one who is not connected 
with this project will Me permitted to see them. 


_ The scores reported at the end of each section of 
>10 cards were prearranged and were used to create 
variations in expectancy and performance. On the 
first four sections the high-expectancy subjects were 
told that their scores (the number correct out of 10) 
were 9, 8, 8, and 9, respectively, and the low-ex- 
pectancy subjects were told that their scores were 
3, 2, 2, and 3, respectively. On the fifth section the 
high-performance subjects received a score of 9 and 
the low-performance subjects received a score of 3. 
The fifth section score confirmed the expectancies of 
the subjects in the high expectancy-high performance 
condition and the low expectancy-low performance 
condition and disconfirmed the expectancies of sub- 
jects in the high expectancy-low performance con- 
dition and the low expectancy-high performance 
condition. 


Dependent Variable 


Si After the fifth section of the test the experimenter 

discovered” that the timing apparatus had broken 
down during that section, He gave the subjects their 
answer booklets and a copy of the key and asked 
them to score their performance on that section 
while he attempted repairs. While working on the 
equipment, the experimenter explained that he had 
bten timing their judgments all along and that it 
was essential to obtain the times. When the sub- 
Jects had finished the scoring, the experimenter an- 
nounced that he had successfully repaired the clocks 
and: said: 


Well, I'm afraid we can’t use these fifth section 
Scores. TIl have to throw them out since I didn’t 
get the times. You see, I have to measure your 
Judgments and judgment times simultaneously. 
Just getting the judgments without the time it 
took to make them isn’t any good. I’ve got the 
clocks working, so there’s only one thing to do. 
Would you mind terribly if I asked you to take 
the fifth section of the test over again? Why 
don’t you just pretend that it’s a completely new 
Set of pictures—that is, respond as if you've 
never seen the pictures before. That way I can get 
a fairly accurate measure of the time it takes you 
to complete the set. 
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The experimenter then collected the answer book- 
lets and keys and destroyed the pages on which 
responses to the fifth section apparently had been 
recorded without looking at the “results” of the 
fifth section. Actually, the answer booklets had been 
prepared before the session in such a way that whene 
they were compared with the key there emerged one 
of the four patterns of scores described above. A 
record of the subject’s actual choices on the fifth 
section was preserved. 

The subjects then took the fifth section again. The 
number of previous responses the subjects changed 
in the retake of the fifth section was an operational 
measure of their discomfort with their fifth section 
score. Obviously, changing responses in the face of a 
low score would improve the score, and changing 
responses in the face of a high score would lower 
the score. 

Finally, the subjects filled out a questionnaire 
anonymously. The questionnaires were coded so 
that the experimenter could identify them. The 
subjects were asked not to discuss the study with 
other students. Later in the semester the experi- 
menter explained the study to them and answered 
their questions. 


Design 


Both of the previous studies confounded the par- 
ticularities of a single experimental occasion with 
the combination of expectancy and performance 
treatments administered during that session, The 
present study employed a split-plot design which 
equalized subject-to-subject variations in procedure 
for the four combinations of expectancy and per- 
formance. Four subjects were run simultaneously 
during each session. Sets of four subjects were 
formed. The first two sets constituted the first rep- 
lication of the experiment, the third and fourth sets 
the second replication, etc. Within each replication 
one set of subjects was randomly assigned to the 
public condition, the other to the private condition. 
All combinations of expectancy and performance 
treatments were represented in each experimental 
session. The four subjects in each session were 
randomly assigned to the combinations of expect- 
ancy and performance treatments—one subject to 
each of the four possible combinations. 

Communication among subjects during a session 
was avoided. The experimenter instructed them not 
to talk to one another during the session. This re- 
quest was honored. The subjects sat a few feet apart 
facing the experimenter. They signaled their choices 
on the social sensitivity test by holding up cards 
labeled “Left,” “Right,” or “Center.” Cardboard 
partitions concealed a subject’s test choices and 
written work from the other subjects. After each 
section of the test the experimenter delivered indi- 
vidual, written score reports to each subject. 

Preexperimental acquaintance among subjects in a 
session was minimized; each subject in a session 
came from a different course section and lived in 
a different dormitory. Anticipated postexperimental 
interaction with the others was minimized by telling 
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the subjects that their posttest interviews would be 
conducted simultaneously in’ different rooms and 
would last different lengths of time. 


Questionnaire Measures 


Three questions concerned communication among 
subjects. The subjects indicated whether they knew 
anyone else’s social sensitivity test scores and 
whether they thought anyone else in the session 
knew their scores. They indicated their degree of 
preexperimental acquaintance with the others in the 
session, 

Two questions concerned the effectiveness of the 
experimental manipulations, To check the public- 
private manipulation the subjects were asked: 


What do you think the chances are of the other 
students in your psychology class finding out the 
scores you made on the personality tests? 


The subjects responded on a 41-point, a priori scale 
ranging from “Zero” (1.0) to “Certain” (5.0). To 
check the expectancy manipulation the subjects were 
asked: 


Suppose you were shown ten new cards from the 
social sensitivity test, cards you had never seen 
before. How many do you think you would get 
correct? Do not check the number you would 
like to get correct or the number you would try 
to get correct, rather, check the number you think 
you would probably get correct. 


The subjects checked 1 of 11 responses which ranged 
from “0” to “10.” 

Two other questions were asked. The subjects 
evaluated the adequacy of the social sensitivity test 
as a measuring instrument on a 61-point, a priori 
scale ranging from “Very Good Test” (1.0) to “Very 
Poor Test” (7.0). On a 51-point, a priori scale 
which ranged from “Not at All” (1.0) to “En- 
tirely” (6.0), the subjects rated the importance of 
the test results to the experimenter by answering 
the question: 


TABLE 1 
ANALYSIS OF VARIANCE OF NUMBER OF RESPONSES 
CHANGED 
Source df MS F 
ni 3 8.45 8.89 
lic pa private (A) 1 2.53 2.66 

EAEN (a) 3 .95 
Expectancy (B) 1 3.78 1.65 
Performance (C) 1 75.03 26.23 
AXB 1 03 | <1 
BXC 1 16.53 5.78* 
AXC 1 78 | <1 
AXBXC 1 03 | <1 
Error (b) 18 2.86 

* 

5 Sioi. 
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_ aged 3.12. The high expectancy-high ] 


How much do you think the success of 
search project depends on the test results 


RESULTS 


Effectiveness of Experimental Manipul 
and Procedure 


Presession and within-session co) 
tion among subjects was slight and 
geneous among the experimental cond 
Ten of the 32 subjects indicated a slig 
gree of preexperimental acquaintan 
one other subject in their session. Six í 
acquainted subjects were in the priv: 
dition, four in the public. Seven of # 
quainted subjects were in the high-disso 
conditions (high expectancy-low perfor 
and low expectancy-high performance 
in low-dissonance conditions, Fishel 
tests did not reveal a significant d 
within either pair of proportions, Two 
jects, both in the public condition, } 
that they knew one of the other subjec 
cial sensitivity test scores. One of thi 
in the high expectancy- -low performance ( 
dition, the other was in the high expec 
high performance condition. None of th 
jects felt that the others in their sessi 
their test scores, 

The public-private manipulation 
cessful. The public subjects felt it was 
likely that their classmates would learn’ 
scores than did the private subjects. The 
was 2.76 for the public subjects on the: 
pation of an audience question, and 1. 
the private. On this question a res 
1.00 indicated a belief that the chan 
zero that one’s psychology classmat 
learn one’s test scores. The mean differi 
significant, F = 25.10, df = 1/3, p<‘ 

The expectancy manipulation was sS 
ful also, The high-expectancy subject 
pected to get an average of 7.00 correc 
new set of 10 social sensitivity items 4 
low-expectancy subjects expected an a 
of 3.06. This difference is significant, 
42.79, df = 1/18, p < .001. The Perform 
and Expectancy X Performance effet 
not significant. The low expectancy- 
formance subjects averaged 3.00 and ti 
expectancy-high performance subjects 
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ance subjects averaged 7.00 and the high 
expectancy-low performance subjects also 
averagetl 7.00. 


Main Dependent Variable 


The main dependent variable was the num- 
ber of previous responses the subjects changed 
in the retake of Section 5. Table 1 presents 
the results of an analysis of variance of these 
data. The hypothesis of the present study— 
that the Expectancy X Performance inter- 
action would be contingent upon the public- 


_ private variable—was not confirmed; the 


three-factor interaction in Table 1 is not sig- 
ijificant. The Expectancy X Performance in- 
teraction is significant, F = 5.78, df = 1/18, 
p< .05. Table 2 presents the mean number 
of changes in the eight experimental condi- 
tions. Table 2 shows that the Expectancy X 
Performance interaction was of the form that 
subjects whose fifth section performance dis- 
confirmed their expectancy made more changes 
than subjects whose fifth section performance 
confirmed their expectancy. The high expec- 
tancyelow performance subjects averaged 5.62 
changes and the low expectancy-high per- 
formance subjects averaged 3.25. The high 
expectancy-high performance subjects aver- 
aged 1.12 changes and the low expectancy- 
low -performance subjects averaged 4.88. 
Aronson and Carlsmith obtained a similar 
Expectancy Xx Performance interaction, 
Differences between the means of the four 
expectancy and performance conditions were 
assessed by the Newman-Keuls studentized 
range testing procedure (Winer, 1962, p. 80). 
The high expectancy-high performance condi- 


_ tion produced significantly (p < .05) fewer 


changes than any of the other three conditions 
and they did not differ significantly among 
themselves, 

; There is also a significant performance ef- 
ect, F = 26.23, df = 1/18, p < .001. Low- 


Performance subjects averaged 5.25 changes 


se high-performance subjects averaged 2.19. 
ene Aronson and Carlsmith and Ward and 
andvold found a similar performance effect. 


Postexperimental Ratings 


pe experimental conditions did not sig- 
icantly affect the subjects’ ratings of the 
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TABLE 2 


MEAN NUMBER or RESPONSES CHANGED AS A FUNCTION 
or AUDIENCE ANTICIPATION, EXPECTANCY, AND 


PERFORMANCE 
Private | Public| Totals * 
Score obtained | „Score 
auat OMA A @) 

Low Low 5.00 | 4.75 |4.88, 
Low High | 5.75 | 5.50 |5.62. 

5.25 
High Low 3.75 | 2.75 |3.25. 
High High | 1.50 75 (1.12) 546 
N (cases per cell) 4 4 8 16 


Note.—Entries in Column 3 with different subscripts are 
significantly different from one another at the .05 level, 


importance of the test scores for the success 
of the research project. ; 

On the whole, the subjects’ evaluations of 
the adequacy of the social sensitivity test were 
unaffected by the experimental conditions. 
The only significant effect is the Expectancy 
x Performance interaction, F = 6.65, df= 
1/18, p < .025. The interaction was of the 
form that high-dissonance subjects evaluated 
the test less favorably than did low-disso- 
nance subjects. The high expectancy-low per- 
formance subjects averaged 4.49 in their 
evaluations of the test (a high score indicates 
an unfavorable evaluation) and the low ex- 
pectancy-high performance subjects, 5.14. The 
low expectancy-low performance subjects had 
an average evaluation of 4.11 and the high 
expectancy-high performance subjects, 3.45. 


Discussion 
Dissonance Reduction 


The hypothesis that persons reject a per- 
formance which is not consistent with their 
expectations has two sources of support in the 
present study. First, the subjects whose fifth 
section performance was dissonant with their 
expected performance changed more responses 
than did subjects whose fifth section perform- 
ance confirmed their expectancy. Second, 
performance on the fifth section did not influ- 
ence the subjects’ estimates of the score they 
would achieve on a new section of 10 cards. 
Expectancy alone influenced this variable; 
high-expectancy subjects had higher estimates 
than did low-expectancy subjects. A discrep- 
ant fifth section performance was not incor- 
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porated into the subjects’ estimates of future 
performance. These findings gain extra weight 
because the present study did not confound 
the experimental variables with subject-to- 
subject procedural variation. 

The present study’s dissonance-reduction 
effects are not as pronounced as those obtained 
by Aronson and Carlsmith. In addition to a 
significant Expectancy X Performance effect, 
they found that low expectancy-high per- 
formance subjects make significantly more 
changes than both the high expectancy-high 
performance subjects and the low expectancy- 
low performance subjects, In the present study 
the low expectancy-high performance subjects 
make significantly more changes than the high 
expectancy-high performance subjects but do 
not differ from the low expectancy-low per- 
formance subjects. Differences between the 
studies with regard to subject population or 
the social stimulus characteristics of the ex- 
perimeter may underlie the difference in mag- 
nitude of dissonance-reduction effects, It is 
also possible that Aronson and Carlsmith’s 
findings were inflated by uncontrolled sub- 
ject-to-subject procedural variation. 


Doing the Experimenter a Favor 


Aronson and Carlsmith suggested that a 
desire to do the experimenter a favor by per- 
forming consistently on his test, rather than 
a need to reduce dissonance, might have 
prompted the high-dissonance subjects to 
change their responses, This interpretation 
lacks support in the present study. If 
the apparently dissonance-reducing response 
changes in the present study were actually 
motivated by a desire to help the experimenter 
in his research, one would expect that high- 
dissonance subjects who changed many re- 
sponses would also report that they felt the 
test scores were vital to the research. Actually, 
the reverse occurred. For high-dissonance 
subjects the correlation between number of 
changes and ratings of how much the success 
of the research project depended upon the 
test scores is negative, r = —.48, and signifi- 
cant, ¢ = 2.303, df = 14, p < .05. The more 
responses they changed, the less they felt 
the project’s success depended upon the test 
scores. Among low-dissonance subjects num- 
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ber of changes is not significantly rel 
the ratings, r = .04. Although the cor 
do not differ significantly it almost 
as if the high-dissonance subjects f 
would be inconveniencing the experimente 
making changes, but changed anyway. 
more they changed, the more guilty 
and the more they minimized the score 
portance to the experimenter. 5 
The “do the experimenter a favor” in 
pretation assumes the social sensitivity | 
is an avenue for the subjects to commi 
their good will (or lack of it) tow. 
experimenter. If by changing their res 
the high-dissonance subjects had been 
the experimenter a favor, one would ni 
pect them to subsequently attack the 
menter, Yet attack him they did by de 
ing the test. The high-dissonance sul 
evaluated the test significantly more uni 
ably than the low-dissonance subje 
would seem that the subjects felt that ag 
test must be internally consistent, The 
dissonance subjects found the test | 
internal consistency, and good will tow: 
experimenter did not inhibit their unfavori 
evaluations of the test. 


Anticipation of an Audience 


In the present study the anticipation ol 
audience did not overcome dissonance- 
tion effects. Perhaps the public subjects 
not as certain that their classmates 
learn their scores as Ward and Sand 
were. The public subjects indicated 
moderate certainty of a future audien 
their postexperimental ratings. 

Another possibility is that the p 


different from the performances pre 
studies demonstrated were susceptible to 
ence effects, Travis (1925) studied 
pursuit performance, Bergum and 
(1963), vigilance performance. These 
haviors satisfy the commonly accepted 
nition of performance, the emission of 
viously learned responses, but the task 


study would seem to be affected primarily by 
cognitive processes of self-perception and self- 
consist€ncy; these processes are not usually 
employed in explaining variations in rotary 
+ pursuit or vigilance performance. Perhaps if 
a less cognitively determined task had been 
used in the present study, the hypothesized 
audience effects would have been observed, 
Also, further research must determine whether 
the dissonance-reduction effects of the present 
study are limited to judgment tasks. 
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Under the guise of learning an artificial language, 54 male Ss differentiated for T, 
perceived severity of childhood socialization, were led to expect either punish- > 
ment (electric shock) for failures, reward (cash payment) for successes, or 
information (“right” or “wrong”) for both successes and failures. Ss who 
ascribed more authoritarian child-rearing attitudes to their parents showed 
more perceived similarity to the instructor under the punishment condition, 
while the more democratically socialized Ss reported greater perceived similarity 
to the instructor under the reward and information conditions. Results were 
interpreted as indicating that the severity of childhood socialization influences 


patterns of adult identification. 


Since the time of Freud’s (1938) initial 
writing on the topic of identification, the 
concept has been regarded as a central vari- 
able in personality development (Bandura 
& Walters, 1963; Bronfenbrenner, 1960; 
Kagan, 1958). Two major processes of iden- 
tification have been described: anaclitic 
identification, or the imitation of a highly 
valued and positively regarded model; and 
identification with the aggressor, or the de- 
fensive imitation of a threatening model. 
These two processes have been related to 
qualitative differences in the relationship be- 
tween subject and model by a number of 
authors in subsequent writings (e.g., Bron- 
fenbrenner, 1960; Carlson, 1963; Freud, 
1960; Mowrer, 1950; Sanford, 1955; Sarnoff, 
1962; and others). A considerable amount of 
research has been reported to corroborate the 
anaclitic process (e.g., Bandura & Huston, 


1This study was supported in part by a small 
grant from the University of Kentucky Research 
Fund. The authors would like to express their ap- 
preciation to the University of Kentucky Comput- 
ing Center for assisting with the data analysis, The 
views expressed are those of the authors and do not 
necessarily reflect an official Position of the United 
States Public Health Service. 

2At the time of this study, at the University of 
Kentucky. 


1961; Helper, 1955; Mussen & Distler, 1959, 
1960; Payne & Mussen, 1956; Sears, 1953; 
Sears, Maccoby, & Levin, 1957). The process 
of identification with the aggressor, on the 
other hand, has stimulated less research, 
Anecdotal accounts of the imitation of threat- 
ening models have been described (Balint, 
1945; Freud, 1954), and an extensive natu- 
talistic report of threat-based behavior in 
a German concentration camp has appeared 
(Bettelheim, 1943). Two studies have also 
been reported which describe stable indi- 
vidual differences in patterns of identification 
in response to threat (Carlson, 1963; Sarnoff, 
1951). However, no experimental studies of 
the process have been reported. 

Sarnoff (1951) has argued that three basic 
conditions are necessary for the occurrence of 


identification with the aggressor: an aggres- . 


sor who is determined to impose his hostility 
upon another individual; a victim who is 
socially dependent upon the aggressor and 
who thus makes a convenient target for the 
aggressor’s hostility; and a system of social 
constraints such that the victim cannot com- 
pletely escape the influence of the aggressor. 
Anaclitic identification, on the other hand, 
is based on the rewarding qualities of the 
model, and thus would be expected to occul 
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in a situation in which the individual is 

socially dependent upon a model who is 

supportive and nonthreatening toward him, 

and who makes these behaviors conditional 
* upon the individual’s behavior (Bandura & 
Huston, 1961). 

It has been proposed that individual dif- 
ferences in childhood experiences affect the 
process of identification and influence the 
type of imitation shown by a person. In par- 
ticular, Balint (1945) has suggested that 


take a punitive and restrictive attitude 
toward pleasure-seeking activities show a de- 
creased tolerance for frustration and anxiety. 
As a consequence, the children show an 
increased tendency to identify with available 
models, especially those posing a threat to 
their self-esteem. 

Sarnoff (1951) has provided data which 
tend to support one aspect of Balint’s posi- 
tion by showing that people who identify 
with an aggressor more strongly (e.g., anti- 
Semitic attitudes in Jews), do in fact ascribe 
more rejecting, frustrating, and hostile atti- 
tudes to their parents than do people who 
resist such identification. The latter report 
more accepting and love-oriented parental 
attitudes. 

The present study was aimed at investi- 
gating identification in an experimental set- 
ting designed to encourage either an ana- 
ditic (support-based) or defensive (threat- 
based) relationship between young adults and 
a model. In addition to the experimental 
manipulation, subjects were also differenti- 
ated for their reports of the parents’ child- 
rearing attitudes: rejecting and hostile versus 
accepting and supportive. Under these condi- 
tions both anaclitic and defensive identifica- 
tion were expected to occur. Individual dif- 
ferences in degrees of identification as well 
as type of identification were also predicted. 


3 METHOD 
Subjects 


Subjects were drawn from male introductory 
percuelogy students at the University of Kentucky. 
es were unpaid volunteers and received 
ae credit for their participation. 
a jects who reported having been 
a act parental home were eligible. Also, since other 
studies involving similar experimental procedures 


„children reared in homes in which the parents: 


IDENTIFICATION WITH REINFORCING MODEL 
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were being conducted during the same semester, only 
subjects who had not participated in similar studies 
were used. A total of 69 subjects participated, al- 
though 15 were later excluded from the sample due 
to failure to cooperate with the experimental instruc- 
tions or failure to accept the experimental ruse 
(described below).8 The remaining 54 subjects 
represent the final sample. 


Apparatus 


The room in which the experiment was conducted 
contained two chairs and a table, on which a micro- 
phone and one set of earphones were placed in full 
view. Wires from the communication equipment 
Jed to a terminal box in the wall. The earphones 
were connected to a tape recorder in the adjoining 
room. Depending on the’ experimental condition to 
which the subject had been assigned, there was 
also present in the room additional experimental 
apparatus. For one arrangement of the room, an 
imposing looking “shock” apparatus was placed 
next to the subject’s chair, on an electrical apparatus 
rack, standing about 3 feet high. The apparatus 
contained a large stimulator unit with the words 
“shock stimulator” painted on it, a row of six patch 
panels above the stimulator to which a network 
of variously colored wires were attached, and a 
small unit of dials below the stimulator. Colored 
lights located on the stimulator and the dial unit 
were turned on at all times. Electrodes leading from 
the apparatus were resting on the table in front of 
the subject. Wires leading from a different outlet 
on the apparatus went to the adjoining room under 
a connecting door. A second arrangement of the 
room involved the chairs, table, and communication 
equipment, but without the shock apparatus. In this 
condition three rolls of quarters wrapped in standard 
orange-colored bank rolls with $10 and the word 
Quarters printed on the side in large characters were 
displayed on the table. One roll was “broken open,” 
with 10 quarters piled next to the filled rolls. A 
pad of receipts with the words University of Ken- 
tucky printed at the top was placed next to the 
quarters. The third arrangement of the room 
included only the chairs, table, and communication 


equipment. 


Procedure 


Subjects were contacted during the laboratory 
period of their class and asked to volunteer for an 
experiment in Jearning. When they appeared at their 
appointed time, they were accompanied to the 
experimental room by a male undergraduate experi- 
menter. All subjects were run individually by the 
same experimenter, who was carefully supervised in 


8 The identification scores (described below) of 
all 15 subjects omitted from the sample were com- 
pared with the scores of those included in the study. 
In 11 out of 15 cases the scores fell within 1 stand- 
ard deviation of their corresponding cell means, and 
in all 15 cases the scores were within 1.31 standard 
score units of their corresponding cell means, 
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the conduct of the study, but who was naive with 
respect to experimental hypotheses. 

Upon entering the room, the experimenter seated 
the subject and told him that before beginning the 
main experiment he would be asked to fill out a 
brief questionnaire for another purpose and at the 
request of another experimenter. He was told that 
since it would take only a short time, it could be 
“gotten out of the way” before the main experi- 
ment. The experimenter read the instructions with 
the subject, answered any questions raised, and then 
permitted him to complete the ratings on the form 
while the experimenter busied himself with other 
papers. The questionnaire items were actually an 
abbreviation of the Traditional Family Ideology 
(TFI) Scale (Levinson & Huffman, 1955), which 
was obtained for the purpose of later differentiating 
subjects into levels of parental authoritarianism 
and restrictiveness toward child socialization. Fifteen 
items were chosen from the full scale which rep- 
resented parental attitudes of authoritarian sub- 
mission, conventionalism, exaggerated reliance on 
masculinity and femininity, moralistic rejection of 
impulse life, and extreme emphasis on discipline. 
Items were selected (or in several cases rephrased) 
to apply to both parents and to balance the direction 
of responding. A 6-point scale, ranging from very 
strong agreement to very strong disagreement, was 
provided for each item. Subjects were instructed to 
recall the period when they were growing up and 
answer the items the way they thought their parents 
would have responded to them. 

Following completion of the TFI, the question- 
naire was put aside and the subject was told that 
the main experiment was beginning. The experi- 
menter read to the subject the instructions to the 
experiment in a standard and conversational manner, 
The experiment was presented as an investigation 
of the effectiveness of a particular method of learn- 
ing an artificial language, Esperanto, He was told 
that he would receive a number of basic principles 
about the language through the earphones, and then, 
after he had become familiar with the principles, 
he would be given a list of 35 words in Esperanto 
one at a time and at a constant rate, to which he 
was to respond with the English translation. 

Subjects were assigned on a random basis to 
one of three experimental treatments. A punish- 
ment group (run in the presence of the shock 
apparatus) was told that the method of learning 
being studied was the use of punishment as a learn- 
ing technique. The subject was told that the elec- 
trodes would be attached to his left hand just 
before starting, and that each time he made an 
etror in translation or failed to report the correct 
translation within 5 seconds, the instructor in the 
other room would push a button and he would 
receive a shock through the electrodes. He was told 
that the shock would be quite strong, and would 
probably hurt a little. He was also told that most 
people tend to get 15-20 shocks under these condi- 
tions, and that some get more or less, depending on 
how quickly ‘they learn. Finally, he was told that 
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he would be able to ask any questions he had later, 
but that there was another part to the study that 
he was to be told about first. y 

The reward group (run in the presence of the 
quarters and receipt pad) was given comparable 
instructions which stressed the expectation that the, 
instructor in the other room would press a button 
after each correct translation within the time period, 
and the subject would be given a quarter., The 
experimenter demonstrated how quarters would be 
moved to the subject’s reward pile on his side of 
the table, and the experimenter said that the 
quarters would be his winnings, and would be his 
to keep. The subject was also told that a receipt 
for, his winnings would be requested, and he was? 


led to believe that most people tend to get 15-20 : 


quarters under these conditions, with some getting 
more or less, depending on how quickly they learn, 
The control group (run in the absence of support- 
ing equipment) was given comparable instructions 
which stressed the expectation that the instructor 
in the other room would say “right” after each 
correct translation within the time period, and 
would say “wrong” if the translation were incorrect 
or too slow. Subjects were also told that most 
people tend to get about 15-20 correct translations 
under these conditions and about the same number 
incorrect, 

The experimental treatments, then, were designed 
to place the subject in a dependent relationship 
with the instructor in the adjoining room, who was 
characterized as specially trained in the material to 
be learned, and who was to be either punishing, 
rewarding, or informative while teaching the subject 
the language. 

To measure the degree of identification present, 
an additional manipulation was introduced. Follow- 
ing the instructions concerning the task and the 
nature of the relationship between the subject and 
the instructor, the experimenter indicated that an- 
other aspect of learning was also being studied. 
This aspect was “what the student knows about his 
instructor.” The subject was informed that experts 
think that knowing one’s instructor is an important 
factor in learning. He was told that this was also 
being investigated, and that he was to overhear an 
interview in the adjoining room between the pro- 
fessor conducting the study and the subject's 
instructor. He was told that both the professor and 
the instructor knew he would be listening. He was 
also told that he should try to form as clear 4 
Picture of his instructor as possible while listening. 

The interview was actually a 4-minute tape 


recording which the experimenter started in the next ' 


room by leaving the experimental room briefly, 
ostensibly to inform the professor and instructor 
that the subject was ready to begin. The content 
of the interview was quite general and dealt with 
a graduate student, had gained of the University, 
Lexington, and the Surrounding horse farms. 
After the subject had heard the interview, he was 
asked to make a series of ratings about his instructor 


impressions the instructor, who was identified a ' 


— A 


aoe 


1 a 
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from impressions he had gained. When the ratings 
of the instructor were completed, the sheet was 
withdrawn and he was asked to rate himself on 
another sheet containing the same items. Finally, the 
self-ratings were withdrawn, and he was asked to 


e tate the average male student at the University 


of Kentucky, using the same items. The ratings 
were made on 22 bipolar adjectival dimensions, each 
of which was judged on a 7-point equal-appearing 
interval scale. The dimensions used were obtained 
from Cattell’s (1950) rating scale items. 

Following completion of the three sets of ratings, 
the subject was given a booklet containing four 
questions* about the experiment which were rated 

e on either 31- or 36-point scales. The specific items 
referred to: how pleasefl he felt about participating 
in the experiment; how much he would like to be 
an instructor for this kind of task in a future 

“experiment; how many correct (incorrect) responses 
he thought he would get in translating the words; 
and how valuable he felt research of this kind is. 
Results of the ratings on the four items proved to 
be unreliable, and thus were not considered further. 

After the subject had completed all the ratings, 
he was informed that the experiment was over and 
he was interrogated briefly concerning his accept- 
ance of the experimental ruse. Only those 54 sub- 
jects accepting the ruse were included in the sample. 
Finally, subjects were asked to avoid discussing the 
Seu mene with other students and were dismissed. 


RESULTS AND DISCUSSION 


Subjects were classified into six experi- 
mental groups by dividing each of the three 
treatment groups at the median TFI score. 
Six groups of nine subjects each were ob- 
tained. Comparison of TFI scores across €x- 
perimental conditions was done in order to 
evaluate the comparability of the experi- 
mental groups with respect to the degree of 
authoritarianism ascribed to their parents. 
“The three experimental groups were found 
to be quite comparable in levels of TFI 
(Fs < 1.00). 

Product-moment correlations between the 
ratings of the instructor, the self, and the 
average male student were computed for each 
Subject. Partial correlations between self and 
instructor were then obtained, with the aver- 
age male student ratings controlled. In this 
way, characteristics attributable to male col- 
lege students in general were partialed out, 
and a more direct measure of perceived 
Similarity between the subject and the par- 
ticular instructor was obtained. This measure 
was adopted as an index of identification. 
Transformation of these correlations to 2 
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TABLE 1 
ANALYSIS OF VARIANCE OF TRANSFORMED 
IDENTIFICATION SCORES 
Source af MS F « 
Between cells 5 
Conditions (C) 2 -264 1.28 
Socialization (S)| 1 -123 <1.00 
cxs 2 1.042 5.08* 
Error 48 .205 
Total 53 
*p=.01. 


scores was done in order to allow parametric 
analyses. An analysis of the variance of the 
transformed identification scores was then 
done. Table 1 summarizes these results. 

It can be seen that both the experimental 
conditions and the individual differences in 
severity of childhood socialization considered 
separately show no reliable relation to identi- 
fication scores. The interaction of the experi- 
mental conditions and levels of severity of 
socialization did reach significance at the .01 
level, however. Mean transformed identifica- 
tion scores for the six groups are reported in 
Table 2. It can be seen from these scores that 
more identification occurred in high TFI 
subjects under the punishment condition, 
while the low TFI subjects tended to show 
more identification under both the reward 
and control (information) conditions. This 
difference in patterns of identification be- 
tween TFI groups is significant at the .01 
level by two-tailed ¢ test (¢ = 2.75, df = 48). 
Individual comparisons indicate that the dif- 
ferences are primarily attributable to the 
unusually high degree of identification shown 
by the high TFI subjects in the punishment 
condition, The high TFI punishment group 
differed from the other high TFI groups and 
from the low TFI punishment group at the 
01 level, and tended to differ from the re- 


TABLE 2 
MEAN TRANSFORMED IDENTIFICATION SCORES 


Condition 
Severity of 
socialization 
Punishment Reward Control 
High TFI 89 2t -26 
tee TFI 24 A8 A2 
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maining low TFI groups (f’s < .10). Con- 
sidered alone, the three low TFI groups 
did not differ reliably between experimental 
conditions. 

* The overall pattern of results indicates 
rather persuasively that differences in the 
type of identification obtained in the present 
situation depend on individual differences in 
perceived childhood socialization experiences. 
Subjects who report being reared in restric- 
tive, authoritarian homes show significantly 
more identification with the aggressor while 
subjects who report being reared in pérmis- 
sive, democratic homes tend to show more 
anaclitic identification, These results are con- 
sistent with previous data reported by Sarnoff 
(1951), and tend to confirm Balint’s (1945) 
hypothesis that more severely socialized chil- 
dren develop tendencies to identify defen- 
sively, The absence of an overall difference 
in identification as a function of differences 
in severity of socialization alone is at vari- 
ance with the second aspect of her hypothesis, 
however. If such differences in adjustment 
can be said to exist after the subjects have 
reached maturity, they appear to be second- 
ary to predispositions to respond selectively 
to the affective quality of the situation 
encountered. 

On the basis of the present findings a 
more tenable hypothesis appears to be that 
individuals reared in an authoritarian atmo- 
sphere, which required the development of 
tendencies toward defensive identification in 
childhood, will be more likely to identify with 
an aggressor as an adult. On the other hand, 
individuals whose patterns of identification 
developed in relation to supportive demo- 
cratically oriented parents will be more re- 
sponsive to situations which engender ana- 
clitic identification. Such an interpretation 
appears to be related to differences in con- 
formity and submission training in childhood 
socialization which have been emphasized in 
the authoritarian ideology research (Adorno, 
Frenkel-Brunswik, Levinson, & Sanford, 
1950). On the basis of their interview data, 
these authors have concluded that 


much of the submission to parental authority in 
the [extremely high scoring] prejudiced subject 
seems to be induced by impatience on the part of 
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the parents and by the child’s fear of displeasing 
them [p. 385]. 


This threat-based training, often accom- 
plished at the hands of extremely harsh and 
arbitrary parents, appears to be coextensive 
with the present concept of identification 
with the aggressor. Accordingly, the results 
of the high TFI groups in the present experi- 
ment are congruent with earlier clinical find- 
ings vis-a-vis the authoritarian personality. 

The problem of why the conditions favor- 


ing identification were not different from the . 


control condition in thé present experiment 
raises an unanswered question. This result is 


somewhat surprising, since at least the ana- 


clitic process of identification has been dem- 
onstrated previously by several investigators 
under different conditions (e.g., Bandura & 
Huston, 1961; Mussen & Distler, 1959, 
1960). One possible explanation for the 
present, somewhat anomalous result appears 
to be that the control (information) condi- 
tion was not actually as neutral as it was 
intended to be. That is, it may have been 
that the provision of information by the 
instructor established the anticipation of a 
supportive relationship, There is some evi- 
dence to support this possibility. As can be 
seen in Table 2, the subjects in the control 
condition responded quite similarly to the 
subjects in the reward condition. In any 
event, alternative control procedures appear 
warranted for further research using this 
situation. 
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MODIFICATION OF SELF-IMPOSED DELAY OF RE- 


WARD THROUGH EXPOSURE TO LIVE AND 
SYMBOLIC MODELS* 


ALBERT BANDURA anp WALTER MISCHEL 
Stanford University 


In a comparative test of the relative efficacy of live and symbolic models for 
modifying delay-of-reward behavior, groups of children with marked preferences 
for either immediate but less valued rewards, or more valuable delay rein- 
forcers, were assigned randomly to 1 of 3 experimental conditions. 1 group 
observed live models who exhibited delay behavior that was counter to the 
children’s pattern; a 2nd group was presented essentially the same “modeling 
cues except in symbolic verbal form; while a 3rd group had no exposure to any 
models. Changes in Ss’ delay-of-reward behavior were measured immediately 
following exposure to the modeling procedures, and reassessed approximately 1 
mo. later within a different stimulus situation. Both live and symbolic models 
produced substantial modifications in delay-of-reward behavior within the im- 
mediate social-influence setting, but the changes induced in high-delay chil- 


dren through exposure to symbolic models were less stable over time. 


Theory and research relating to the process 
of internalization and the development of 
self-control have been largely confined to 
resistance to deviation and the occurrence of 
self-punitive or restitutive responses following 
transgression (Aronfreed, 1964). Equally im- 
portant and perhaps even more prevalent 
behavioral manifestations of self-control are 
the manner in which persons regulate the 
self-administration of highly rewarding re- 
sources over which they have control (Ban- 
dura & Kupers, 1964; Kanfer & Marston, 
1963), and their willingness to defer immedi- 
ate rewards in favor of delayed, more highly 
valued reinforcers (e,g., Mischel & Gilligan, 
1964; Mischel & Metzner, 1962). 

In recent years there have been numerous 
investigations into behavioral manifestations 
of willingness to defer immediate gratifica- 
tion, These studies have typically employed a 
research paradigm in which subjects are con- 
fronted with real choices between immediately 
available but less valued rewards, as opposed 
to delayed but more valuable reinforcers 


1This investigation was supported by Research 
Grants M-5162 and M-6830 from the National In- 
stitutes of Health, United States Public Health 
Service, 

$ The authors are indebted to Ruthe Burg, who as- 
sisted with various phases of this project, and to 
Nicholas Anastasiow, Palo Alto Unified School Dis- 
trict, for his aid in arranging the research facilities. 


(e.g., Mahrer, 1956; Mischel, 1958, 1961c). 
Results from this procedure provide evidence 
that delay responses are relatively stable, 
tend to increase with age, and are systemati- 
cally related to other theoretically relevant 
variables usually subsumed under “ego- 
strength” constructs (e.g., Mischel, 1961b, 
1961c, 1965). It has likewise been demon- 
strated that delay responses can be increased 
by direct training through raising the 
probability that the delayed reward is forth- 
coming (Mahrer, 1956; Mischel & Staub, 
1965), or by decreasing the delay interval 
(Mischel & Metzner, 1962). Relevant data 
are lacking, however, on the effects of social- 
learning variables that might be expected to 
play an influential role in the establishment’ 
of delay-of-reward behavior. 

It is generally assumed that the acquisition 
and maintenance of the various forms of self- 
controlling responses are primarily achieved 
either through direct aversive stimulation 
(Aronfreed, 1964), or by means of complex 
intrapsychic mediational processes when im- 
mediate gratification is unavailable (Freud, 
1946). Directly experienced rewarding and 
punishing response consequences are undoubt- 
edly important factors in the development 
and maintenance of self-controlling responses. 
However, a number of recent studies have 
demonstrated that response inhibition (Ban- 
dura, 1965; Bandura, Ross, & Ross, 1963; 
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Walters, Leat, & Mezei, 1963) and the con- 
tingent self-regulation of reinforcers (Ban- 
dura x Kupers, 1964; Bandura & Whalen, 
1966; Marston, 1965; Mischel & Liebert, 
1966) can be readily transmitted vicariously 
without the mediation of direct reinforce- 
ment, In accord with the theory of vicarious 
learning, it is plausible to hypothesize that 
self-imposed delay of reward is likewise in- 
fluenced by the delay patterns displayed by 
social models. Indeed, some suggestive evi- 
dence for the influence of parental modeling 
in the developmenteof children’s willingness 
to delay rewards is provided in a study by 


, Mischel (1958) who found that children from 


the Trinidadian Negro subculture, in which 
immediate self-reward is the prevailing grati- 
fication pattern, displayed a greater preference 
for immediate rewards than children of Trini- 
dadian Indians, who characteristically exhibit 
self-denying delayed-gratification behavior. 

It is often mistakenly assumed that vicari- 
ous or imitative learning is essentially limited 
to younger age groups and to stimulus situa- 
tions in which real-life models exhibit, inten- 
tionally or unwittingly, the desired social- 
response patterns, While undoubtedly much 
observational learning is fostered through ex- 
posure to live models (Bandura, 1962, in 
press; Bandura & Walters, 1963), once a per- 
son has developed an adequate verbal reper- 
toire, increasing reliance is generally placed 
on symbolic models presented in the form of 
oral or written behavioral descriptions, pic- 
torial displays, or through a combination of 


„verbal and pictorial devices. The influence of 


verbally presented normative models in shap- 
ing and controlling social behavior has been 
abundantly documented in the research in 
experimental social psychology (Berg & Bass, 
1961; Hovland, Janis, & Kelley, 1953). 
There have been, however, no systematic 
comparative studies of the relative magnitude 
and stability of changes in social behavior as 
a function of exposure to real-life and sym- 
bolic modeling cues. Consequently, the pres- 
ent investigation studied the relative efficacy 
of both live and verbally presented symbolic 
models in modifying children’s delay-of-re- 
ward behavior. 

In the experiment reported in this paper 
children who exhibited predominantly either 
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delayed-reward or immediate-reward patterns 
of behavior were assigned randomly to one of 
three treatment conditions. One group of chil- 
dren observed live adult models who exhibited 
delay-of-reward responses counter to tife 
group’s self-gratification pattern; a second 
group was similarly exposed to a model dis- 
playing the opposite delay-of-reward behavior 
with the exception that the modeling cues 
were presented in written form, while a third 
group had no exposure to any models. Imme- 
diately following the experimental procedures 
the children’s delay-of-reward responses were 
measured in the absence of the model. In or- 
der to test the generality and stability of 
changes in delay behavior, the subjects were 
reassessed by a different experimenter in a 
different social setting approximately 1 month 
after completion of the experimental phase 
of the study. 

It was predicted that the modeling proce- 
dures would alter the children’s delay-of-re- 
ward behavior in the direction of their model’s 
response dispositions. Since an actual per- 
formance is apt to provide substantially more 
relevant cues with greater clarity than can be 
conveyed by a verbal description, it was also 
expected that live models would prove more 
efficacious than symbolic models in modifying 
children’s self-imposed delay tendencies. 

The maintenance of response patterns es- 
tablished through vicarious experiences is 
highly dependent upon reinforcement-related 
variables (Baer & Sherman, 1964). In view 
of the absence of any information concerning 
the naturalistic reinforcement contingencies 
that were operative during the relatively long 
period of time elapsing between the post- 
exposure and the generalization test phases of 
the experiment, no predictions were advanced 
regarding group differences based on the ter- 
minal assessment. 


METHOD 
Subjects 


The subjects were 60 boys and 60 girls selected 
from the fourth and fifth grades of three elementary 
schools in the Stanford vicinity. 


Design and Procedure 


Preexperimental assessment of delay-of-reward 
responses. In the initial phase of the experiment 
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approximately 250 children were administered in 
their classroom groups a series of 14 paired rewards, 
in each of which they were asked to select either a 
small reward that could be obtained immediately, or 
a more valued item contingent on a delay period 
tenging from 1 to 4 weeks. The group administration 
(Mischel & Gilligan, 1964) proceeded in the follow- 
ing manner: Children were provided individual 
booklets containing on each page a brief description 
of a given set of paired objects and the associated 
time interval. After the experimenter had displayed 
both rewards and explained the temporal contin- 
gency, the children were instructed to record their 
choice, and to turn the page in preparation for the 
next set of items. The subjects were also advised to 
choose carefully and realistically because in ðne of 
the choices they would actually receive the item they 
selected, either on the same day or after the pre- 
scribed delay period, depending upon their recorded 
preference. 

Half of the sets of paired rewards involved small 
amounts of money (eg, $.25 today, or $.35 in 1 
week), while the remaining items included edibles 
(eg., small bag of salted peanuts today, or a can 
of mixed nuts in 2 weeks), children’s magazines, and 
various play materials (e.g., small rubber ball today, 
or a large rubber ball in 2 weeks). 

From the total pool of subjects those falling in the 
extreme top and bottom 25% of the delay-score dis- 
tribution, computed separately for boys and girls, 
were selected for the succeeding phases of the experi- 
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ment. The low-delay group consisted of 60 chil- 
dren, 30 boys and 30 girls, who displayed a marked 
preference for immediate reward (mean percentage of 
immediate choices = 83); in contrast, the group of 
60 high-delay children exhibited a consistent pattern 


of delay behavior (mean percentage of delay choices i 


= 93). The subjects in each of these two groups 
were then randomly assigned to one of three treat- 
ment conditions, with 10 boys and 10 girls in each 
subgroup (Table 1). 

Experimental treatments. Approximately 4 weeks 
elapsed between the initial assessment of the chil- 
dren’s delay behavior and the experimental phase 
of the study. Two female experimentei's, each work- 
ing with a different male and female model, con- 
ducted the experimental treatments. 

In order to help create the set that the two 
phases of the experiment were unrelated, the teachers 
announced to their classes that a new group of ex- 
perimenters from Stanford University would be 
conducting a similar, but independent, project con- 
cerned with object preferences of different groups 
of both children and adults, 

The subjects were brought individually from their 
classrooms to the experimental room where the ex- 
perimenter explained that the adults had been 
invited to appear at the school, since it was difficult 
on weekdays to see them in their various places 
of employment. To insure that the model was en- 
dowed with adequate prestige, the adults were 
described as recent college graduates. All children 


TABLE 1 
SUMMARY OF THE EXPERIMENTAL DESIGN 
Phase 1 Phase 3 si 
Erperimental Ree Heenan Posttreatment Test tor the generality 
mei re ili 
delay behavior ioe delay behavior E uehayioe 
High delay 
I rae, 
(N = 20) Administration of 14 Live model exhibits an | Administration of 14 Administration of 14 
paired rewards immediate-reward paired rewards paired rewards 
fi (Set A) pattern (Set B) (Set A) 
(N = 20)| Same Symbolic model of Same Same 
immediate-reward 
pattern presented 
in written form 
OL 
(N = 20)} Same No model present Same Same 
Low delay 
nM 20) | S: 
= ame Live model exhibitsa | Sa: e 
delayed-reward ii ne 
v pattern 
(N = 20) | Same Symboli 
ymbolic model of Sam 
delayed-reward 3 pee 
pattern presented 
a in written form 
(N = 20)| $ 
) | Same No model present Same Same 
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who were assigned to the experimental treatments 
observed same-sex models. 

For children in the live-model condition, the adult 
entered shortly after the child was seated and in- 
troduced himself to the experimenter and to the sub- 
ject. In order to further increase credibility, the 
experimenter described the choice procedure to the 
model in considerable detail as though he were a 
naive subject. The instructions stated the objects 
would be presented in pairs and in each case the 
subjects were to select either a less valued item that 
could be obtained immediately, or a more valued 
object conditional on a specified delay period. The 
participamts were also informed that they would in 
fact receive 1 of their 14 choices. * 

On the pretext of the model's “busy schedule,” 
the experimenter first administered the items to the 
model while the child waited for his turn. The 


* model’s paired rewards included such adult-appropri- 


ate items as chess sets, paperback books, hi-fi maga- 
zines, gourmet candy bars, jars of instant coffee, 
and monetary choices. Although the items differed 
from those subsequently administered to the chil- 
dren, the delay intervals necessary for attaining the 
more valuable rewards were similar in both sets of 
items. During the modeling phase the experimenter 
displayed each pair of items to the model who then 
indicated his choices verbally. 

With high-delay children, the model consistently 
selected the immediately available rewards and in 
sevefal instances commented briefly, according to a 
prearranged script, on the benefits of immediate 
self-reward (eg., “Chess figures are chess figures. I 
can get much use out of the plastic ones right 
away.”) In addition, after the fourth item, the 
model casually summarized his immediate-gratifica- 
tion philosophy of life as follows: “You probably 
have noticed that I am a person who likes things 
now. One can spend so much time in life waiting 
that one never gets around to really living. I find 
that it is better to make the most of each moment 
or life will pass you by.” While the model periodi- 
cally extolled the virtues of immediate self-gratifi- 
cation, he carefully refrained from deprecating de- 


“lay behavior; otherwise, it would be impossible to 


determine whether any changes in the children’s 
behavior were a function of positive modeling of 
immediacy, or the modeling of negative attiudes 
toward postponement of gratification. 

With low-delay children the procedure was identi- 
cal to ‘that described above except the model con- 
sistently selected the more valued delayed rewards. 
The model likewise commented periodically on the 
Virtues of self-imposd delay (e.g, “The wooden 
chess figures are of much better quality, more at- 
tractive and will last longer. PI wait two weeks 
for the better ones.”), and expounded his postpone- 
ment-of-gratification philosophy of life in the fol- 
lowing manner: “You have probably noticed that 
Tam a person who is willing to forego having fewer 
or less valuable things now, for the sake of more 
and bigger benefits later. I usually find that life is 


more gratifying when I take that carefully into 
account.” 

In both of the above experimental treatments, 
the models departed immediately upon completion 
of the choice task so as to remove situational pres- 
sures on the children to adopt the models’ self- 
rewarding dispositions. 

In the symbolic-model condition, the experimenter 
explained to the child that the adult who was 
scheduled for the same time had to leave early and 
consequently, he had already made his selections. 
In addition, the subject was told that children and 
adults are typically seen simultaneously to expedite 
matters; therefore, in order to keep the conditions 
as similar as possible for all participants, he would 
first Be shown the paired items that were admin- 
istered to the adult together with his recorded pref- 
erences and comments. The child was then handed 
the answer booklet in which were written both the 
model’s choices and accompanying philosophy-of- 
life commentaries. The experimenter then exhibited 
each choice pair sequentially while the subject read 
the corresponding verbal accounts of the model’s 
behavior. 

In the no-model-present condition the children 
were informed that, because of scheduling difficul- 
ties, no adult would be present, but in order to 
insure intersubject comparability the items between 
which adults ordinarily make choices would be 
shown, The experimenter then simply displayed the 
series of paired objects. This procedure was adopted 
in order to control for any effects that mere ex- 
posure to a set of reinforcers might have on 
children’s subsequent delay behavior. 

Postexposure assessment of delay behavior. Im- 
mediately following the above procedures each child 
was individually administered his own set of 14 
paired items. The rewarding objects in the latter 
series differed from those employed in the initial 
phase of the experiment, but the money items were 
the same since pretesting revealed that subjects were 
unable to recall the exact amounts and temporal 
intervals involved. In order to maintain realistic 
choice behavior on all items, the specific payoff 
objects were varied randomly among the children, 
They were thus unable to predict which one of 
their choices they were likely to receive even if some 
prior communication between subjects had occurred. 

Test for generalization and stalibility of altered 
delay patterns. Between 4-5 weeks after the experi- 
mental treatments all children were readministered 
the initial set of 14 items within their classroom 
settings utilizing the same group procedure described 
earlier. The same experimenter who had conducted 
the preexperimental measurement of delay responses 
presided over the follow-up assessment. Considering 
the relatively long temporal intervals separating the 
various phases of the study, the variation in experi- 
menters, and the fact that in the terminal assessment 
children recorded their preferences privately in the 
context of their natural classroom situation, it was 
assumed that the follow-up data would provide a 
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Fic. 1. Mean percentage of immediate-reward 
responses by high-delay children on each of three 
test periods for each of three experimental condi- 
tions. 


particularly stringent test of both the generality and 
the stability of modeling effects. 

Dependent measures, Changes in the subjects’ 
behavior were measured in terms of the relative 
number of delayed responses produced by the low- 
delay groups and conversely, the number of im- 
mediate responses displayed by children in the high- 
delay groups at each of the three phases of the 
experiment, 


RESULTS 


Figure 1 shows the mean percentage of 
immediate-reward responses produced by the 
high-delay children on each of the test peri- 
ods as a function of treatment conditions. 


TABLE 2 


ANALYSIS OF VARIANCE oF ĪMMEDIATE-REWARD Re- 
SPONSES BY CHILDREN Wao Hap EXHIBITED 
A DELAYED-REWARD ORIENTATION 


ALBERT BANDURA AND WALTER MISCHEL 


Analysis of variance of these data (Table 
2) reveals that the main effects of modeling 
and experimental phases are highly signifi- 
cant sources of variance, In addition, the two 
modeling procedures proved to be differen- 


tially effective at the immediate postexposure ` 


and the later generalization phases of the 
experiment. 

Further comparisons by the ¢ test of pairs 
of means across experimental phases (Table 
3) show that high-delay children in all three 
conditions not only altered significantly their 


delay-of-reward behavior,in favor of immedi- ` 


ate gratification, but also maintained the 


response changes long after the experimental i 


interventions. 

Moreover, comparisons between groups at 
each of the test phases (Table 4) reveal that 
children who had been presented either live or 
symbolic models differed substantially in 
their postexposure delay behavior from the no- 
model controls, with the live-model condition 
yielding the greatest differences. It should 
also be noted that while the live and symbolic 
models were equally effective within the ,im- 
mediate social-influence setting, the changes 
in delay responses induced by the live model 
were significantly more stable over time. This 
is shown in the finding that high-delay chil- 
dren who had observed the live model con- 
tinued to display in the generalization test 
a significantly higher level of immediate- 
reward responses than children in either the 
symbolic model or the control groups, which 
did not differ significantly from each other. 


Source df _ MS P 

Model (M) 2 | 134.12 5.37% 
Sex (S) 1 0.80 | <1 
MXS 2 30.42 1.22 
Error (b) 54 24.99 
Experimental phases (P) 2 | 550.59 | 59,39 
PXS 2 615 | <1 
PXM 4 43.60 4,70* 
PXMxXS 4 417 | <1 
Error (w) 108 9.27 

*p <.02, 


* b <.01. 
wD <.001 


TABLE 3 
Comparison OF PAIRS of MEANS ACROSS 
EXPERIMENTAL PHASES 
Post- 
Pretest | Pretest | exposure 
versus e e 
Treatment iti post- generali- vel 
isc pacitions exposure | zation | generali- 
test test zation 
t t test 
t 
Immediate-reward treatment 
Live model 7.13% | 7.8770 0.64 
Symbolic model 476e | 4.07% | 0.69 
No model control 2.77 3.37** 0.60 
Delayed-reward treatment 
Live model 6.968% | 4.32%¢* | 3.09% 
Symbolic model 5.03% | 3.41% 1,34 
No model control 4.82% | 1.99 1,93 
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TABLE 4 


COMPARISON OF PAIRS OF MEANS BETWEEN TREATMENT 
Gonprtions For HicH-DELAy CHILDREN 
EXPOSED TO IMMEDIATE-REWARD 
MopELs 


Live model | Live model | Symbolic 
model 


versus 
Experimental phases rabols versua Serena 
model control 
Postexposure test | <1 3.41 2.46" 
Generalization test | 2.52% | 4,224 1.70 


oe} Shi, 


The corresponding set of data for low- 
delay children exposed to models exhibiting 
a preference for more valued delayed rein- 
forcers is presented graphically in Figure 2. 

Analysis of variance of these scores (Table 
5) reveals that the children’s willingness to 
delay gratification increased substantially 
across the phases of the experiment. 

Although the overall differences between 
the three experimental groups were not of 
statistically significant magnitude, it is evi- 
dent from supplementary analyses (Table 3) 
that *the two conditions employing modeling 
procedures were chief contributors to en- 
during increases in delay behavior. Within- 
treatment comparisons disclose that both 
forms of modeling produced highly significant 
temporary and long-term increases in self- 
imposed delay of reward. On the other hand, 
although the no-model control subjects ex- 
hibited a temporary change, their subse- 
quently assessed delay behavior did not differ 
significantly from their preexperimental level. 
Unlike the findings based on the high-delay 
children, however, the low-delay subjects 
were not differentially affected by the live 
as compared to the symbolic model. Another 
noteworthy difference between the two sets 
of data is the finding that experimentally 
induced immediate-reward responses gener- 
ally remained more stable over time (see 
Figure 2 and Table 3) than did self-imposed 
delay behavior, 


Discussion 
The results of the present investigation 
Provide further support for the influential 
role of modeling variables in the social trans- 
mission of self-controlling responses. Children 
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MEAN PERCENT OF DELAYED-REWARD RESPONSES 
. 


POST-EXPOSURE GENERALIZATION 


PHASES OF THE EXPERIMENT 
Fic. 2. Mean percentage of delayed-reward re- 


sponses by low-delay children on each of three test 
periods for each of three experimental conditions. 


2 PRETEST 


who had shown a predominantly delayed- 
reward pattern displayed an increased pref- 
erence for immediate and less valued rewards 
as a function of observing models favoring 
immediate gratification; conversely, subjects 
who had exhibited a marked disposition to- 
ward immediate rewards displayed an endur- 
ing increased willingness to wait for more 
highly valued delayed reinforcers following 
exposure to models displaying high-delay 
behavior. 

Although in all subgroup comparisons the 
live model produced changes of greater mag- 
nitude than those induced by the verbally 
presented symbolic model, both forms of 


TABLE 5 


ANALYSIS OF VARIANCE OF DELAYED-REWARD RE- 
SPONSES BY CHILDREN WHO Hap EXHIBITED 
AN ĪMMEDIATE-REWARD 


ORIENTATION 
Source of MS F 
Model (M 2 25.34 1.61 
Sex SN : 1 34.68 2,20 
MXS 2 12.67 | <1 
Error (b) 54 15.75 
imental phases (P) 2 | 305.58 | 46.30% 

PRSNE ae 2 | 17.44 | 2.64 
PXM 4 9.80 1,48 
PXMXS 4 3.89 | <1 
Error (w) 108 6.60 


+p <.001. 
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modeling procedures were essentially equally 
efficacious within the immediate social-influ- 
ence setting. With high-delay children, how- 
ever, the symbolic model yielded relatively 
weaker long-term effects. There are several 
possible explanations for the fact that simi- 
lar long-term differences between modeling 
conditions were not obtained based on find- 
ings from low-delay subjects. This dis- 
crepancy may partly reflect the effects of 
differential reinforcement contingencies char- 
acteristically associated with high- and low- 
delay behavior. In social training in our 
culture self-imposed delay-of-reward behavior 
is actively modeled, encouraged, and gener- 
ously rewarded by socialization agents, 
whereas immediate self-gratification is nega- 
tively reinforced on many occasions. To the 
extent that adoption by high-delay children 
of immediate self-gratification responses re- 
quires some reduction of previously estab- 
lished inhibitions, it would be expected that 
actual observation of a moderately prestigeful 
adult exhibiting low-delay behavior would 
have stronger disinhibitory cue value than a 
verbal description of his responses. This 
factor, if operative, would suggest that pre- 
dictions about the relative efficacy of live 
and symbolic models should consider not only 
the number and clarity of modeling cues 
associated with different modes of presenta- 
tion, but also the possibly greater inhibi- 
tory or disinhibitory influence of perform- 
ances by live models, 

An alternative explanation, and one that 
accounts for other obtained differences, is in 
terms of the temporal variations in reinforce- 
ment invariably associated with immediate- 
and delayed-reward responses. There is con- 
siderable empirical evidence that behavior 
can be more effectively maintained by im- 
mediate reinforcement than by delayed 
rewards (Renner, 1964). Consequently, self- 
imposed delay behavior, which is accom- 
panied by less favorable reinforcement condi- 
tions, would be harder to establish and even 
more difficult to maintain in the absence of 
intervening positive reinforcements, This in- 
terpretation would help to explain both the 
greater stability of changes in immediate- 
compared to delayed-reward behavior among 
subjects in the no-model conditions, and the 
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larger decrement in delay responses among 
subjects whose behavior was being altered in 
the direction of delayed gratification. ‘It will 
be recalled that, of the three groups in the 


latter condition, only the live-model-treat- | 


ment yielded a.statistically significant decre- 
ment in terminal delay behavior. Although 
subjects in the latter group continued to 
exhibit a higher level of delay behavior than 
they displayed in the preexperimental phase, 
nevertheless difficulty in maintaining the 
newly acquired behavior in the prolonged 
absence of the model, reduced the possibility 
of obtaining significant differences between 
modeling conditions, These overall findings 
point to the necessity for supporting newly 
established self-control behavior, particularly 
when it is associated with less optimal rein- 
forcement conditions as in the case of self- 
imposed denial of readily available rewarding 
resources, 

The results of this study also provide an 
interesting contrast between traditional psy- 
choanalytically based theories of personality 
and those derived from principles of social 
learning. According to the psychoanalytic 
theory of delay behavior (Freud, 1946; 
Singer, 1955), aroused impulses press for im- 
mediate discharge of tension through overt 
motoric activity. As a function of repeated 
association of tension reduction with goal ob- 
jects, and development of greater ego organi- 
zation, absence or imposed delay of satisfying 
objects results in the substitution of hal- 
lucinatory satisfactions, and other thought 
Processes that convert free cathexes into 
“bound cathexes.” The capacity to delay of 
inhibit motor discharge by substituting cath- 
ected ideational representations presumably 
reflects the gradual shift from primary-process 
activity to reality-oriented secondary-process 
thinking. ji 

The psychoanalytic approach thus leads 


one to seek determinants of delay behavior 


in terms of hypothetical internal events in the 
form of ego organizations and energy-binding 
ideations. In contrast, social-learning theory, 
as illustrated in both the findings of the 


present experiment, and those to which refer- | 
ence was made earlier, views manipulable | 


social-stimulus events as the critical’ determi- 
nants of self-controlling behavior. 


i 
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THE EFFECTIVENESS OF SOCIAL REINFORCERS ON PER- 


SISTENCE AND LEARNING TASKS FOLLOWING 
POSITIVE AND NEGATIVE SOCIAL 
INTERACTIONS * 


HERSHEL BERKOWITZ,? EARL C. BUTTERFIELD, ano EDWARD ZIGLER 


Yale University 


The extent to which a valence or a social drive position can account for dif- 
ferential effectiveness of social reinforcers following different forms of prelimi- 
nary E-S social interaction was assessed. Results obtained when both the 
preliminary interaction and ctiterion task were administered by the same E 
were compared with those obtained when Es were switched at the end of the 
preliminary interaction. The effects of positive and negative social interactions 
upon a learning and a persistence measure were compared when the criterion 
task was administered immediately after the preliminary interaction and when 
a 5-day delay intervened between the preliminary interaction and the criterion 
task. Both criterion tasks were administered to control Ss who had had no 
preliminary interactions with E. In general, the results obtained with the 
persistence measure supported the valence but not the social drive position. 


The findings obtained with the learnin; 


neither position. 


A number of recent investigations (Berko- 
witz, 1964; Berkowitz & Zigler, 1965; Erick- 
son, 1962; Gewirtz & Baer, 1958a, 1958b; 
McCoy & Zigler, 1965; Shallenberger & Zig- 
ler, 1961) have assessed the influence of vari- 
ous types of experimenter-subject interaction 
upon the experimenter’s subsequent effective- 
ness as a dispenser of social reinforcement, 
The results of these studies have been in- 
terpreted within two theoretical frameworks. 
One position asserts that social interaction 
between experimenter and subject affects the 
subject’s responsiveness to social reinforcers 
by altering his level of social drive. Gewirtz 
and Baer (1958a, 1958b), Erickson (1962), 
and Berkowitz (1964) have maintained that 
deprivation of social reinforcers increases 
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Public Health Service, which also contributed addi- 
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Nancy Miles who collected the data for this study. 
They also wish to thank the members of the Mil- 
ford, Connecticut, elementary school system for 
their cooperation and assistance in the course of 
this study. 

2Now at the Medical Center of the University of 
Colorado. 


ig Measure were equivocal and supported 


social drive leading to increased responsive- 
ness to social reinforcers, Conversely, social 
interaction with a warm and friendly experi- 
menter is viewed as drive satiating leading to 
a decrease in responsiveness to social rein- 
forcers. A second position (see Berkowitz & 
Zigler, 1965; McCoy & Zigler, 1965; Shallen- 
berger & Zigler, 1961; Zigler, 1961) has 
maintained that the nature of the social inter- 
action influences the valence which the ex- 
perimenter has for the subjects, that is, the 
subject’s attitude toward the experimenter, 
and that this valence or attitude determines 
Tesponsiveness to the social reinforcers the 
experimenter dispenses. Within this frame- 
work, a positive interaction is viewed as en- 
hancing and a negative interaction as attenti- 
ating the experimenter’s effectiveness as 4 
reinforcing agent, S 

These two interpretations have evolved 
from methodologically different studies. Find- 
ings in support of the drive interpretation 
employed reaction time or learning measures, 
while those in support of the valence position 
employed a persistence measure, that is, time 


spent with experimenter. It is possible that . 


these measures are sensitive to different as 
Pects of the social reinforcement process and 
that both the drive and valence positions 
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have a certain validity. Clearest support for 
the valence position has been found in studies 
where 4 period of time, that is, 7 days, inter- 
vened between initial contact with the experi- 


» „menter and tests for reinforcer effectiveness. 


a 
bn 


When a persistence task followed initial social 
contact immediately, a contrast effect between 
reinforcers administered during the experi- 
menter-subject interaction and those admin- 
istered during the criterion task was noted, 
that is, the effectiveness of positive social 
reinforcers was enhanced when they followed 
‘negative preliminary interactions. A 
The two positions have some markedly 
different implications. According to the drive 
position, the effect of experimenter-subject 
interactions is to change a particular motive 
state. One would conclude here that situa- 
tional variations which did not affect this 
motive state would not affect performance on 
the criterion task. The drive position asserts 
that partially satiating a child in social rein- 
forcers during a preliminary interaction re- 
duces his motivation for social reinforcement 
on aesubsequent criterion task. This change 
in motive state should manifest itself in sub- 
sequent performance even if the experimenter 
who collected the criterion scores were dif- 
ferent from the experimenter who had pre- 
viously interacted with the subject. In an 
immediate test, that is, no delay between 
preliminary interaction and criterion task, 
the prediction is that drive effects would oc- 
cur even if one experimenter were employed 
during the preliminary interaction and an- 
other experimenter during the criterion task. 
No predictions can be derived from the drive 
position for conditions involving a delay, 
since drive effects produced by a brief social 
interaction would be modified by the subjects’ 
everyday social experiences during the delay 
Interval, 
The valence position views the effect of an 
_ €xperimenter-subject interaction as changing 
the subject’s evaluation of the particular ex- 
. Perimenter, The prediction here is that sub- 
stantially different reinforcer effects would 
: be found if the experimenter who collected 
the criterion score were different from the 
experimenter who initially interacted with 
* the subject. This prediction can be made only 
for the delay condition since previous re- 
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search has indicated that contrast effects ob- 
scure valence effects in the immediate condi- 
tion. 

Both learning and persistence measures 
were employed in the present study in an* 
effort to evaluate the possibility that it was 
the differences in measures which gave rise 
to earlier divergent findings. Positive and 
negative preliminary experimenter-subject 
interactions were employed and both immedi- 
ate and delayed tests of reinforcer effective- 
ness were made. The above predictions were 
tested ‘by employing two experimenters and 
including experimenter switching and non- 
switching as a dimension of the study. The 
study, then, consisted of two 2 X 2 X 2 X 2 
(Positive-Negative Experimenter-Subject In- 
teraction X Immediate-Delayed Test X E- 
E X Experimenter Switching-Nonswitching) 
factorial analyses of variance, one for sub- 
jects receiving the learning task and one for 
subjects receiving the persistence task, In 
addition, switching and nonswitching control 
groups were run in which the only prior ex- 
perimenter-subject contact was that involved 
in conducting the subject to the experimental 
room. 


METHOD 
Subjects 


Subjects, numbering 120 boys and 120 girls, were 
chosen from the second grade of six Milford, Con- 
necticut, elementary schools. No child who had failed 
any grade was employed. The subjects were as- 
signed to one of three experimenter-subject interaction 
conditions. Ninety-six of the subjects met with an 
experimenter under a condition of positive prior con- 
tact (P) and 96 under a condition of negative prior 
contact (N). Half of the subjects in each of these 
groups were tested immediately after initial contact 
(I) and half were tested after a 5-day delay (D). 
Forty-eight control subjects (C) had no prior con- 
tact with the experimenter except that which oc- 
curred in the course of being conducted to the 
experimental room. Half of the subjects in each 
P, N, and C condition were tested by the same 
experimenter who had interacted with them previ- 
ously (half by Eı and half by Es) and for half of 
the subjects the experimenters were switched (half 
having an E:-E2 and half having an E-E, se- 
quence). Half of the subjects in each condition were 
assigned to the persistence task and half to the 
learning task. Assignment to the 32 experimental 
and 8 control groups (see Tables 1 and 2) was 
random with the following exceptions: an equal 
number of boys and girls was assigned to each 
combination of condition, measure, and experimenter ; 
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one subject from each of the six schools employed 
was assigned to each condition. 


Experimenters 


ə» The experimenters were two young women who 
had had considerable experience in conducting re- 
search with children, They had no knowledge of the 
authors’ hypotheses and did not view any collated 
data until collection was completed. While collecting 
data, the experimenters verbalized a variety of 
hypotheses, none of which was the same as those of 
the authors’ or which reflected the obtained findings. 

Great care was taken in training the experimenters 
in order to insure that procedures were correctly and 
uniformly administered. One of the authors (HB) 
maintained close contact with the experimenters 
throughout the experiment. At one point during the 
course of the study proper, E, expressed concern 
that she was not “being negative enough” to subjects 
in the N condition. Although no discrepancy between 
experimenters in the N condition was actually ob- 
served during either the training or the experiment, 
the data suggested that E, might well have been 
less negative than Fə. 


Apparatus 


Three games and a drawing activity (similar to 
those employed by Berkowitz & Zigler, 1965) were 
employed during the preliminary social interactions. 
The criterion tasks were the Marble-in-the-Hole 
persistence game employed by Berkowitz and Zigler 
and a two-color discrimination learning task. The 
latter measure was used in order to circumvent 
some of the difficulties, for example, position prefer- 
ences, which recently have been found (Parton & 
Ross, 1965; Patterson & Hinsey, 1964) with the 
typical preference-switching marble game as a learn- 
ing measure. Since the four preliminary tasks and 
the Marble-in-the-Hole Game are described in some 
detail elsewhere (Berkowitz & Zigler, 1965) a de- 
scription of only the learning apparatus will be in- 
cluded here. 

The learning apparatus consisted of a grey box, 
12 inches deep X 12 inches wide, the top of which 
sloped from 4 inches high at the side nearest the 
subject to 9 inches at the side farthest from him. 
On the top were two 1-inch round, ground glass 
windows, 4.5 inches apart with a lever 3 inches 
below each window. A Programmer changed the 
colors (red or green) appearing in these windows 
according to a randomly preestablished schedule. 
Following each Tesponse, that is, lever depression, the 
lights went out for 2 seconds, Lever depression ac- 
tuated an event recorder which recorded both color 
and position of the window under which the lever 
had been depressed, 


Procedure 


_ The experimenters met with all the subjects in- 
dividually, The experimenter conducted the children 
to the experimental room, chatting with those in the 
P condition, but Temaining silent with the N and C 


H. Berkowitz, E. C. BUTTERFIELD, AND E. ZIGLER 


subjects, except to answer questions. The experimenter 
sat across from the subject at a child’s desk. The 
experimenter gave instructions for the first preliminary 


game to subjects in the P and N conditions, Sub- 


jects in the C condition were given instructions for 
the criterion task. 

Preliminary games. These games were so con- 
structed that by giving the appropriate instructions, 
the experimenter could ensure that subjects in the 
P condition would make only “correct” responses, 
and subjects in the N condition only “incorrect” 
responses. In the Card Game, subjects were in- 
structed to attempt to pick only red cards from a 
deck spread out face down on the ‘table’ In the P 
condition, all of the cards were red and in the N 


condition all of the cards‘were black. In the Nail 


Board Game, subjects rolled marbles down an in- 
clined maze with two boxes at its bottom. The sub- 
jects were told that if the marble rolled into the 
box designated by the experimenter they would 
win, and that the marbles could roll into either box. 
Actually the marbles rolled into one box approxi- 
mately 95% of the time. This box was declared cor- 
rect for subjects in the P condition and incorrect for 
subjects in the N condition. In the School Game, 
subjects were told to guess which of two schools a 
group of children (represented by photographs) at- 
tended. The experimenter arbitrarily designated their 
choices as correct or incorrect. After the three games 
were completed, the experimenter said, “Now let’s 
try a drawing. Here are some crayons and # piece 
of paper, and you can make a picture of anything 
you like.” The P and N groups received positive and 
negative reinforcement, respectively, for their draw- 
ing activity. All subjects in the P and N groups were 
given the preliminary tasks in the order described 
above and played them for a total of 15 minutes. 

Experimenter-subject interaction. The experimenter's 
manner was warm and friendly in the positive condi- 
tion, and she praised the subjects’ performances on 
all the games. These subjects received both verbal 
and nonverbal support (ie., smiles and nods). 
Verbal support consisted of a series of reinforcing 
statements, made in random order during the four 
games, 

The experimenter’s manner was aloof and business- 
like in the negative condition. She gave the sub- 
jects neither verbal nor nonverbal support. She 
remained silent except to answer questions or t0 
deliver negative reinforcement. The experimenter 
employed a flat, indifferent tone of voice throughout 
this treatment. The subjects were exposed to verbal 
disapproval, in the form of negative statements, 
dispensed in random order, After completing the 
experiment, subjects in this treatment were urged 
to play once again. At this time they were lavishly 
praised for their performance, the experimenter at- 
tempting to interact with them in the same mannet 
as she did with the subjects in the P treatment. 
This procedure was carried out in an attempt t0 


aroused during the N treatment. Rii 
Control treatment, Subjects in the C group receive! 
no preliminary interaction but were simply C0 © 


reduce any negative feelings which might have been : 


| 
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ducted directly to the experimental room and were 
given the criterion task immediately.. p 

Delay, conditions. At the end of the 15-minute 
preliminary period, P and N subjects in the im- 
mediate (I) condition were introduced to Part I 
of the criterion task. Those in the delay (D) con- 
dition were returned to their rooms. Four daj 5 ater; 
these subjects were individually sent to the experi- 
mental room by their teacher. They again met with 
an experimenter and performed the criterion task. 

Switching conditions. In the switching-immediate 
and switching-control conditions, the first experi- 
menter conducted the subjects to the experimental 
room, administèred the appropriate experimenter- 
subject interaction in the N and P conditions, and 
then left the room, Thé second experimenter then 
entered and administered the criterion task. The 
subjects in the nonswitching conditions simply went 
én to play the criterion game without a change in 
experimenters. Subjects in the delay-switching con- 
ditions (the EEs D and ~Es-E: D conditions) 
simply met with a new experimenter upon entering 
the experimental room for the second time. 

Criterion tasks, It should be noted that each sub- 
ject was run on only one criterion task, All subjects 
were introduced to the criterion task in the same 
manner and received identical social reinforcers. 

Marble-in-the-Hole. Subjects assigned to this task 
were told to take marbles from an opening in the 
front of the apparatus one at a time, to drop 
marbl@s of one color into one of the two holes on 
the top of the apparatus, and marbles of a second 
color into the other hole. Subjects were told that 
they could place as many marbles in the holes as 
they wished and that they could stop whenever they 
wished. The opening was full of marbles throughout 
the game. 

After every tenth marble was inserted, the experi- 
menter randomly made one of the following four 
statements: “You really know how to play this 
game,” “That’s fine,” “You really know how to 
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play Marble-in-the-Hole, don’t you?” and “Very 
good.” She also smiled and nodded at the subject 
throughout the game. The subject played until he 
indicated that he wished to stop. At this point, the 

introduced Part II of the game, which 


experimenter 
was identical with Part I, except that the relation 


“ship between the marbles’ colors and the holes that 


they were inserted in was reversed. 

Light game. Subjects assigned to this task were 
told to try to turn off the two colored lights in the 
apparatus by pushing one of the two buttons beneath 
the two lights. 

The subjects’ initial 24 responses were not rein- 
forced providing a base-line period during which the 
experimenter determined which color the subject 
responded to least frequently. After the twenty- 
fourth trial, the experimenter reinforced every re- 
sponse to the least preferred color. The same rein- 
forcers were employed for this task as were employed 
for the marble game. (A partial reinforcement pro- 
cedure was to have been employed after a child 
made 10 consecutive correct responses. However, no 
child fulfilled this condition.) 


RESULTS 


The log time spent in the Marble-in-the- 
Hole (MITH) Game and the number of re- 
sponses to the reinforced stimulus in the Light 
Game are presented in Tables 1 and 2, re- 
spectively. Log times were employed for 
MITH because, as in other studies using the 
MITH task, means and variances of time 
scores were correlated. Preliminary analyses 
revealed no sex-of-subject effects for either 
score and, for purposes of clarity, the sex 
factor is not included in the following analyses. 


TABLE 1 


Loc Time (In SEconps) SPENT 
Each OF THE 


PLAYING MARBLE-IN-THE-HOLE BY 
Groups (N = 6) 


Immediate Delayed 
Control 
A Positive Negative Positive Negative 
M SD M SD M SD M SD M SD 
Same experimenter 
in both precondi- 
tion and criterion 
cot 46 2.28 181 2.29 158 1.92 250 
A 23. 5314. 4 2.59 iyt 5 5 ` ; { : 
i ee 2i 131 2.70 114 2.53 120 2.09 144 1.93 .225 
Different experi- 
Edy 116 2.23 288 2.05 225 2,29 184 
we 2. A21 2.11 A s ` Q : $ j 
EEr 206 119 2.16 156 2.20 176 2.30 155 2.11 .188 
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TABLE 2 
NUMBER OF REINFORCED RESPONSES DURING THE LIGHT GAME FOR 
Each OF THE Groups (N = 6) , 
a Immediate Delayed 
Control 
Positive Negative Positive Negative 
M SD M SD M SD M SD M SD 
Same experimenter 
in both precondi- 
tion and criterion à 
phases 
E 61.67 3.90 | 69.0, | 21.77 | 69.00 | 19.02 | 64.17 | 4.31 | 59.00 3.03 
E: 74.00 | 15.65 | 57.0 3.93 | 60.50 4.52 | 69.33 |° 20.10 | 68.33 | 18.94 
Different experi- 
menter 
E-E: 63.33 | 3.77 | 67.5 | 26.46 | 58.83 4.77 | 64.66 | 21.21 | 67.66 7.83 
E,E, 65.16 | 9.50 | 60.0 743 | 58.16 2.52 | 80.00 | 21.66 | 74.50 | 18.84 


MITH (Persistence Task) 


The primary analysis on the MITH data 
was a 2X2X2X2 (Delay X Condition 
X Switching X Experimenter) factorial anal- 
ysis of variance performed on the log-time 
scores (see Table 3). The significant main 
effect for switching reflected a shorter log 
time when experimenters were switched (M = 
2.12) than when they were not (M =2.37). 
The significant Delay X Condition and Delay 
X Condition X Experimenter interactions and 
the nearly significant Delay X Condition x 
Switching interaction are all pertinent to the 


TABLE 3 
ANALYSIS OF VARIANCE ON MITH 
Loc-Time Scores 
Source MS df F 

Total .1150 95 
Treatments (T) .1193 1 <1.00 
Delay (D) -0466 1 <1.00 
Switching (S) 1.5612 1 11.414 
Experimenters (E) .0034 1 <1.00 

XD .5612 1 4,10* 
TXS. .1442 1 1.05 
TXE -0003 1 <1.00 
DXS 0052 1 <1.00 
DXE 2554 1 1.87 
SXE 0085 1 <1.00 
TXDXS 4316 1 3.25 
TXDXE 5605 1 4,10* 
TXSXE 0416 1 <1.00 
DXSXE «0000 1 <1.00 
TXDXSXE 0364 1 <1.00 
Error (pooled) -1328 83 

*p <.05. 

+p <.01. 


predictions under test. In order to clarify 
these interactions, four 2 x 2 (immediate- 
delay—positive-negative) factorial analyses of 
variance were calculated. That is, one 2 Xx 2 
analysis was performed for each of the follow- 
ing four conditions: E;-E,, E,-E2, E,-E;, 
and E2-Es. e 

As predicted by the valence position, there 
were no significant effects of any kind (all 
Fs < 1) when experimenters were switched, 
that is, E,;-E. and E,-E,. Also consistent 
with the valence position, the only significant 
effect for the nonswitching analyses was the 
Delay X Condition interaction (F = 10.62, 
df = 1/20, p<.01) for EE» (There 
were no significant effects in the Hi-Fi 
analysis.) This significant interaction (see 
Figure 1) reflected the facts that log time was 
greater (¢ = 2.32, p < .05) under the nega- 


—————— 


tive-immediate than under the positive-im- — 


mediate condition; log time was greater (t = 
2.33, p < .05) under the positive-delay than 
under the negative-delay condition; and log 


time in the negative-immediate exceeded (t . 


= 3.35, p < .01) that of the negative-delay 


condition. In summary, the significant inter- 


actions reported in Table 3 reflected the fact 


that there were no significant effects when 2: , 


administered the positive and negative condi- 
tions but that when Æ, administered the 
positive and negative conditions, there was 4 
significant Delay X Condition interaction 


under the nonswitching condition and no sig- ` 


nificant effects under the switching condition. 
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It should be noted that while there were no 
statistically significant effects for E, the 
general pattern of her results, with the ex- 
ception of her negative-delay condition, is 


, similar to that of Æ> and consistent with the 


valence position (see Figure 1). 

In addition to the above analyses, a 2 X 2 
(Experimenters X Switching-Nonswitching) 
factorial analysis of variance was performed 
on the four control groups’ log-time scores. 
As expected, this analysis yielded no main or 
interaction effects. Therefore, the log-time 


f scores of all control subjects were pooled to 


yield one control group (M = 2.06, SD= 
406). The log time of the pooled control 
group was then compared to that of each of 
the 16 experimental groups by means of ¢ 
tests, It was found that none of the switching 
groups differed from the control group (larg- 
est £ = 1.19, p>.20). In the nonswitching 
condition, Z,’s negative immediate (¢ = 2.33, 
p < 05), Eys negative immediate (t = 3.68, 
p< .01), and Eys positive delay (¢ = 2.60, 
p < .02) group means all exceeded the con- 
trol roup mean, 


LOG TIME 
n 
w 
© 


DELAY 
IATE 


E, 


Fic. 1. MITH log times for Eı 
the pooled 


IMMED- POOLED DELAY 
CONTROL 
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Light Game (Learning Task) 


As a test of whether learning occurred, the 
scores of all experimental and control sub- 
jects run on this task were pooled, and a ¢ 
test was performed comparing the number of 
responses to the reinforced stimulus during 
acquisition (M = 66.8, SD = 10.36) to the 
number expected by chance (M = 60). At 
test was also performed comparing the mean 
number of responses averaged over trial blocks 
during acquisition (M = 2.25, SD = .314) to 
that quring the basal period (M = 3.13, SD 
= 271). Both #’s (7.23, p < .001 and 11.04, 
p < .001, respectively) were significant. 

Two primary analyses were performed. One 
of these analyses employed a conventional 
(Gewirtz & Baer, 1958a, 1958b) learning 
shift score. This score was derived by sub- 
tracting the number of responses made to the 
subsequently reinforced stimulus during the 
last basal trial block of eight responses from 
the number of responses to the reinforced 
stimulus during the acquisition trial block 
having the median number of responses to 
the reinforced stimulus. A 2xX2x2x2 


C] positive 
NEGATIVE 


IMMED- 
IATE 


E 

1 
and Es under nonswitching conditions and for 
control group, 
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C] positive 
NEGATIVE 


74 


72 


70 


68 


66 


64 


62 


60 


NUMBER OF REINFORCED RESPONSES 


DELAY CONTROL IMMEDIATE 


Fic. 2. Mean number of reinforced responses 
(adjusted for base preference) on the learning task 
for Es: 


(Delay x Condition x Switching x Experi- 
menter) factorial analysis of variance yielded 
no significant effects of any kind. A product- 
moment correlation indicated that stimulus 
preference during the basal period (number of 
responses to the least preferred, i.e., sub- 
sequently reinforced stimulus) was signifi- 
cantly correlated with learning shift r= 
—.51, $ < 005). Accordingly, a 2 X 2 X 2 
X 2 analysis of covariance which controlled 
for basal preference was performed. This 
analysis also revealed no significant effects 
of any kind. 

Since the groups were found not to differ 
in basal performance, a second primary 
analysis of the learning task data was per- 
formed on the total number of responses to 
the reinforced stimulus during acquisition. 
Because this score was also correlated with 
basal preference (r= —.25, P<.0l), a 
2X2X2X2 (Delay x Condition x Switch- 
ing X Experimenter) factorial analysis of 
covariance was performed controlling for 
basal preference. Because of the heterogeneity 
of the within-cell variances (see Table 2) the 
results of this analysis should be interpreted 
conservatively (Norton, 1952). This covari- 
ance analysis revealed a significant Condition 
X Delay X Experimenter interaction. In order 
to clarify this interaction, 2 x 2 (Immediate- 
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Delay X Positive-Negative) analyses of co- 
variance were performed separately for Ey 
and E>. No significant effects were found for 
E,. There was a significant Delay X Con- 
dition interaction for Es. Analysis by ¢ tests 
corrected for heterogeneity of variance on 
means adjusted for basal preference indicated 
that this interaction (see Figure 2) was due 
to the following: positive-immediate exceeded 
positive-delay (¢ = 2.63, p < .01), negative- 
delay exceeded negative-immediate (¢ = 4.13, 
$ < .005), negative-delay exceeded positive- 
delay (¢=3.92, p<.005), and positive- 
immediate exceeded negative-immediate (t= 
2.83, p < .005). 

In addition to the above analyses, a 2 X 2 
(Experimenters X Switching-Nonswitching) 
factorial analysis of covariance, holding basal 
preference constant, of the number of re- 
sponses to the reinforced stimulus made by 
the control subjects showed no significant 
effects. The number of reinforced responses 
for the control subjects were combined to 
form a pooled control group (M = 67.37, 
* = 202.59). The mean of the pooled .con- 
trol group was then compared to the means 
(adjusted for basal preference) of each of 
the experimental groups using ¢ tests cor- 
rected for heterogeneity of variance. These 
t tests showed that each of the following 
groups made significantly fewer responses 
to the reinforced stimulus than did the 
pooled control: ENI (t= 2.78, p < 01), 
EPD (t= 2.03, p<.05), E-EPD (t= 
2.44, p<.05), Eo-E;PD (¢=2.87, p< 
Ol), and EENI (t= 2.04, p <.05). 
No experimental group made significantly 
more responses than the control group. 


Discussion 


In general, the results obtained with the 
MITH task support the valence rather than 
the drive position. A rather sizable attenuation 
in playing time was found in the switching 
as compared to the nonswitching condition. 
This finding is consistent with the valence 


position which asserts that strangers arouse 


negative-reaction tendencies which depress 
their effectiveness as social reinforcing agents 
(cf. McCoy & Zigler, 1965). The valence 
position receives further support from Fs 
results which are quite similar to the findings 
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of Berkowitz and Zigler (1965). That is, E,’s 
playing times were greater in the positive- 
delay “than in the negative-delay condition. 
A further confirmation of the valence position 
is the absence of condition effects when experi- 
menters were switched. The drive position 
seems incapable of explaining these findings, 
particularly in the immediate condition. Fur- 
thermore, the drive position clearly suggests 
that playing time in the positive-immediate 
should ke shorter than those in the control 
condition. The findings of the present study, 
as well as those of Berkowitz and Zigler 
(1965) and McCoy and Zigler (1965), in- 


„dicate that this is not the case. All of these 


considerations seem to indicate that the 
valence rather than the drive position offers 
the most consistent explanation of the per- 
sistence data. 

No such clear support for the valence posi- 
tion was found in the results obtained by Ei. 
However, it should be noted that with the 
exception of the negative-delay condition, the 
pattern of her findings is similar to that ob- 
tained by Es. The fact that Er, during the 
course of the experiment and without knowl- 
edge of the results, had verbalized doubt about 
her ability to create a negative condition 
provides some explanation for the failure of 
her results to offer more clear-cut support for 
the valence position. 

The results obtained by both experimenters 
in the immediate condition lend support to the 
notion of a contrast effect advanced by a num- 
ber of other authors (McCoy & Zigler, 1965; 
Stevenson, 1961; Zigler, 1961). These au- 
thors posited a reinforcer contrast phenom- 
enon in which positive reinforcers in the 
criterion task take on enhanced value when 
they follow negative reinforcers. That E,’s 
findings in the immediate condition also sup- 
port the contrast interpretation, even though 
she did not produce strong valence effects 
in the delay condition, is further support for 
the view that the contrast and valence effects 
are independent of one another. This is also a 
further indication that contrast effects in the 
immediate condition are simply too strong 
to allow valence effects to manifest themselves. 
Further work is needed to determine the 
particular psychological processes that medi- 
ate this contrast effect. 
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Employing the same preconditions as those 
used in the present study, Berkowitz and 
Zigler (1965) found that playing times of 
children in the negative-delay condition were 
shorter than those in the control condition’ 
while the playing times of children in the posi- 
tive-delay condition were greater than those 
in the control condition. In the present study, 
playing times for E2s PD condition were 
greater than those in the C condition, but 
there was no significant difference between 
times in the ND and C conditions. The dif- 
ferences in the findings of the two studies 
are due to the shorter playing times obtained 
in the control condition of the present study. 
That is, ¢ tests showed that there were no 
significant differences between the playing 
times of the positive and negative conditions 
of the two studies (both #’s <1) but that 
playing times in the Berkowitz and Zigler 
control condition were significantly greater 
than those in the present study (#= 2.12, 
p< .05). 

It is possible that the difference between 
the control conditions of the Berkowitz and 
Zigler study and the present study’s control 
conditions is due to the sex of the experi- 
menters. Gilmore and Zigler (1964), employ- 
ing a male experimenter, found control group 
results similar to those of Berkowitz and 
Zigler (1965) who also used a male experi- 
menter. McCoy and Zigler (1965), employ- 
ing a female experimenter, obtained control 
group results similar to those of the present 
study. 

The results obtained with the two learning 
measures are equivocal and appear to give 
clear support to neither the drive nor valence 
positions. There were no significant effects of 
any kind when the conventional learning 
shift score was used. Another recent investiga- 
tion employing this score (Steward, 1965) 
also failed to find any significant results. The 
learning effects that were found were obtained 
with a learning score not previously employed 
in social reinforcement research. 

That this score resulted in more learning 
being found in the positive-immediate than 
the positive-delay condition is contrary to the 
drive position. That more learning was found 
in the negative-delay condition than in the 
positive-delay condition is contrary to the 
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valence position. This latter finding, in con- 
junction with the persistence findings, sug- 
gests that while a positive preliminary inter- 
action may lead to the child’s liking the 
adult and wanting to interact with him, it 
does not lead to the enhancement of the 
adult’s effectiveness in teaching the child a 
particular response. Indeed, it would appear 
that the anxiety created by a negative, as 
compared with a positive, preliminary inter- 
action and redintegrated following the delay 
period results in enhanced learning. It may be 
hypothesized that following this negative in- 
teraction the child is more sensitized to what 
the adult would like him to do. Following the 
positive interaction the child may feel that 
any of his responses are acceptable to the 
adult. Perhaps it is this attitude which leads 
to the positive-delay condition resulting in 
inferior learning to that found in the control 
condition. This superiority of the negative over 
the positive condition holds only in the delay 
condition. In the immediate condition, the 
positive interaction was found to be more 
conducive to learning than was the negative 
interaction. One possible explanation for this 
is that if the learning task is given immedi- 
ately after the anxiety-provoking negative 
interaction, the child’s level of anxiety is so 
high that it interferes with learning perform- 
ance. It is only when the anxiety is reduced 
by a delay period that it becomes more 
manageable and does not disrupt learning. 
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RACE AND BELIEF AS DETERMINANTS OF 


° BEHAVIORAL 


INTENTIONS* 


HARRY C. TRIANDIS anp EARL E. DAVIS © 
University of Illiniois 


300 Ss responded, on 12 semantic 


and 15 behavioral differential scales, to 


8 stimulus persons generated by all possible combinations of the characteristics 
Negro-white, male-female, pro- or con-civil-rights legislation. They also rated 
35 political and civil rights issues on semantic differentials. A factor analysis of 


responding to the stimulus persons. 


Ss, based on the covariances of their responses to variables, 
Ss. 2 of these types were strongly prejudiced. One type of S showed “conven- 
tional” prejudice; they were extremely sensitive 


defined 11 types of 


to the race component when 
Ss of the other type showed “belief” 


prejudice; they were particularly sensitive to the beliefs of the stimulus persons. 


The relative importance of the race 


atically with the degree of intimacy 
the Ss indicated they were willing t 


and belief components varied system- 
implied by clusters of behaviors which 
o undertake with the stimulus persons. 


Thus, the Ss’ behavioral intentions are a function of both personality and 
the type of behavioral intentions involved . 


Rokeach, Smith, and Evans (1960) hold 
that prejudice is in large part determined 
by perceived dissimilarity of belief systems 
rather than by ethnic or racial membership. 
To test this idea they asked their subjects to 
rate® on 9-point scales, stimulus persons dif- 
fering from themselves in race or religion, 
and belief, The ends of the scales were de- 
fined by the statements “I can’t see myself 
being friends with such a person,” and “I can 
very easily see myself being friends with 
such a person.” In this test it was found that 
the expressed friendship preferences were de- 
termined primarily by congruence in beliefs, 
rather than by race or religion. 

Triandis (1961) accepted these findings 
on the friendship variable, but argued that 
prejudice involved more than nonacceptance 
as a friend; it involved negative behaviors 
as well. Where large degrees of social distance 
are concerned, such as excluding a person 
from one’s neighborhood, Triandis felt that 


1 The data were collected in connection with Con- 
tract Nr 177-472, Nonr-1834 (36) with the Ad- 
vanced Research Projects Agency and the Office of 
Naval Research to study “Communication, Coopera- 
tion and Negotiation in Culturally Heterogeneous 
Groups” (F. E. Fiedler, C. E. Osgood, L. S. 
Stolurow, and H. C. Triandis, principal investi- 
gators). We are indebted to L. R. Tucker for con- 
sultations on the analysis of the data, and to M. 
Fishbein for critical comments on an earlier version 
of the manuscript. 


race rather than belief congruence was the 
critical determinant. In his study, belief con- 
gruence was measured by Morris’ (1956) 
“13 ways to live.” The subjects were asked to 
think of persons whose philosophy of life co- 
incided with their own most preferred and 
least preferred ways of life. The results 
showed that race accounted for about four 
times as much of the variance in social dis- 
tance judgments as did belief congruence, 
although both effects were highly significant. 

Rokeach (1961) replied to Triandis with 
the objection that the long and involved pas- 
sages of Morris’ ways to live were too vague 
and not sufficiently salient for the subjects. 
He concluded that his position had not actu- 
ally been challenged by Triandis’ data. Ac- 
cording to Rokeach, the major difference in 
method was that Triandis had used vague 
descriptions of stimulus persons, such as 
“Negro, same philosophy of life”; while 
Rokeach had used stimulus persons with spe- 
cific beliefs, such as “Negro who believes in 
God” or “white atheist.” 

Stein, Hardyck, and Smith (1965) pre- 
sented a study designed to reconcile these 
differences. They asked 44 white ninth- 
graders to complete a teen-age social distance 
scale in which the subjects indicated how 
friendly they felt toward white and Negro 
teen-age stimulus persons who were like or 
unlike themselves in values. The subjects had 
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indicated their own values 2 months earlier. 
Stein et al. obtained a strong effect on belief 
congruence and a lesser race effect. However, 
their scale was limited to positive items such 
as “invite home to dinner,” “have as member 
of social group,” “work on committee with,” 
etc. Triandis’ criticism of Rokeach applies 
equally to Stein et al. Prejudice involves 
negative behaviors as well as the lack of posi- 
tive behaviors. It is consistent with much 
previous research to say that friendship ac- 
ceptance is significantly determined by belief 
congruence, but that prejudice is a» more 
general phenomenon than the simple lack of 
friendly behavior. 

Part of the confusion in previous discus- 
sions stems from the assumption that preju- 
dice is a unidimensional construct. Triandis 
(1964b) challenged this assumption and pre- 
sented the results of several factor analyses 
which indicated that social acceptance (as- 
sumed here to be the opposite of prejudice) 
is a multidimensional construct. This finding 
has been supported by much additional re- 
search, including studies with student popula- 
tions of both sexes, Indian and American 
students (Triandis, Fishbein, & Hall, 1964), 
and Negro and white Illinois students (Davis 
& Grobstein *), 

Triandis’ (1964b) basic factors were ob- 
tained from factor analyses of behavioral 
differential scales, and were as follows: 

Factor I. Formal Social Acceptance with 
Superordination versus Formal Social Rejec- 
tion, defined by high loadings on items 
such as “I would admire the ideas Of OD 
would admire the character of,” “I would 
obey,” “I would cooperate in a political cam- 
paign with,” etc, 

Factor II. Marital Acceptance versus Mari- 
tal Rejection defined by high loadings on “I 
would marry,” “I would date,” “I would fall 
in love with,” etc. 

Factor III. Friendship Acceptance versus 
Friendship Rejection defined by high load- 
ings on “I would accept as an intimate 
friend,” “I would eat with,” “I would gossip 
with,” etc. 

Factor IV. Social Distance, defined by high 

2E, E. Davis and Nadine Grobstein, “Multi-mode 


Factor Analysis of Inter-personal Perceptions,” ONR 
technical report (in preparation). 
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loadings on “I would exclude from the neigh- 
borhood,” “I would prohibit admission to my 
club,” “I would not accept as a close kin by 
marriage,” etc. 


Factor V. Subordination, defined by high | 


loadings on “I would obey,” “I would not 
not treat as a subordinate,” “I would be 
commanded by,” etc. 

Triandis’ (1964b) analysis permitted the 
computation of the relative variance ac- 
counted for by the race, age, occupation, sex, 
and religion of the stimulus person as de- 
terrhinants of responses on each of these 
factors. It was found that occupation was, by 
far, the most important determinant for 
Formal Social Rejection® (Factor I) and for 
Subordination (Factor V). Incongruence of 
sex and age, followed by race and religion, 
were the primary determinants of variance 
for Friendship Rejection (Factor III). For 
Social Distance (Factor IV), the variance 
was almost completely determined by race. 

Thus, the findings of the studies described 
above can be reconciled. Rokeach’s (1961) 
data involved the friendship factor; so, did 
those of Stein et al. (1965), Byrne and Wong 
(1962), and other studies in which belief dis- 
similarity emerged as the major. variable. 
Triandis’ (1961) data were more general, 
since his Social Distance scale included both 
friendship and negative behaviors. His data 
were primarily determined by responses to 
the item “exclude from the neighborhood,” 
which about 55% of his subjects indicated 
they would do when the stimulus person was 
a Negro. As a result, race explained most of 
the variance. The fact that different items 
Operate in different ways can also be seen 
in Table 5 of Stein et al. (1965). For the 
items “invite home to dinner,” “live in the 
same apartment house,” and “date my sister/ 
brother,” a substantial amount of. variance 
was accounted for by race, in some cases 


more than that accounted for by belief dis- į 


similarity. 

The difficulties with the replication of the 
Byrne and Wong (1962) study, encountered 
by Byrne and McGraw (1964), and the 


*Since the scales were scored such that a high 
score means rejection, we shall refer to these five 
factors by their negative poles for the remainder 
of the article. 


: 


| 
i 


k 


ree 


DETERMINANTS OF BEHAVIORAL INTENTIONS 


methodological issue raised in the latter 
paper, are clariñed by the data of the study 
presented here. 

This study offers additional data on the 


, influence of both belief and race dissimilarity 


in determining social perception of stimulus 
persons. The data were collected in a pretest 
of attitudes toward civil rights issues and 
Negroes and whites who favor or oppose 
strong civil rights legislation. The subjects 
were 300 white, male University of Illinois 
students. Though the data were collected for 
another purpose, that is, to provide a basis 
for assigning subjects to different experi- 


mental conditions in a study of negotiations 


between Negroes and whites on civil rights 
issues (Davis & Triandis, 1965), they also 
permit clarification of the conditions under 
which either race or belief dissimilarity are 
the important determinants of social accept- 
ance or rejection, 


METHOD 
Subjects 


White, male, University of Illinois students, num- 
bering 300, from the introductory psychology pool 
were tested for 1 hour in February 1964. 


Questionnaire 


Eight stimulus persons were presented. These rep- 
resented all combinations of Negro-white, male- 
female, and favorable toward strong civil rights 
legislation—or opposed to major changes in status 
quo on civil rights. The subjects were asked to rate 
the stimulus persons on 12 semantic differential 
(SD) scales (Osgood, Suci, & Tannenbaum, 1957) 
and 15 behavioral differential (BD) scales (Triandis, 
1964b). ‘The scales were selected so as to represent 
tHe classic evaluation, potency, and activity factors 
of the semantic differential and the five factors of 
the behavioral differential, isolated in the Triandis 
(1964b) study. In addition, the subjects responded 
to 10 positive and 10 negative (reversed) F Scale 
items; 10 tolerance for minorities items and 10 criti- 
cism of’ social institutions items, from the factor 
analysis of atttitudes performed by Davis, Wrigley, 
and Costelein (1960). The F Scale items were fur- 


. ther used to compute measures of acquiescence re- 


sponse set and authoritarianism controlled for 
Tesponse set, in the manner described by Triandis 
and Triandis (1962). 

In another part of the questionnaire, subjects 
were asked to rate 35 issues and terms on five scales 
from the evaluative factor of the semantic dif- 
ferential, as well as on the scale “Gmportant- 
unimportant.” The 35 issues and terms were selected 
from a much larger list after factor analysis, and 
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represented key political issues, as well as the major 
concepts involved in the civil rights debate of 1964. 
The Civil Rights Act of 1964 was before the United 
States Senate at the time of the testing, and received 
good coverage in both the student newspaper and 
the local press. £ 

The total number of variables obtained from the 
questionnaire was 140. They can be broken down as 
follows: 8 stimulus persons rated on 3 semantic dif- 
ferential and 5 behavioral differential factors, totaled 
64 variables; 35 issues and terms, rated on 2 factors 
(evaluation and importance), added another 70 vari- 
ables. The two Davis et al. (1960) scales, two F 
Scale measures, and the response acquiescence and 


authoritarianism measures provided 6 more variables. 
« 


RESULTS 
Analysis of the Stimulus Person Elements 


The argument that different characteris- 
tics of the stimulus persons are important 
in determining the “behavioral intentions” 
(Triandis, 1964a) of subjects is supported 
by examination of the matrices of correla- 
tions among the eight stimulus persons for 
the various behavioral intentions. Each of the 
five factors of the behavioral differential is 
supposed to form a unidimensional response 
continuum, If this is the case, then the pat- 
tern of correlations among the eight stimulus 
persons should form a Guttman simplex. A 
simplex involves a pattern of intercorrelations 
among variables in which the columns and 
rows are arranged so that the cells with the 
highest correlations are nearest the diagonal, 
and the values of r decline as a function of 
the cell’s distance from the diagonal. 

Foa (1965) has provided a detailed expla- 
nation of such simplex patterns and has indi- 
cated how to measure the relative importance 
of the elements entering into a set of inter- 
correlated variables. It is possible to see 
intuitively that the most important element 
will segregate the variables most clearly, the 
second most important element next most 
clearly, etc. For example, consider Table 1a, 
which presents the intercorrelations of the 
eight stimulus persons on the Formal Social 
Rejection factor (above the diagonal) and 
the Friendship Rejection factor (below the 
diagonal), The ordering of the column and 
row titles indicates the dimension which has 
determined the simplex pattern. On the 
Formal Social Rejection factor, belief is quite 
clearly the element which orders the pattern. 
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TABLE 1 
INTERCORRELATIONS OF THE STIMULUS PERSONS FOR EACH OF THE SCALE FACTORS 
la. Formal social rejection with subordination (above the diagonal) and friendship rejection (below the diagonal) 
2 
NMP NFP WFP WMP NFC NMC WFC WMC 
NMP 100 83 75 71 31 28 —19 —22 NMP 
NFP 79 100 76 72 42 33 —16 —17 NFP 
NFC 68 77 100 72 27 21 —06 —13 WFP 
NMC 69 66 77 100 20 19 —07 —05 WMP 
WMP 50 47 40 38 100 79 39 42 NFC 
WFP 44 44 36 37 47 100 42 44 NMC 
WFC 17 12 25 32 43 30 100 77,» WEC 
WMC 12 10 27 33 47 25 63 100 WMC 
NMP NFP NFC NMC WMP WFP WEC WMC 
1b, Marital rejection (above diagonal) and social distance (below diagonal) 
NMP NMC WMP WMC NFC NFP WFP WFC 
NMP 100 58 23 21 08 14 —11 —12 NMP 
NFP 75 100 33 30 08 05 —18 05 NMC 
NFC 67 87 100 59 06 06 —07 —02 WMP 
NMC 66 82 87 100 01 04 —03 03 WMC 
WFP 36 40 31 34 100 87 23 —02 NFC 
WMP 31 36 31 33 48 100 23 —10 NFP 
WMC 12 09 il 17 21 45 100 29 WEP 
WFC 04 10 13 18 29 32 49 100 WFC 
NMP NFP NFC NMC WFP WMP WMC WFC ë 
1c. Subordination (above diagonal) 
NMP NFP NMC NFC WFP WNP WMC WFC 
100 70 63 58 51 50 35 28 | NMP 
100 65 77 64 52 39 36 | NEP 
100 70 52 53 56 49 | NMC 
100 58 45 45 47 NFC 
100 54 42 55 WFP 
100 69 41 WMP 
100 58 WMC 
100 WFC 


Note.—Decimals omitted. N = negro, M = male, P = Pro strong civil rights legislation, W = white, F = female, C = cón 


strong civil rights legislation, 


If one looks for the common element in the 
top four and bottom four stimulus persons 
(designated, in this case, along the top row 
and right-hand column of the table), one can 
see that the correlations among stimulus 
persons having strong anti-civil-rights_ atti- 
tudes are positive and high, and the correla- 
tions between stimulus persons having pro- 
and those having anti-civil-rights attitudes are 
generally negative. The next most important 
element is found by looking at the “doublets” 
formed by adjacent stimulus persons, and 
discovering common elements in the four 


doublets. In this analysis, belief is seen to 
be the most important element for Formal 
Social Rejection and race the second most 
important one. 

In contrast, the Friendship Rejection fac- 
tor shows a clear ordering by race, followed 
by belief. For the Marital Rejection factor 
(Table 1b, above the diagonal) the most im- 
portant element is sex (which is hardly sur- 
prising!), but the next most important ele- 
ment is race. For the Social Distance factor 
(Table 1b, below the diagonal) the most im- 
portant element is race, followed by belief. 
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For Subordination (Table 1c, above the di- 
agonal) it is also race, followed by belief. 
Thus, &lthough both race and belief are im- 
portant, race is more important for most of 
the factors of behavioral intentions, especially 
those behaviors that are more intimate; belief 
becomes more important in the less intimate 
behaviors of the Formal Social Rejection 
factor. 


Analysis øf the Subject Factors 


ae 


The 140 variablese derived from the fe- 


` sponses of the 300 subjects were analyzed by 


the Tucker and Messick (1963) procedure 
for factoring persons. This technique results 
in subject factors determined by the covari- 
ances of the subject’s responses to the vari- 
ables. Five subject factors were obtained by 
varimax rotation of the principal axis factors 
of subjects. An interpretation of the sub- 
sequent factors was made possible by the 
inspection of the transformed characteristic 
vectors of the variables. 

The first, third, and fourth subject factors 
were relatively unimportant for our purposes. 
The first factor grouped subjects who had a 
tendency to check the extremes of scales; 
subjects low on this factor had a tendency 
to give nonextreme responses. The extreme 
checkers were somewhat more prejudiced 
than their opposites, but the tendency was 
not very strong. The third and fourth factors 
separated subjects on political, rather than 
racial issues, and will not be considered here. 
, The second and fifth factors, which sepa- 
tated subjects on the basis of prejudice, will 
be discussed in some detail. Although both 
of these types were prejudiced, there were 
also important differences between them. 

Subjects loading high on Factor IT were 
the “conventionally prejudiced”; that is, 
these subjects tended to reject Negro stimulus 


: persons regardless of the latter’s beliefs on 


civil rights, They tended to consider most 


- civil rights issues as unimportant. They indi- 


cated a willingness to “be commanded by” 


_. whites who were “opposed to changes in the 


status quo on civil rights”; they did not 


. show such a willingness with respect to Ne- 


groes of the same belief. Subjects who were 
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high on Factor II, as compared with those 
low on this factor, were significantly more 
authoritarian (F Scale corrected for acquies- 
cence), less tolerant of minorities, and less 
critical of established social institutions‘ 
(Davis et al., 1960, scales) than the lows on 
this factor. They disapproved of a federal 
law against housing discrimination, the es- 
tablishment of biracial committees, and the 
enforcement of the Supreme Court decision 
on school desegregation. They also negatively 
evaluated integrated hospitals, Negro school 
administrators, and other Negro professionals. 
They indicated that segregated schools, seg- 
regated hospitals, segregated housing, and the 
concept of civil rights itself were neither good 
nor bad. They responded very negatively, 
however, to a Negro president, freedom rides, 
freedom marches, sit-ins, and interracial 
marriages. 

This picture continued in other data which 
had been collected. Factor II highs felt that 
integrated housing would lead to a decline of 
property values and to slums; however, they 
did not feel that segregated schools would 
lead to a lack of freedom. Interracial mar- 
riage was regarded as leading to lower morals, 
lack of progress, a bad society, more con- 
flict and “mongrelization” of the races. They 
felt that a Negro president would result in 
a depression and injustices to whites. Thus, 
the highs on Factor II were classic cases of 
interracial prejudice. 

By contrast, the lows on Factor II tended 
to think of civil rights issues as very impor- 
tant and good. They were liberal on inter- 
racial marriage, low on the F Scale, tolerant 
of minorities, and critical of social institu- 
tions. They approved of a federal law con- 
cerning housing discrimination, of biracial 
committees, of enforcement of the Supreme 
Court’s decision on school desegregation, of 
integrated hospitals, Negro professionals, etc. 
They were opposed to separate but equal 
accommodations. However, they did show 
evaluations slightly below the theoretical 
mean for a Negro president, freedom rides, 
freedom marches, sit-ins, and interracial mar- 
riages (although, of course, their mean re- 
sponses were significantly higher than the 


high Factor II’s). 
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Thus, the contrasting picture of the highs 

and lows on Factor II was quite consistent 
with common-sense notions about the atti- 
tudes of prejudiced subjects. 
” The highs on Factor V behaved as Rokeach 
would predict. They responded primarily to 
the belief of the stimulus persons, rather than 
to their race. They rejected stimulus persons 
who were in favor of strong civil rights 
legislation, and approved of persons who were 
against changes in the status quo on civil 
rights, regardless of the race of the stimulus 
person, These ratings were clear in both the 
behavioral and the semantic differentials, In 
other words, these subjects indicated that 
they considered anti-civil-rights persons as 
“good,” and they were willing to undertake 
favorable behaviors with them regardless of 
their race. Thus, these subjects would appear 
to be “belief prejudiced.” 

While the highs on Factor II tended to 
regard civil rights issues as relatively unim- 
portant, the highs on Factor V tended to 
regard them as very important, They con- 
sidered the John Birch Society, separate but 
equal accommodations, the State Rights 
Amendment to the Constitution, segregated 
schools, segregated hospitals, segregated 
housing, and interracial marriages to be im- 
portant, High Factor V subjects were clearly 
opposed to a federal law concerning housing 
discrimination, and also to socialized medi- 
cine, disarmament, federal enforcement of the 
Supreme Court school desegregation decision, 
freedom marches, sit-ins, interracial mar- 
riages, etc. They positively evaluated segre- 
gated housing. These high Factor V subjects 
had, however, a certain similarity with high 
Factor II subjects in being high on authori- 
tarianism, low on tolerance for minorities, 
and low on criticism of social institutions, 

The lows on Factor V were very similar 
to the lows on Factor II. 

Thus, two types of Prejudiced subjects 
have been isolated. Subjects high on Factor 
II are conventionally prejudiced in that they 
reject people primarily on the basis of race; 
they also consider civil rights and many 
political issues as quite unimportant, Subjects 
high on Factor V show belief prejudice and 
consider most issues as very important. 


Harry C. Trianpis AND EARL E. Davis 


Weights Given to Race and Belief 


Description of the two parameters. We | 


have already indicated that the relative in- 
fluence of race and belief in determining 
behavioral intentions depends upon two pa- 
rameters: the response continuum and the 
subject type. 

As described above, the six response con- 
tinua with which we will be dealing here are 
the evaluative factor of the semantic dif- 
ferential and the five behavioral differential 
factors isolated by Triandis (1964b). 

The two kinds of prejudiced subjects iso- 
lated by a factor analysis of persons have 
been described above. The problem of select- 
ing actual subjects representative of these 
two factors, for purposes of further analysis, 
is complicated by the fact that these two 
subject types, although different in certain 
respects, also have a certain amount of simi- 
larity. (The correlation over all 300 subjects 
between the subjects’ loadings on Subject 
Factor II, and their loadings on Subject 
Factor V was .54.) Thus, we adopted the 
following procedure. We first selected the 50 
highest loading, and the 50 lowest loading 
subjects for each of the two factors, regard- 
less of possible overlap. We then placed those 
subjects who did overlap in a separate cate- 
gory, and thereby arrived at pure highs for 
each of the two factors, pure lows for each 
factor, as well as a group of mixed highs, 
and a group of mixed lows, that is, subjects 
who loaded high on both Factors II and V, 
or loaded low on both of these factors. In 
addition, we selected a group of subjects whe 
were neutral, that is, in the middle range on 
all subject factors, Thus, as a basis for fur- 
ther analyses of the effect of subject type 
in determining the relative importance of 
race and belief, we have seven types of sub- 
jects: high Factor IT, low Factor II, neutral, 


high Factor V, low Factor V, mixed highs, ” 


and mixed lows. 

Analysis of variance and the two parame- 
ters. The weights given to race and belief 
may be seen by considering the sums of 
Squares and percentages of variance Con- 
trolled by race and belief in analyses of vari- 
ance based on composite scores for each 
subject type on the semantic and behavioral 
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TABLE 3 


PERCENTAGES OF VARIANCE FOR RACE AND BELIEF 
AS A FUNCTION OF SUBJECT TYPE 


5 Response continuum 
Subj i Friendshi; ial di 
type | edoan | AP | Social distance 
Race | Belief | Race | Belief | Race | Belief 
High II | 29.3 | 7.5 | 44.4 8 | 68.4 | 11 
Neutral | 4.2 | 3.5 | 104] .0 | 24.5 2 
High V | 2.8 | 12.3 | 13.7 | 3.0 | 43.9 | 3.1 


differential factors. Table 2 presents the re- 
sults of these analyses of variance for the 
six response continua and the seven subject 
types. 

If one notes the percentages of variance 
given in Table 2, one can see that these 
percentages (weights) have a systematic rela- 
tionship to both the type of subject and the 
response continuum., In comparing Factor II 
highs with Factor V highs, it may be seen 
that the former consistently give more weight 
to race than do the latter across all the re- 
sponse continua. Conversely, it may be seen 
that Factor V highs consistently give more 
weight to belief than do Factor II highs 
across all the response continua. These dif- 
ferences between factor types are particularly 
evident for subjects who are high (preju- 
diced) on one or the other of these two 
factors. However, even the lows tend to con- 
firm the difference between the two types. 
As may be seen, Factor V lows consistently 
give more weight to belief than do Factor II 
lows across all continua. 

In general, low (unprejudiced) subjects 
assigned more weight to belief than to race. 
Tt is quite reasonable to expect that, whereas 
unprejudiced subjects would not discriminate 
on the basis of race, they would tend to 
reject stimulus persons who are anti-civil- 
rights, especially if they are white, 

The manner in which both subject type 
and response continuum determine the rela- 
tive weights given to race and belief is per- 
haps best made clear by focusing attention 
upon selected and simplified portions of 
Table 2. Tables 3 and 4 Present such ex- 
cerpts. 
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Table 3 illustrates how the weight (per- 
centage of variance) given to race and be- 
lief is a function of the subject type. The 
relative weights for race and belief are then 
given for three response continua. These are 


Formal Social Rejection, which is the least 


intimate of the BD factors; Friendship Re- 
jection, which seems to be intermediate in 
intimacy; and Social Distance, which appears 
to be a more intimate factor and one also 
involving negative behaviors. On the Formal 
Social Rejection continuum high Factor II 
subjects show race accounting for approxi- 
mately four times as much variance as be- 
lief; for high Factor V subjects on the other 
hand, belief accounts for more than four 
times as much variance as does race, For 
neutral subjects race and belief are of nearly 
equal importance on this continuum. 

Looking at Friendship Rejection, which is a 
somewhat more intimate response continuum, 
it can be seen that race in general accounts 
for a somewhat larger amount of the vari- 
ance. Nevertheless the differences between 
Factor II and Factor V subjects are appar- 
ent. For Factor II subjects race accounts for 
an overwhelming percentage of the variance; 
and for Factor V subjects, although race is 
still more important than belief, the relative 
difference is not nearly so great. Furthermore, 
belief accounts for a considerably larger per- 
centage of the variance for Factor V subjects 
than for Factor II subjects. 

With Social Distance the tendency for race 
to account for a larger percentage of the 
variance than belief manifests itself eve? 


TABLE 4 


PERCENTAGES OF VARIANCE FOR RACE AND BELIEF 
AS A FUNCTION OF RESPONSE CONTINUUM 


Subject type 
Response continuum Mixed highs Neutrals 
cee 
Race | Belief | Race | Belief 
Evaluation 1.6 | 51.0 0 2a 
Formal social rejection | 22.0 | 33.0 | 42 | 3 4 
Subordination 241 | 31 | 61 0 
Friendship rejection 47.6 3.6 | 10.4 2 
Social distance 63.4 | 4.0 | 24.5 0 
Marital rejection 25.4 1.3 | 18.9 : 
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more strongly over all subject types. How- 
ever, the ratio of race to belief is considerably 
greater” for Factor II subjects than for Fac- 
tor 'V subjects. Once again belief accounts 


„for somewhat more of the variance for Factor 


V subjects than in the case for Factor II 
subjects. It is interesting to note that for 
the neutral subjects race becomes increas- 
ingly important with the intimacy of the re- 
sponse continuum, but its weight tends to be 
somewhere between those for high Factor IT 
and high Factor V subjects. 


Table 4 illustrates® the weights given “to 


race and belief as a function of the response 


continuum, Along the left-hand column all 
six response continua are listed in what ap- 
pears to be ascending order of intimacy. The 
importance of the response continuum is il- 
lustrated here on the basis of two subject 
types, namely, mixed highs (i.e., subjects 
who had high loadings on both Subject Fac- 
tors II and V) and neutrals. For both of 
these subject types it can be seen that with 
increasing degrees of intimacy the relative 
importance of race and belief varies system- 
atically, whereby race becomes increasingly 
important and belief decreasingly important. 
This is particularly illustrated in the case of 
the mixed high (prejudiced) subjects. The 
intimacy of Marital Rejection may be as- 
sumed a priori; one can see that the relative 
importance of race, in comparison with be- 
lief, is greatest here (race accounts for ap- 
proximately 20 times as much variance as 
does belief, compared with a ratio of about 
15:1 for Social Distance, the next most inti- 
mate continuum). The only reason why the 
absolute percentage is not higher is that sex 
accounts for about 25% of the variance 
(which is to be expected). Furthermore, the 
Sex X Race interaction is highly significant. 

Although both of the subject types in Table 
4 show roughly the same trend with respect 


to the relative importance of race and belief 


over the response continua, it is important 


* to note the differences in the absolute size of 


the weights, reflected in the sums of squares 
in Table 2. As may be seen, these sums of 
squares for the prejudiced subjects are of a 
much greater magnitude than is the case with 
the neutral subjects. 
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Discussion 


Previous researchers in the area of inter- 
group relations have assumed that social dis- 
tance statements have similar response char-« 
acteristics. The present study has presented 
data which show that this view is too simple. 
The race and belief characteristics of stimu- 
lus persons acquire different weights depend- 
ing upon both the nature of the items to 
which the subjects are responding and the 
type of subject making the response. 

Specifically, the Social Distance and Mari- 
tal Rejection factors are responded to in the 
manner described by Triandis (1961), and 
Triandis and Triandis (1960, 1962). The 
Evaluative and Formal Social Rejection fac- 
tors are responded to more in the manner 
described by Rokeach et al. (1960); this is 
particularly true in the case of unprejudiced 
subjects. The Friendship Rejection and Sub- 
ordination cluster of behaviors is responded 
to either on the basis of race, or on the basis 
of belief, depending upon whether the sub- 
jects are high or low on general prejudice. 
Those subjects who are prejudiced respond 
to the race component. Data show that both 
race and belief are important in the case of 
highly prejudiced subjects responding to the 
Formal Social Rejection factor, but that be- 
lief becomes less important in the other re- 
sponse continua. This finding is consistent 
with the arguments of Stein et al. (1965). 

The relative weights given to race and 
belief are a function of both personality (the 
type of subject) and the response continua. 
Two types of subjects were isolated through 
factor analysis. One type was a “conven- 
tionally” prejudiced subject who responded 
much more to the race than to the belief of 
the stimulus person. The other type was a 
“pelief”? prejudiced type who was more con- 
cerned with the belief of the stimulus persons 
than their race. The conventionally preju- 
diced subjects were strikingly different from 
the belief prejudiced subjects on all of the 
response continua of the behavioral differen- 
tial, giving a much larger weight to race than 
did the belief prejudiced subjects. 

Examination of the three clusters of be- 
haviors which are responded to differently 
reveals that the Social Distance—Marital Re- 
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jection cluster includes the most intimate be- 
haviors, Acceptance in the neighborhood, as 
a close kin by marriage, acceptance for mar- 
riage and acceptance for dating, are the typi- 
“cal behaviors, A second cluster involves the 
Friendship Rejection and Subordination be- 
haviors, Here we find behaviors such as eat- 
ing with, accepting as intimate friend, as well 
as behaviors such as treating as a subordi- 
nate. These behaviors appear to be inter- 
mediate in intimacy and social commitment. 
The final cluster, Formal Social Rejection, 
includes behavior such as admire the ideas 
of, admire character of, elect to political of- 
fice, etc. Evaluation of the stimulus persons, 
that is, whether they are good, clean, fair, 
etc., shows response characteristics which are 
similar to those of Formal Social Rejection. 
It is clear that the Formal Social Rejection 
cluster of behaviors includes the least inti- 
mate behaviors, and this factor, together with 
the semantic differential evaluative factor, 
involves a minimum of social commitment. 

Thus, the results of the present investiga- 
tion show that the more intimate the be- 
havior, the larger is the weight given to the 
race component. In the case of the most in- 
timate behaviors, all subjects, regardless of 
whether they are classified as prejudiced or 
unprejudiced, give social distance responses. 
In the case of the least intimate behaviors, 
most subjects give unprejudiced responses. In 
the case of the behaviors that are inter- 
mediate in intimacy, there is a clear sepa- 
ration of the prejudiced from the unpreju- 
diced subjects, as well as of the convention- 
ally prejudiced from the belief prejudiced 
subjects, 

A possible theoretical interpretation of 
these findings suggests itself, The norms of 
behavior in our society are most clearly speci- 
fied in the case of intimate behaviors and 
are less clearly specified in the case of less 
intimate behaviors, Thus, the normative re- 
jection of persons who are racially different 
is found in its strongest form in the case of 
intimate behaviors, By contrast, norms may 
play a smaller role in the case of nonintimate 
behaviors. In this latter case, personality and 
previous experience, as well as the total atti- 
tude structure of the individual, may control 
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most of the variance of social rejection or 
acceptance. 

Finally, the Rokeach (1961)—-Triandis 
(1961) controversy would appear to be re. | 
solved by the specifications of two parame- 
ters. In the case of intimate behaviors, 
Triandis’ arguments appear correct; in the 
case of nonintimate behaviors, Rokeach’s ar- 
guments appear correct; in the case of be- 
haviors intermediate in intimacy, both race 
and belief are important. However, some sub- 
jects behave more as described by Triandis 
and other subjects more as described by 
Rokeach, The first type, the conventionally 1 
prejudiced, is distinguished from the second ` 
type, the belief prejudiced, particularly on | 
those behaviors that are intermediate in ; 
intimacy. 
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DIFFERENTIAL COMMUNICATION OF AFFECT BY 


HEAD AND B 


4 experiments demonstrated that head and body nonverbal cues provide dif- 
ferent affective information. Head cues carry information primarily about what 
particular affect is being experienced, and relatively little about intensity of 
affect or level of arousal. Body cues reverse this pattern, communicating in- 
formation primarily about level of arousal or degree of intensity of an affective 


experience, but relatively little about what 
Photographic stimuli were drawn from 


photographs were rated on Schlosberg’s 3 dimensions of emotion. In each 
experiment 3 separate groups of judges each viewed 1 cue version: head, body, 


or whole person (head and body), 


In recent studies of expression through 
body movements and facial expressions, con- 
sistent results contrast sharply with the hap- 
hazard and contradictory earlier findings. In 
a review of the research of the preceding few 
decades Bruner and Tagiuri (1954) incisively 
stated the methodological problems which 
had impeded progress. While confused results 
prevented drawing conclusions about the po- 
tential richness of nonverbal behavior as a 
source of information, what evidence there 
was seemed to suggest that nonverbal be- 
havior is a meager source of information, with 
agreement in interpretation of the behavior 
far exceeding accuracy of judgment, and ac- 
curacy, when found, usually dependent upon 
knowledge of the situation, An experiment by 
Giedt (1955) confirmed this impression; he 
found that judges of patient interviews made 
far more accurate decisions when they read a 
typescript than when they saw a silent film, 
and showed no benefits when exposed to a 
sound film, 

While the methodological problems are far 
from solved, and indeed many of them are 
still often ignored, the new spate of more 
encouraging studies have shared one feature: 
focusing on spontaneously occurring inter- 
active nonverbal behavior. Most of the earlier 


1This project was supported by Postdoctoral 
Research Fellowship MF 6092 and by Research Grant 
MH 07587 from the National Institute of Mental 
Health, United State Public Health Service. The 
author is indebted to Sanford Autumn and George 
Stone for their many helpful suggestions about the 
research design and preparation of the manuscript. 
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ODY CUES* 


t particular affect is being experienced. 
5 standardized stress inte;views, All 


studies had sampled an individual’s be 
in isolation, or ignored the relevant int 
tion with the experimenter; usually the st 
ject was asked to pose a feeling, or 
taneous behavior was evoked by a novel 
bizarre stimulus; and, frequently the subj 


recent experiments have sampled nonverb 
behavior as it occurs during an ongoing, vel 
ally communicative, interpersonal relatio 
ship. S 

Direct measures of nonverbal behavior hav 
been related to independent measures 
mood (Dittmann, 1962) and to the verbal 
content of an interaction (Exline, 196: 
Sainesbury, 1954, 1955), With the m 
usual design of measuring an observer's judg- 
ments, nonverbal behavior has been shown t0 
communicate accurate information about 
affect (Ekman, 1965a, 1965b), about m 
mentary changes during an interaction CO} 
sistent with the concomitant verbal behavior 
(Ekman, 1964a), about changes in the qui 
ity of an interpersonal relationship (Ek 
1965a; Hoffman, 1965), about psychod 
namic and psychodiagnostic features (Mi 
Danet, & Norton, 1959), and about whe 
a pupil understands a teacher (Maccob 
Jecker, Breitrose, & Rose, 1964). 

In view of this evidence that a pers 
nonverbal behavior can carry diverse 
Sages, it becomes important to begin to spe 
the link between particular nonverbal í 
and the inferences drawn by an observi 


CoMMUNICATION OF AFFECT 


There are two problems: on what basis to 
isolate nonverbal cues in the stimulus be- 
havidr, and what kind of inferences by ob- 
servers to measure, The problem of isolating 
nonverbal cues is complicated by the lack of 
any obvious natural units, and by the fact 


‘that body movement and facial expression 


encompass an enormous number of simul- 
taneously occurring stimuli, continuously 
emitted, any of which might carry differen- 
tial megsages to an observer. Even within a 
still photograph, where time has been arti- 
ficially halted, there are a variety of stimuli 
which an observer might decode, depending 


a upon whether he observes the feet, the hands, 


the shoulders, the eyes, etc. 

Most past research on nonverbal behavior 
has focused on the interpretation of only one 
set of nonverbal cues, usually those provided 
by the face. A few studies have shown that 
information is also communicated by the 
body: studies of posture, gait, and the 
hands. Yet, no direct comparison of these 
different nonverbal cues in terms of the type 
of information each might communicate has 
been possible, since the stimulus persons, the 
observers, and the type of inferences meas- 
ured have varied widely. 

The present study provides an initial basis 
for specifying the link between cue and in- 
ference, by showing two broad categories of 
nonverbal cues taken from photographs of the 
same stimulus person to a common group of 
observers, and comparing observers’ ability 
to draw the same set of inferences from each 
of the two sources, The results might well 
vary depending upon both the two classes of 
nonverbal cues and the type of message or 
class of information selected for study. In 
recognition of our philosophic antecedents, 
the decision was made to divide nonverbal 
behavior at the neck, into the time-honored 
categories: head and body. The second choice 
was to focus upon the class of information 
which has received the greatest past study, 
information about affect, and to explore how 
the head and body might differ in the sub- 
classes of affective information each com- 
municates, 

Informal inspection of photographs from 
past experiments suggested that the affective 
information communicated by the head might 
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well be independent of that communicated 
by the body. Almost identical facial expres- 
sions, seeming to signify similar affect, for 
example, pleasure or anger, were sometimes 
accompanied by markedly dissimilar body 
positions. Similar body positions seeming to 
communicate related information about level 
of arousal were occasionally accompanied by 
markedly dissimilar facial expressions. Dif- 
ferences to be discussed later, between head 
and body in anatomical properties and func- 
tional activities during a conversation, sup- 
ported these observations and led to the 
formulation of the following hypothesis about 
nonverbal behavior occurring during a con- 
versation, Hypothesis: head cues carry in- 
formation primarily about what particular 
affect is being experienced (anger, sadness, 
joy, etc.) but provide relatively little infor- 
mation about the intensity of the affect or the 
level of arousal; body cues reverse this pat- 
tern, communicating information primarily 
about the level of arousal or the degree of 
intensity of the affective experience, but pro- 
viding relatively few cues about what particu- 
lar affect is being experienced. 

Schlosberg’s (1954) formulation of three 
dimensions of emotion offered a method of 
testing this hypothesis by measuring the 
information communicated about these three 
dimensions by head and body cues. Although 
Schlosberg’s evidence for his theory was based 
solely upon judgments of facial expressions, 
which were usually posed, Schlosberg did be- 
lieve that the theory was pertinent to emo- 
tional expression in general. Schlosberg’s 
theory stated that underlying the variety of 
different affect states were three dimensions: 
a pleasantness dimension, a dimension of at- 
tending to or rejecting stimulation, and an 
intensitive dimension from sleep to tension. 

In a number of studies of facial expression 
(Engen, Levy, & Schlosberg, 1957, 1958; 
Levy & Schlosberg, 1960; Schlosberg, 1952; 
Triandis & Lambert, 1958), judgments on 
the attention-rejection and the pleasantness- 
unpleasantness dimensions differentiated pho- 
tographs which had been identified as show- 
ing any of the six Woodworth affect cate- 
gories (happiness, surprise, fear, anger, dis- 
gust, contempt). That successful predictions 
could be made of the Woodworth affect labels 
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from judgments on attention-rejection and 
pleasantness-unpleasantness was Schlosberg’s 
main argument for suggesting that these two 
scales should be conceived as the dimensions 
Underlying the recognition of different affect 
states, In terms of the hypothesis under 
study here, it would be expected that judg- 
ments on the pleasantness-unpleasantness and 
attention-rejection dimensions are primarily 
derived from head cues and that these dimen- 
sions are much less pertinent to judgments 
made from body cues. Conversely, judgments 
on Schlosberg’s third dimension, sleep-tehsion, 
could be expected to be derived primarily 
from body cues. 

Unfortunately, it was not possible to test 
the hypothesis in this fashion because inter- 
correlations have been found between the 
attention-rejection dimension and each of the 
other two. Although Schlosberg had proposed 
that these three dimensions were independent, 
Abelson and Sermat (1962) found that at- 
tention-rejection was correlated with pleasant- 
ness-unpleasantness, and in our own research 
(Ekman, 1964b) this dimension has been 
found to be intercorrelated with both pleas- 
antness-unpleasantness and with sleep-tension. 
The intercorrelations of attention-rejection 
with sleep-tension and with pleasantness- 
unpleasantness might make attention-rejec- 
tion applicable to both head and body cues, 
completely apart from the hypothesized dif- 
ference in the information conveyed by these 
two sources. Therefore, attention-rejection 
results were not utilized to test the hypothe- 
sis, but a more limited test was made utiliz- 
ing only the dimensions empirically demon- 
strated to be independent, pleasantness-un- 
pleasantness and sleep-tension, Thus, in 
determining whether the head might convey 
more information than the body about the 
different affect States, the test is limited to 
only one of the possible dimensions underly- 
ing differences between affect states, pleasant- 
ness-unpleasantness; the test of whether in- 
formation about level of arousal is conveyed 
primarily by the body utilizes, of course, the 
sleep-tension dimension.? The hypothesis can 


2 The findings on attention-rejection will be re- 
ported in another article as a part of a series of 
other experiments directly addressing the question 
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be rephrased as follows: more information 
relevant to the pleasantness-unpleasantness 
dimension than to the sleep-tension dimension 
is available from head cues; and more infor- 


mation about sleep-tension than about states , 


of pleasantness is available from body cues. 
Four experiments were conducted to test 
this hypothesis. Different groups of judges 
participated in each experiment, and in cer- 
tain instances different photographic stimuli 
were shown. The general method , will be 
described first. The results of all four experi- 
ments will then be reported and discussed. 


METHOD 
Stimuli 


The nonverbal stimuli were selected from the rec- 
ords of five standardized stress interviews. In these 
interviews the interviewer, a staff psychologist or 
psychiatrist, had attacked and criticized the moti- 
vation and competency of the subject, a student in 
psychology or psychiatry. After 10 minutes of this 
stress period, the interviewer attempted to induce a 
catharsis phase by explaining his own behavior, 
praising the subject for his resiliency in meeting the 
attack, joking with the subject, etc.? Time-sample, 
still photographs were taken every 30 seconds iii the 
first interview (A), every 15 seconds in the second 
interview (B), and every 5 seconds in the last 
three interviews (C, D, E). The pictures showed 
only the subject’s behavior, with a profile view in 
Interviews A and B, and a full-face view of the 
entire head and body in C, D, and E. 

In Experiments I, II, and III photographs were 
randomly selected without prior inspection from the 
total sample. In Experiment IV, no attempt was 
made to acquire a representative sample of the 
interview; instead, only the most highly communi- 
cative photographs were chosen. The selection cri- 


terion was based on the results of another series, 


of experiments (Ekman, 1965a), in which judges 
were required to guess whether each randomly 
selected photograph had been taken during the 
Stress or catharsis phase of the standardized inter- 
views. Photographs were chosen for Experiment IV 
from those which more than 80% of the judges had 
correctly identified, with this level of accuracy 
replicated in at least two separate experiments. In 
all four of the new experiments the number of pic- 
tures from the stress phase equaled the number 
from the catharsis phase. 

Three separate cue versions showed only the 
stimulus person’s head, the body up to the neck, 
the whole person (head and body). These three 


of adequate models of emotional expression, in 
which new dimensions are also proposed. 

®The stress interview procedure and selection of 
subjects for these interviews has been described at 
greater length in Ekman (1964a, 1965a). 
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TABLE 1 
i DESIGN OF THE FOUR EXPERIMENTS 
; Total number 
TEIE Number of stimul a 
Experiment Sas showa i || ean agani MEER oa or E e aoi Cue conditions 
I 2 (A,B) 24 Random Anchor pictures Head 
Body 
Whole 
II 5 (A,B,C,D,E) 60 Random Anchor pictures Head 
Body 
TIL 5 (A,B,C,D,E) 60 Random No anchor Head 
5 pictures Body 
o Whole 
IV 5 (A,B,C,D,E) 42 Criterion No anchor Head 
` selettion pictures Body 
Whole 


different cue versions were made through photo- 
copying procedures on 35-millimeter slides and 
projected for the judges. With a few pictures some 
difficulty was experienced in separating head and 
body because a hand was in the vicinity of the 
face; with these particular stimuli in the head 
version part of the hand showed in the photograph, 
and in the body version part of the hand was not 
showing. In all of the experiments the photographs 
of the five stimulus persons were presented in a 
random order. 

Experiment I, conducted in 1961, showed photo- 
graphs from the two interviews (A and B) recorded 
at that time. The other three experiments, conducted 
2 years later, employed photographs from all five 
stress interviews, The differences among the stimuli 
are given in Table 1. 


Judgment Task 


In all four experiments judges were required to 
rate the emotion being experienced by the person in 
each photograph on each of Schlosberg’s three 
dimensions. Judges had no knowledge of the inter- 
view situation, or even of the fact that the pictures 


were drawn from interviews. Schlosberg’s generous 


help made possible the use of instructions and 
answer sheets identical to his, Each dimension was 
presented as a 9-point scale, with the name of that 
scale placed under the numbers “I. . - 9.” A short 
Paragraph defined each scale: 


Pleasantness-unpleasaniness. You are to rate 
each picture on a 9-point scale where 1 indicates 
that the person in the photograph is feeling about 
aS UNPLEASANT Or unhappy as imaginable, and 9 
indicates the maximum PLEASANTNESS. 

Attention-rejection. You are to rate each picture 
on a 9-point scale where 9 indicates that the 
person in the photograph is feeling the maximum 
ATTENTION, as if the person is making every effort 
to see something. A rating of 1 indicates that the 
person is feeling the maximum REJECTION, as if the 
person is trying to shut out or keep out any 
stimulation. Inattention is not the true opposite 


of attention, but occupies a position of about 5, 
midway between ATTENTION and REJECTION. 

Sleep-tension, You are to rate each picture on a 
9-point scale where 1 indicates that the person in 
the photograph is feeling the complete relaxation 
of strep, whereas 9 would be given to the most 
“emotional” expression you can imagine, in which 
the person is very excited and shows maximum 
TENSION. 


Procedure 


In each experiment each of the three cue condi- 
tions—head, body, and whole person—was viewed 
by a separate group of judges. A group of judges 
saw the entire set of pictures within a cue condition 
three times, with each photograph projected for 20 
seconds during each presentation. In Experiments I 
and II judges rated the photographs on pleasantness- 
unpleasantness during the first presentation, on at- 
tention-rejection during the second, and on sleep- 
tension during the third. 

In the first two experiments, before using each 
rating scale, judges were shown Schlosberg’s anchor- 
ing photographs to help define its meaning. These 
photographs show posed facial expressions. They 
were not employed in Experiments III and IV, be- 
cause of the possibility that a picture of a posed 
face was not appropriate for defining the meaning 
of a rating scale to be applied to a picture of an 
unposed body. This difference in procedure is shown 
in Table 1. In Experiments III and IV the dimen- 
sions were rated in a balanced order. 


Judges 


Separate groups of college freshmen served as 
judges. Individuals whose parents were foreign-born, 
or who were themselves foreign-born, were dis- 
carded from the data analysis. Judges who did not 


4 Other research conducted on attempting psychi- 
atric diagnosis from photographs of patients has 
found that foreign-born judges and judges whose 
parents are foreign-born perform significantly below 
native-born judges on this task. 
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TABLE 2 


NuMBER OF JuDGEs ror Eac Cue CONDITION IN 
EACH EXPERIMENT 


Experiment Head Body Whole 
I 34 41 35 
aa 17 22 — 
poig 18 19 26 
Iv 18 18 17 


respond to every photograph were also discarded. 
The number of judges remaining in each experiment 
is shown for each cue condition in Table 2.5 A 


RESULTS 


The first and most direct test of the hy- 
pothesis was to determine by calculating a 
coefficient of concordance (Siegel, 1956, p. 
229) the extent of judge agreement in rating 
the different cue versions on the two dimen- 
sions, The prediction was that there would be 
a higher concordance on_pleasantness-un- 
pleasantness than on sleep-tension when the 
head is judged, and a higher concordance on 
sleep-tension than on pleasantness when the 
body is judged. 

Table 3 shows that the concordances ob- 
tained in all four experiments support the 
hypothesis, although there is some variation 
in the absolute size of the coefficients. One of 
these variations was anticipated; since there 
were no anchor photographs in Experiment 
TIT, somewhat less agreement than in Experi- 
ment II was expected; this was confirmed. 

Another method of testing the hypothesis 
was to correlate, on each dimension, judg- 


5 The number of judges discarded from the data 
analysis because they failed to yield complete rec- 
ords was less than 10% of the sample in any experi- 
mental group; the number of judges excluded be- 
cause they or their parents were foreign-born ranged 
from 5 to 35% of an experimental group. 


TABLE 3 
Concorpances ror EACH DIMENSION 
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ments of the head with judgments of the body 
versions of the same pictures, According to 
the hypothesis, if the head and body carry | 
primarily different information, there should 

be no necessary correlation between judg- «| 
ments of the head and of the body. These 
intercorrelations are not independent of the 
coefficients of concordance. If little agreement 

is found in judgments of a particular cue 
condition on a given dimension, a low cor- 
relation results when the ratings-on that di- 
mension are correlated with judgments 
from any other cue soutce. Table 4 shows ` 
Spearman rank-order correlations calculated 

on the median ratings for each photograph . 
on each dimension for comparison of head/ 
body, head/whole, and body/whole cue con- 
ditions, On neither the pleasantness-unpleas- 
antness dimension, nor the sleep-tension di- 
mension was there a significant correlation 
between judgments of head and body. 

The results on the head/whole correlations 
and the body/whole correlations suggest some 
qualifications to the hypothesis. Clearly, rat- 
ings of the head and of the whole person! on 
the pleasantness-unpleasantness scale are 
highly intercorrelated, while there is no rela- 
tionship between judgments of body and 
whole person cue versions on this dimension. 
But, on sleep-tension ratings, not only were 
there moderate correlations between judg- 
ments of body and whole person, as would 
be expected, but there were similarly mod- 
erate correlations between judgments of head 
and whole. Thus, while the evidence so far 
clearly suggests that little information about: 
pleasantness-umpleasantness is communicated 
by the body, and that the head provides more 
information about pleasantness-unpleasantness 
than about sleep-tension, the qualification 
must be added that some information about 


AND Eacu Cur CONDITION 


IN EACH EXPERIMENT 


S Head Body Whole 
I n/m | w I u m | v I m | iW 
Pleasant-unpleasant | .386* | .675* 589* | .771* | 194 ome i 
r : ; a91 | 1 vo? 
Sleep-tension 164 | .319* | 193% | (250+ | `447x ‘ave eh es aie Bee fie 
*p <.001, 
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TABLE 4 


RANK-ORDER CORRELATIONS BETWEEN CUE CONDITIONS ror EACH 
DIMENSION WITHIN EACH EXPERIMENT 


Head and body Head and whole Body and whole c 
Pe Scale 
i: II Til IV I ul Iv I ul IV 
Pleasant-unpleasant | —.042 | —.096 | .038 | — 037 | .821* | .894* | .946* |—.142 | —.125 | —.003 
Sleep-tension .072 .067 | .163 .239 | .413* | .527* | .729* 512*| .455*|  .572* 


*p <.001. 


sleep-tehsion is communicated by the head. 
o This qualification is in agreement with‘ the 

finding shown in Table 3 that there were low 

, but significant concordances on the sleep- 
| "tension dimension for the head-cue condition. 
} Another way fo explore the question under 
study involves determining whether known 
changes in interpersonal interaction could be 
inferred from head cues or body cues. The 
photographic stimuli had been drawn from 
the stress and catharsis phases of a standard- 
ized interview, and other experiments had 
demonstrated that these photographs com- 
mugicate information relevant to the planned 
manipulation of the interaction. Thus, an- 
other method of testing the hypothesis was 
to compare ratings of the head for the stress 
and catharsis photographs, and similarly com- 
pare ratings of the body for the photographs 
from these two interview phases. From 
knowledge of the stress interview, it could be 
expected that the stress photographs would 
be rated as more unpleasant than the cathar- 
sis pictures. According to the hypothesis, 
however, information about pleasantness is 


and catharsis photographs was expected only 
in judgments of the head. Similarly, it could 
be reasoned that stress would be rated closer 
to the tension end of the scale than would 
catharsis; and, this was anticipated only for 
judgments of body cues. 

Table 5 presents these data for the four 
experiments. The judgments were analyzed 
by first obtaining a median rating for each 
judge across all of the stress photographs 
from the five interviews, the same for the 
catharsis photographs, and the difference be- 
tween the two medians, A Wilcoxon matched- 
pairs signed-ranks test (Siegel, 1956, p. 75) 
was then performed on the difference scores 
for all judges. The prediction for the pleas- 
antness-unpleasantness dimension was sup- 
ported. The lack of difference on head or 
whole cues in Experiment I can be attributed 
to the fact that the experiment was heavily 
weighted (about 70%) with photographs of 
a particular stimulus person shown in other 
studies to have been difficult to judge. In the 
other three experiments, pleasantness-un- 
pleasantness ratings differed for the head and 


` 
carried by the head and not by the body, and not for the body. The lack of difference in 
therefore such a difference between stress pleasantness ratings on the body, in contrast 
J TABLE 5 
MEDIAN on Att Stress AND MEDIAN ON ALL CATHARSIS PHOTOGRAPHS FOR 
> Each DIMENSION AND Each CuE CONDITION 
í Head Body Whole 
I I I IV ij u u Iv I m IV 
Pleasant-unpleasant 
St . 4.88 | 3.50 | 5.33 | 5.00 4.71 | 4.85 | 4.85 | 4.33 | 3.89 
Catharsis 402 Sp 5.72* | 7.90* | 4.75 | 4.96 4.56 | 4.72 | 5.21 | 5.68* | 6.89* 
é Sleep-tension 
t % 475 | 5.50 |483 | 4.31 5.14 | 5.00 | 5.38 | 5.21 | 5.60 | 4.75 
Catharsis oh 5.44 | 5.21 | 5.50 432 | 5.23 | 5.22 | 4.90 4.98 | 5.61 | 4.88 


* The difference between judgments of stress and catharsis photogral 


phs is significantly different,  < «001. 
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to the difference obtained on the head, would 
have more import if a more substantial dif- 
ference in pleasantness ratings on the head 
had been obtained for the stress/catharsis 
photographs than was found in Experiments 
II and III. In anticipation of such a doubt, 
the photographs in Experiment IV had been 
selected so as to maximize the likelihood of 
obtaining different affect ratings of stress in- 
terview pictures. In this experiment, where 
there is a very large difference between the 
pleasantness-unpleasantness ratings for stress 
and those for catharsis photographs when 
judgments are based on head or whole person 
cues, there is still no difference between 
pleasantness-unpleasantness ratings of these 
two types of photographs when judgments are 
based on body cues, 

The prediction on the sleep-tension dimen- 
sion was not borne out. In none of the ex- 
periments was a difference found between 
stress and catharsis photographs on this di- 
mension, regardless of whether judgments 
were based on head, body, or whole person 
cues, An explanation, admittedly after the 
fact, is that the two interview experiences 
may have been characterized by the same 
level of arousal. It should be remembered 
that the word tension does not refer to a nec- 
essarily unpleasant state of affairs, Schlosberg 
claimed that, as the intensitive dimension, 
sleep-tension was independent of the nature 
of the affect; this has been borne out by a 
zero-order correlation between ratings on 
pleasantness-unpleasantness and sleep-tension 
dimensions, The catharsis phase was not a 
relaxed experience as compared to the stress 
phase; while the affects differed, their inten- 
sity did not. One bit of supporting evidence 
was a relationship detected between sleep- 
tension ratings and the stimulus person’s 
hands, There appears to be a definite, though 
not perfect, relationship as follows: when the 
hands are not showing, or are against legs or 
trunk, the picture was rated toward the sleep 
end of the scale; when the hands are in front 
of the person, but not out in space, the pic- 
ture was rated toward the middle of the 
scale; and when the hands are out in space, 
he picture was rated toward tension. These 
variations in hand activity occurred with 
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similar frequency in both the stress and 
catharsis phases, although within these ac- 
tivity levels the specific positions of the hands 
and fingers differed. 


DISCUSSION 


A number of factors limit the generality of 
the results. 

First and importantly, judgments were ob- 
tained on only five stimulus persons, who 
may have a number of similarities because 
they were all trainees in psychology or psy- 
chiatry. Certainly the pool of stimulus per- 
sons must be increased and varied before the 
results can make any claim to generality. 

A second limitation is that nonverbal be- 
havior was sampled from a single situation, a 
stress interview. The results of other experi- 
ments (Ekman, 1965a) indicate that this was 
not an atypical situation, considering that 
judges, though untrained, must have recog- 
nized the behavior in order to accurately de- 
cipher it; nevertheless other forms of inter- 
action should be sampled. 

A third problem is the artificiality inherént 
in presenting behavior in the static form of 
still photographs. From other data it appears 
unlikely that the use of motion picture film 
records would yield significantly different 
results on the pleasantness-unpleasantness di- 
mension, In a pilot experiment (Ekman, 
1965b) judgments of a new sample of stimu- 
lus persons, for which the photographic stim- 
uli were bursts of five frames taken at 1-sec- 
ond intervals, rather than single frames, pro- 
duced concordances of a similar magnitude. 
The results of another study (Hoffman, 1965) 
allow more direct comparison of motion pic- 
ture film and single frame stimuli. Hoffman 
utilized the same stress interview, employed 
the stress/catharsis judgment task, and the 
accuracy levels he obtained with 5 seconds of 
motion picture film are identical to those we 
obtained with the single frame. The relation- 
ship demonstrated, between the stress/ca- 
tharsis task and the pleasantness-unpleas- 
antness ratings reported here, suggests that 
the use of films rather than still photographs 
might not appreciably modify the results on 
this dimension. It seems likely, however, that 
ratings on the sleep-tension dimension would 
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change appreciably if the stimulus unit were 
a sample of motion picture film. The con- 
cordances reported here are probably lower 
than those which might be derived from stim- 
uli which captured sequence of movements 
and acceleration of activity over even a short 
period of time. 

A final problem is one common to all studies 


' utilizing what has been termed a communi- 


cative, as distinguished from an indicative, 
experimenta] design (Ekman, 1965a). Analy- 
sis of Sudges? reactions reveals only what 
judges are able to do, but not what further 
information may be available in the stimuli 


, either to the more tutored eye or through 


more precise measurement and analysis tech- 
niques. 7 

In view of these sampling and methodolog- 
ical problems, the reader is urged to approach 
the discussion to follow with caution, and 
regard the interpretations as tentative. 

There are two questions to discuss: Why 
do head cues furnish less information about 
sleep-tension than about pleasantness-unpleas- 
antness? And, why do body cues communi- 
cate almost no information about pleasant- 
ness-unpleasantness and yet still carry infor- 
mation about sleep-tension? The first ques- 
tion focuses attention on a discrepancy 
between our results and Schlosberg’s. 

Schlosberg’s theory held that the intensi- 
tive dimension of emotion, as represented by 
the sleep-tension dimension, is relevant to 
facial expression and not in a subsidiary role. 
While Schlosberg did not employ a measure of 


_judge agreement, he regarded the sleep-tension 


dimension as tentative, and in describing the 
reasons for his caution, partially forecast the 
present difficulties as follows: 


Much more work has to be done before we can 
be satisfied with the intensitive dimension of facial 
expressions. For one thing, we were working with 
a collection of pictures posed to represent emotions; 
this concentration on one end of the continuum in- 
troduces “series effects” in the ratings [1954, p. 87]. 


It is precisely the differences between non- 
verbal behavior which occurs when a subject 
is prevented from talking and is asked to use 
the nonverbal channel alone to communicate 
emotions, and the nonverbal behavior which 
Occurs more spontaneously when an individual 
is conversing in an emotionally toned relation- 
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ship, which probably best account for the 
differences between our results and Schlos- 
berg’s. Posing can elicit nonverbal behavior 
which rarely occurs within the constraints 
usually operative during a spontaneous inter- 
personal interaction. If we had asked our 
stimulus persons to look sleepy or lethargic, 
their facial cues might have elicited much 
greater judge agreement on the sleep-tension 
dimension—but they would not have been 
representative of interactive nonverbal be- 
havior. When an individual is talking or di- 
rectly reacting to another’s communications, 
the face is alive. The quality of the affect 
shown in the face does vary, but while the 
person is engaged in the interaction he can- 
not go to sleep; his eyes do not droop; his 
jaw does not sag. 

Thus, if the dimensions of emotion are to 
apply to interactive nonverbal behavior, then 
information from head cues about sleep- 
tension should be relegated to a subsidiary 
role; if the dimensions are to apply to the 
potential range of information communicated 
by head cues, in particular if the behavior is 
purposely posed free of restraints to repre- 
sent the range of possible information or if 
the interaction is a highly special one, then 
sleep-tension should occupy a more important 
place. 

Essentially the same line of reasoning is 
relevant to the question of why the body 
provides information about sleep-tension. 
While the face is occupied with maintaining 
visual contact and with speaking, and thus 
communicates less about a dimension such as 
sleep-tension, the body is less constrained, re- 
quired merely to keep the person in his chair. 
The body posture, and the positions of spe- 
cific body parts can appear relaxed or tense. 
Positions of the hips and of the trunk can 
place the person closer to or farther away 
from the other person. Positions of the hands, 
arms, or feet can accent or underline part of 
the verbal message, or can in a miniature 
portrayal act out part of the verbal message, 
or a theme which has not been verbalized. 
The body can be engaged in instrumental ac- 
tivity (lighting a pipe, scratching the head, 
cleaning the fingernails, stretching, etc.), or 
can be revealing the readiness to take some 
action relevant to the interaction, Observers 


1734 


may infer from these various bodily activi- 
ties information about degree of intensity 
and/or level of arousal. An observer can infer 
intensity of an affect or extent of arousal from 
boy photographs in which the behavior ap- 
pears either mobile or highly immobile. Our 
research in progress suggests, at least tenta- 
tively, that high levels of arousal can be 
associated with either much or little body 
movement, depending upon whether the per- 
son is directly expressing the affect, or inhib- 
iting and holding in the expression of the 
affect. r 

Certain types of bodily activity probably 
carry information about pleasantness-unpleas- 
antness, but occur infrequently during spon- 
taneous conversations; that is, some gestures 
have a pleasant or unpleasant quality; certain 
action patterns which in miniature act out 
part of a verbal message can also carry this 
kind of information; and, certain types of 
bodily activity which a sophisticated observer 
might interpret on a psychodynamic level 
(e.g., the interpretation of scratching finger 
movements as hostility directed inwards by 
Mahl et al, 1959) could also lead to in- 
ferences about a dimension such as pleasant- 
ness-unpleasantness. While various parts of 
the body might convey such information, 
probably the position of the fingers most 
typically carries information akin to that 
provided by the face.* Another area of over- 
lap is the position of the head; the tilts and 
swings of the head may carry information 
akin to that provided by the body. 

The terms level of arousal and intensity of 
affect have been used jointly to characterize 
the information measurable on the sleep- 
tension dimension, Schlosberg, in his descrip- 
tion of this dimension, and other authors who 
discuss arousal and intensity of affect, have 
used the terms as synonymous. There may be 
a slight difference between the two, however, 
as a function of the kind of nonverbal cues 
available to the judge. If the observer sees 
only body cues, he probably interprets the 
information available in terms of how aroused 


In a recently completed experiment, we have 
demonstrated that photographs showing particular 
hand positions do communicate information about 
attitude, interpersonal manner, and affect (Ekman, 
1965b). 
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or energized the organism is when he rates 
the behavior on the sleep-tension scale. If the 
observer sees the face as well as the body, 
then he could use the information about 
arousal communicated by the body as an 
intensity quantifier of the particular affect 
communicated by the head. The distinction is 
a minor one, and our recent research has 
taken a different approach exploring just 
what type of information is being measured 
by the sleep-tension dimension. 

In a partially completed series of “experi- 
ments, new rating scales ‘have been devised, 
and judgments of the same sets of photo- 
graphs have been obtained, so that judgments 
of each of these new dimensions can be com- 
pared with the data already available. The 
first new scale was designed to investigate 
whether sleep-tension ratings might reflect 
judgment of physical appearance more than 
inferential judgment, such as is made in the 
pleasantness-unpleasantness ratings, about an 
internal feeling state of the stimulus person. 
This scale was labeled physically mobile- 
physically immobile. The second new scale 
was labeled aroused-unaroused, since one of 
the main theoretical interpretations of the 
sleep-tension ratings is that they measure 
level of arousal. A third scale was labeled 
active-passive, since other writers (Triandis 
& Lambert, 1958) have pointed out the simi- 
larity between this factor from Osgood’s re- 
search on the semantic differential and the 
sleep-tension dimension. 

While no conclusions are possible yet, it is 
clear that while ratings on all three new 
scales are correlated with ratings on sleep- 
tension, the correlations among the new 
scales are greater than the correlation of any 
one of them with sleep-tension. Further, un- 
like sleep-tension, these three new scales 
seem less independent of pleasantness-un- 
Pleasantness ratings, At this point, the in- 
terpretation of the sleep-tension dimension 
appears to be a more complex problem than 
was anticipated, 

The problem for this study was to specify 
the link between nonverbal cues and infer- 
ences drawn by observers. Only one class of 
information has been considered here, and 
there is little reason to suppose that the same 
Tesults would be obtained from inferences 
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about another class of information. In fact, 
in another work (Ekman, 1964a) there was 
some ‘evidence of similarity between head 
and body cues as modifiers of a verbal mes- 
sage. The study has, however, established 
that there is a difference between these two 
broad classes of nonverbal cue sources, the 
head and the body, in the communication of 
information about affect. This is regarded as 
only a first step. Further subdividing into 
more specific, cues within each of these stim- 
ulus classes is still to be done. 
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The effect of changes in information load and complexity of Ss’ conceptual 


structure on information search and information utilization is measured. A’ 


complex decision-making task is used to provide opportunity for differences 


in information handling. It is*found that generally information soarch de- 


creases as information load increases and that Ss scoring high on measures of 
the complexity of conceptual structure are less affected by information load 
changes than their lower scoring counterparts. It is also found that the integra- 


tive handling of the information attained through search follows the lines of 
the Schroder, Driver, and Streufert theory and parallels the findings of 
Streufert and Schroder with integration in decision making. 


In a recent volume Schroder, Driver, and 
Streufert (1965) have been concerned with 
information acquisition and information proc- 
essing by individuals and groups. These au- 
thors have proposed that persons scoring 
high on measures of conceptual structure 
(Schroder & Streufert, 1963; Streufert & 
Schroder, 1963) are more “information ori- 
ented” and should generally process more 
information in complex decision-making situ- 
ations, The maximal level of information 
handling would hypothetically vary along the 
lines of the inverted U curve (with informa- 
tion handling plotted on the ordinate, and 
environmental conditions, i.e, information 
load, plotted on the abscissa), Individual or 
group differences in conceptual structure 
would produce a family of such inverted U 
curves. It is postulated that the general char- 
acteristic complexity of a person’s conceptual 
structure relates to the elevation and shape 
of his inverted U curve. 

Streufert and Schroder (in press) have re- 
cently shown that the information-processing 
characteristics of decision-making groups fol- 
low the lines proposed by that theory. Less 
is known about the various aspects of in- 
formation acquisition. In this context the 
term “information orientation” permits dif- 
ferent interpretations, It could imply differ- 


1 This research was in part supported by the Re- 
search Council, Rutgers-The State University. 
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ences in general “interest” in information, 
differences in perceptual characteristics, dif- 
ferent reasons for obtaining information, and 
so forth. In this paper we are concerned with 
some of the possible questions which could 
be asked about the information orientation 
of persons and groups scoring high or low 
on measures of conceptual structure. 

We are here specifically concerned with 
three processes of information search and 
utilization: delegated search for information, 
that is, requests for information from an 
external source by individuals taking part in 
a complex information-processing task; self 
initiated information search, that is, all de- 
cisions made by task-oriented four-man 
groups which are specifically designed to 
gain more task relevant information; and 
integrative utilization of information gained 
through search in subsequent group decision 
making. We compared the performance of 
Persons (or homogeneous groups) differing in 
the complexity of conceptual structure, and 
we also considered the effect of changes in 
environmental conditions (ie. information 
load). The questions asked in this research 
are: What are the effects of variation n 
information load and individual differences in 
conceptual structure on delegated information 
search? What are the effects of variation m 
information load and individual differences in 
conceptual structure on self-initiated infor- 
mation search? Is integrative information 
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utilization similar to the general complex in- 
formation processing characteristics measured 
by Streufert and Schroder (in press)? 


METHOD 


"Sub jects and Task 


Male students, numbering 185, at an eastern uni- 
versity were tested to determine their level of com- 
plexity in conceptual structure. Tests utilized were 
the Sentence Completion Test (Schroder & Streufert, 
1963), and the Impression Formation Test (Streufert 


& Schroder 1968). Eighty subjects were selected on 


4 (40 subjects) scores on 


` 


—_— 


the basis of very low (40 subjects) or very high 
these tests.? Differences be- 
tween the two groups of subjects on SAT verbal 
and quantitative scores were small and insignificant. 
Structurally homogeneous four-man teams were 
formed, These teams „were placed in a tactical game 
situation (for a detailed description of the simula- 
tion procedure, see Streufert, Clardy, Driver, Kar- 
lins, Schroder, & Suedfeld, in press). Each team was 
given the task of making decisions regarding the 
invasion of a mythical island. Teams were told 
that they were playing against another team which 
supposedly had received instructions to defend the 
island. All the functions of the enemy team were 
performed by the experimenters. Consequences of 
the subjects’ decisions, as determined by the experi- 
mentess, were fed back to the subjects. Information 
regarding these consequences was slightly favorable 
at all times. Analysis of a rating scale filled out 


„by the subjects in various phases of the game indi- 


cated that they perceived themselves in a moderately 
advantageous position at all times. 

The kind of success (not the level) experienced 
by the teams was determined by the strategy played 
by the supposed enemy team (the experimenters). 
The sequence of strategies played was the same in 
all cases, In this way all teams were kept approxi- 
mately equal in the general kind of operations em- 
ployed by them, their level of success, and the 
strategic moves made by the supposed enemy team. 
“All groups of subjects were exposed to different 
load conditions. During each 30-minute game pe- 
riod, subjects received either 2, 5, 8 10, 12, 15, oF 
25 independent informative statements. These state- 
ments would be in response to orders or requests 
received from the teams of subjects or, if no such 
orders had been received, general information re- 
garding other relevant “observations.” The order 
of the load periods was varied. From a number of 
tandom sequences those were selected which would 


“insure that each load condition would appear at 


least once in each position. 


2The impression formation test was scored in 
accordance with the manual of Streufert and 
Schroder (1963). The sum of the two highest scores 
on'the sentence-completion test was used in arriv- 
ing at a value of subjects on that 
lected subjects scored high on both tests or low 
on both tests. 
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Each group of structurally complex persons was 
matched with a group of structurally simple per- 
sons in terms of the information load sequence. Ten 
different sequences were used. 


Data Collection 


Delegated information search, After the end of 
each 30-minute playing period, 56 of the subjects 
(who were members of seven structurally simple 
and seven structurally complex teams) were given 
a number of tests and rating sheets to fill out. 
Among these was the following item: 


The information you are receiving is prepared for 
you fn the same way it would be prepared for 
real commanders by a staff of intelligence officers, 
These persons have been instructed to inform 
you only of important occurrences. You may feel 
that these men do not give you sufficient infor- 
mation or do not give you adequate detail. On 
the other hand, you may feel that the informa- 
tion you are receiving is too detailed and you are 
presented with some unimportant information. 
You may instruct these intelligence officers to in- 
crease or decrease the amount of information they 
present to you. We would like you to decide this 
matter for yourself. Please do not consult the 
other commanders on this issue at any time. We 
will adjust the information flow according to the 
majority opinion in your group. Please check your 
preference in comparison to the immediately pre- 
ceding game period: 
I would prefer to: 

receive much more information 

receive a little more information 

receive about the same amount of information 

receive a little less information 

receive much less information 


In the analysis, the above choices were assigned 
scores 5, 4, 3, 2, and 1, respectively. (In actuality, 
information load was predetermined and was not 
adjusted in accordance with requests by subjects.) 

Self-initiated information search, In contrast to 
the above measure, which was gained from each 
individual subject, this measure as well as the meas- 
ure of information utilization was gained from the 
group as a whole. Self-initiated information search 
is an expression of the number of information search 
or reconnaissance orders given by the teams of sub- 
jects to their own troops during a particular 30- 
minute period. 

Integrative information utilisation, Here we are 
concerned with the use made of the information 
gained through self-initiated search. A score of 1 
was assigned to each future decision which was 
a consequence of a particular information search 
move (cf. Streufert & Schroder, in press). The total 
score for any playing period is then the number 
of future decisions which are consequentially related 
to information search (reconnaissance) decisions 
made by a team during that particular period. 
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Frc. 1, Average amount of information requested 
through delegated search under changing load con- 
ditions for subjects differing in the complexity of 
conceptual structure. 


RESULTS AND Discussion 
Delegated Information Search 


Measurement of self-initiated information 
search and integrative information utilization 
is based on data which permit parametric 
analysis, so that analysis of variance tech- 
niques could be used. The data concerned 
with delegated information search, however, 
are derived from ratings, and permit only 
nonparametric analysis. Statements of sig- 
nificance are based on the Wilcoxon matched- 
pairs signed-ranks test (for tests of differ- 
ences between loads) and the Mann-Whitney 
U tests (for tests of difference between 
structures), 

Disregarding differences in structure, a sig- 
nificant drop in delegated information search 
was obtained between Load Conditions 12 
and 15, and Load Conditions 15 and 25 
(p< .05 and <.01, respectively). These 
results are clearly indicated in the slopes of 
information search curves in Figure 1. It ap- 
pears that an increase in information load, 
from what Streufert and Schroder (in press) 
have shown to be suboptimal to optimal load 
conditions for complex information process- 
‘ing, does not alter information search charac- 
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teristics to a large degree. However, as in- 
formation load increases toward superoptimal 
levels (cf. Streufert & Schroder, in press), 
delegated information search beings to drop. 
Yet it should be noted that even at highly 
superoptimal information loads (i.e., 25 mes- 
sages per 30-minute period), the subjects are 
still asking for increased information levels, 
Observation of subjects’ behavior under dif- 
ferent load conditions suggests that they 
would prefer environments with load condi- 
tions producing optimal complexity” of per- 
fortnance as found by Streufert and Schroder 
(in press). The apparent inconsistency in 
the behavior of the subjects (requesting 
additional information at superoptimal load 
levels) might be due to at feast two possible 
causes: Subjects may feel that some signifi- 
cant information is not reported, or, an ex- 
traneous factor, such as a possible effect of 
social desirability of engaging in information 
search may be operating. Testing -for a dif- 
ference between those subjects requesting 
“much more information” and those request- 
ing the “same amount of information” or “a 
little less information” under high load condi- 
tions using scales of social desirability and 
response set may be of value. 

Differences in delegated information search 
for subjects differing in the complexity of 
conceptual structure were found under Infor- 
mation Load Conditions 15 and 25 (p < .05 
and <.01, respectively). In line with the 
theory of Schroder et al. (1965) this finding 
could be explained as a consequence of lack- 
ing sensitivity of persons scoring low on 
conceptual structure measures to changes i 
environmental conditions. Another possible 
explanation might be a greater effect of social 
desirability on low scorers. If information 
search is socially desirable, and if measures 
of conceptual structure in this population 
correlate with social desirability (correla- 
tions in other populations have varied from 
—.41 to +.01), then one may expect less 
sensitivity to superoptimal load from low 
scorers, whose response would be in part de- 
termined by social desirability, than one may 
expect from high scorers. In other words, 
high scorers would be more sensitive to in- 
formation changes, per se. 
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CoNCEPTUAL STRUCTURE 


Self-Initiated Information Search 


An analysis of variance technique (Type I 
Analysis; Lindquist, 1953) was used to test 
for significance. The interaction effect (F 

* = 2.52, p < .05) and the main effect for loads 
(F = 7.26, p < .01) are significant. The main 
effect for structure (F = 2.06) is not signifi- 
cant, Mean scores for self-initiated informa- 
tion search are plotted in Figure 2. As in 
the analysis of delegated information search, 
we find ĉthať increases in information load 
result in decreasing information search. How- 
ever, a look at the graphic representation in 
Figure 2 seems to suggest that the decrease 
Sn information search is a near-linear function 
of information load rather than confined to 
only superoptimal information loads (as it 
was for delegated information search). If one 
considers an averaged curve (over struc- 
tures), then self-initiated information search 
responses appear to drop from approximately 
two such decisions per period for highly sub- 
optimal information loads to one decision in 
highly superoptimal information loads. In 
other words, some self-initiated information 
search continues to occur. 

However, if the subjects had been inter- 
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Fic. 3. Integrative utilization of previously sought 
information under changing load conditions for 
groups of subjects differing in the complexity of 
conceptual structure. 


ested in receiving considerably more informa- 
tion (as their responses on the delegated in- 
formation search measure would suggest), 
they should have made considerably more 
active information search decisions than they 
actually did. The fact that they did not 
lends some support to the suggestion that the 
continued high delegated information search 
is not necessarily due to actual interest in 
additional information. 

It is interesting to note that the interaction 
effect for structures and loads in self-initiated 
information search is significant, although the 
main effect for structures is not significant. 
It appears that structurally simple persons 
are considerably more sensitive to changes in 
information load than are structurally com- 
plex persons. This result can be explained in 
terms of current theoretical work concerned 
with conceptual structures and information- 
processing characteristics. In an analysis of 
the response characteristics of structurally 
simple and complex persons, Schroder et al. 
(1965) and Streufert and Schroder (in press) 
have shown that structurally simple persons 
(or groups consisting of such persons) re- 
spond more directly to immediate environ- 
mental information. Conversely, structurally 
complex persons (or groups) respond in a 
more integrated strategic fashion. These find- 
ings, according to Streufert and Schroder, 
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hold under all but extreme sub- or super- 
optimal load conditions. 

In other words, structurally simple groups, 
who would tend to relate one stimulus to one 
"response, should require more information 
under suboptimal loads than structurally 
complex groups, who can reutilize informa- 
tion for more complex integrated decisions. 
On the other hand, under superoptimal load 
conditions, structurally simple groups should 
be satisfied with available information, since 
it gives them sufficient stimuli to produce 
responses. Structurally complex groups, who 
make more integrated strategic decisions, 
would be likely to require additional relevant 
information to permit integration no matter 
what the information load level may be. 

Further, internal search as suggested by 
Lanzetta (1963) may be more typical of 
structurally complex subjects. This internal 
search may not vary directly with informa- 
tion load, so that more balanced search 
characteristics would result for subjects en- 
gaging in this method of generating relevant 
“nformation” on their own. 


Integrative Information Utilization 


An analysis of variance technique (Type I 
Analysis; Lindquist, 1953) was used for test- 
ing for significance. The interaction effect 
(F = 4.82, p < 01) as well as the main ef- 
fects for structures (F = 33.61, p < .01) and 
information loads (F= 6.53, p< 01) are 
significant, The mean number of integrated 
information search orders are plotted in 
Figure 3. The results are quite similar to 
those found by Streufert and Schroder (in 
press) for general integration patterns. Gen- 
erally, structurally complex persons show 
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considerably higher integration levels than 
structurally simple persons. Integrative in- | 
formation utilization is optimal (at least for 
structurally complex groups) somewhere near 
the midrange of information load. For struc- 
turally simple groups, integrative information 
processing breaks down completely when in- 
formation load reaches extreme suboptimal | 
and superoptimal levels. This finding, then, 
gives additional support to the hypothesis 
of Schroder et al. (1965) about, complex 


information-processing behavior. 
A J 
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COOPERATIVE CHOICE IN A 


PRISONER’S DILEMMA GAME + 
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reciprocated less when such behavior 
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In recent years a number of investigators have 
used the two-person Prisoner’s Dilemma game, 
illustrated in Table 1, to study cooperation and 
competition.? In this game, if each person chooses 
“Black” 100% of the time the worst that can 
happen is tò win nothing, and therefore, each can 
insure against any loss, On the other hand, the 
optimum joint strategy is for both persons to 
chogse “Red”. on every trial and receive three 
units per trial, that is, the mutual or joint ex- 
pected value is greatest when both players play 
Red 100% of the time. Since there is no “ra- 
tional” basis for maximizing gain, and given the 
assumption that playing Red or Black denotes a 
cooperative or competitive response, respectively, 
the game is ideally suited for the study of com- 
petition and cooperation. 

In order to determine the rel 
which influence an individual to cooperate or 
compete, one of the major difficulties is that each 
individual is responding to the responses of the 
l jations in the instruc- 
tions given to subjects (Deutsch, 1958) or varia- 
tions in the values of the payoff matrix (Scodel, 
Minas, Ratoosh, & Lipetz, 1959), the independ- 
ent variable is difficult to vary systematically. 
Some investigators, therefore, have used a stooge 
who is instructed to use a predetermined strategy. 
Minas, Scodel, Marlowe, and Rawson (1960), 
for example, used a stooge who “matched” the 
choices of the subject. Over 50 trials in this 
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game, 2 related experiments were con- 


s that individuals would reciprocate cooperative 


he Ist study systematically varied the 
and competitive responses by the 
d competitive responses by S, respec- 
fa simulated partner who matched 
most Ss did not reciprocate cooperative 
ed. Males reciprocated 


was nonoptimal. 


matching condition, 36% of the choices of 30 
subjects were cooperative. A similar deception 
was used by Scodel (1962) and by Bixenstine 
and Wilson (1963) who found that subjects were 
more likely to cooperate if a stooge or “simu- 
lated other” competed initially and then coop- 
erated on later blocks of trials. 

The basic purpose of these studies is to de- 
termine the conditions under which individuals 
are likely to cooperate or compete. In accordance 
with this objective, the present study, consisting 
of two related experiments, attempted to pro- 
vide further tests of the general hypothesis that 
an individual faced with a cooperative partner 
would show cooperative behavior in return. 

The first experiment (Study I) was designed 
to vary systematically the “matching” condition 
employed by Minas et al. (1960), without strain- 
ing the credulity of subjects. Instead of match- 
ing the subject’s responses on 100% of the trials, 
as in the study by Minas et al., the “matching” 
probability was varied; for example, in one con- 
dition the subject’s responses were randomly 
matched on 75% of the trials. Under this condi- 


TABLE 1 
PAYOFF MATRIX FOR PrisONER’s-DILEMMA- 
Tyre GAME 
Person B 

DAS E tte 

Red Black 

Red 3,3 Bibs 

Person 

A Black 5,1 0,0 

Note.—Left and right cell entries denote payoffs for Players 


A and B, respectively. 
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tion, however, when the subject made a coopera- 
tive response, he received feedback that his part- 
ner was cooperating 75% of the time, but when 
the subject made a competitive choice, he re- 
ceived feedback that his partner was cooperating 
25% of the time, that is, competing 75% of 
the time. Hence, the subject received discrepant 
feedback as a function of his own response. 

If we let P(cp/cp) denote the conditional 
probability of a cooperative response by the 
simulated partner given a cooperative response 
by the subject, and let P(cm/cm) denote the 
conditional probability of a competitive response 
given a competitive response by the subject, one 
method of making feedback consistent and in- 
dependent of the subject’s response is to set 
P(cp/cp) =1—P(cm/cm). For example, if we 
set P(cp/cp) =.75 and P(cm/cm) =.25, the 
subject receives consistent feedback that his 
partner is cooperating 75% of the time. How- 
ever, this situation is equivalent to a simulated 
partner who randomly cooperates on 75% of the 
trials, In Study I, P(cp/cp) and P(cm/cm) were 
systematically varied, some groups receiving con- 
sistent feedback from the simulated partner while 
others received discrepant feedback. Neverthe- 
less, despite this variable of feedback discrep- 
ancy, it was predicted that cooperative behavior 
would vary directly with P(cp/cp) and inversely 
with P(cm/cm). 

The purpose of the second experiment (Study 
TI) was to test the effects of a simulated partner 
who imitated or matched the previous response 
of the subject, hereafter referred to as the “Lag 
1” condition, This procedure, presumably, com- 
municates to the subject, “I'll cooperate if you 
will, and T'Il compete if you do.” Under this con- 
dition, an individual with an initial tendency to 
compete will be immediately exposed to recipro- 
cal competition, will not be reinforced for com- 
petitive behavior, and will experience the futility 
of competitive behavior. Similarly, an individual 
with an initial cooperative tendency will have 
his cooperative behavior reciprocated and rein- 
forced. Consequently, the Lag 1 condition was 
employed in an attempt to induce cooperative 
behavior without straining the credulity of the 
subjects. 

METHOD 
Subjects 


For both experiments, subjects were students en- 
rolled in introductory Psychology classes. Subjects 
for Study I were 36 males and 36 females while sub- 
jects for Study II were 40 males and 40 females. 


Procedure 


All subjects were scheduled in pairs of the same 
sex and could see each other as they entered the ex- 
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perimental room. They were seated across a table 
facing each other with a wooden partition between 
them so that once seated, they could not sce each 
other. At right angles to the partition separating the 
subjects was a second partition which screened each 
subject from the experimenter. Once seated, they 
were instructed not to communicate with each other 
until the end of the experiment. On the table before 
each subject was a red and black button (push- 
button switches) corresponding to cooperative and 
competitive choices. The color of the buttons was 
counterbalanced so that for half of the subjects, 
pressing the red button represented a cooperative 
choice and for the other half, pressing the red but- 
ton represented a competitive choice. 

The payoff matrix shown*in Table 1 was mounted 
on the partition before each subject and after ex- 
plaining the game to the subjects, they were in- 


structed that there would be a large number of 


trials and that their task was to accumulate as 
many points as possible. After each trial, subjects 
received one of four different colored poker chips, 
representing 5, 3, 0, and —1 points, through a small 
slot at the bottom of the partition separating the 
experimenter from the subjects. 

A light on the panel before each subject signaled 
the start of a trial and each trial lasted» 15 seconds. 
Eighty trials were administered with a 1-minute in- 
terval between blocks of 20 trials. During these in- 
tervals, the subjects were asked to “stack” their 
chips and instructed not to talk. The proportien of 
cooperative responses on the last block of 20 trials 
was used as a measure of asymptotic response prob- 
ability. 


Study I 


Study I consisted of two major independent vari- 
ables: P(cp/cp) and P(cm/cm), that is, the condi- 
tional probability of matching cooperative and com- 
petitive responses, respectively. As an example, if 
P(cp/cp) was .75, on trials where the subject made 
a cooperative choice, the subject received 3 points 
with probability .75 and —1 point with probability 


25; similarly, if P(cm/cm) was .75, on trials where 


the subject made a competitive choice, the subject 
received 0 with probability .75 and 5 points with 
Probability .25. Three levels of each of the two vari- 
ables were employed: .25, .50, and .75. Treating 
sex of subjects as a third independent variable, 4 
3X3X2 factorial design was used. 


Study IT 
Tn Study II involving the Lag 1 condition, on the 


first trial, the subject’s initial response was matched, | 


and thereafter, the subject’s previous response Was 
always matched. Two sets of experimental instruc- 
tions were used. For one group of 20 males and 20 
females, the same set of instructions was used as in 
Study I, hereafter referred to as the “regular” in- 
structions. However, postsession interviews with sub- 
jects in Study I had revealed that most subjects 
orientation had been to “beat” the other person, 
that is, to win more points rather than to maximize 
gain, Hence, to determine the effects of a less com- 
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petitive orientation, a second group of 20 males and 
20 females was also given the Lag 1 condition with 
the following additional instruction, hereafter re- 
ferred to as the “modified” instruction: “your task 
js not to win more points and beat the other person 
but simply to accumulate as many points as pos- 
sible.” Study II, therefore, consisted of a 2 X 2, Sex 
X Instruction, factorial design. 


RESULTS AND DISCUSSION 
Study I 


The analysis of variance indicated that three 
sources ,of variance were significant at the 5% 
level: the main effect of sex (F = 4.211, df= 
1/54), the main effect of P(cp/cp) (F = 3.740, 
df = 2/54), and the interaction between Sex and 
P(cb/cp) (F = 3.211, df = 2/54). Figure 1 shows 
the nature of this interaction and indicates that 
for males the effects of P(cp/cp) are negligible, 
while there is a slight tendency for females to 
be more cooperative if they are led to believe 
that their partner is more cooperative. Neverthe- 
less, note that even for the most cooperative 
group of females, the mean probability of co- 
operative vhoice is less than .30, In general, then, 
despite the fact that main effect for P(cp/cp) is 
significant, the results indicate that neither males 
nor, females were very likely to reciprocate co- 
operative behavior. 

The data for males indicate that they did not 
respond differentially to either of the experi- 
mental conditions, In effect, males responded 
competitively under all conditions and ignored 
the responses of the other player. One might 
conclude, therefore, that men are more competi- 
tive and less “rational” than women. However, 
with false feedback from the simulated partner, 
the optimal strategy was to compete on every 
trial. Hence, under the conditions presented to 
them, males behaved quite rationally, and it is 
reasonable to assume that they competed in or- 
der to maximize payoff. 

The significant sex difference found in this 
study showing females to be more cooperative 
than males appears to be inconsistent with the 
results found by Lutzker (1960) showing no dif- 
ferences between males and females. However, 
the two studies are not directly comparable in 
that false feedback was employed in this study, 
and one possible explanation for the discrepancy 
is that if pairs of females are initially competi- 
tive, subsequent cooperative behavior takes place 
only when one of them is willing to cooperate 
and take a loss for an extended number of trials. 
This willingness to undergo a temporary loss 
has been described by Anatol Rapoport * as a 


3 Unpublished manuscript entitled “Formal Games 
as Probing Tools for Research on Human Behavior. 
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Fic. 1. Mean proportion of cooperative responses 
on the last block of 20 trials as a function of the 
conditional probability of a cooperative choice given 
a cooperative choice. 


“martyr run,” and when subjects are actually 
playing against each other, as in Lutzker’s study, 
it is plausible that not many of them are willing 
to be martyrs. Under the conditions of the pres- 
ent study, however, it is conceivable that some 
of the females may have been led to believe that 
the other was acting the martyr. 

In this connection, it is interesting to speculate 
about the type of individual who might feign 
ignorance of a martyr run, and take advantage 
of such a “sucker.” In postsession interviews with 
subjects, many of them, especially the males, felt 
that the other person “didn’t understand the 
game or used poor strategy.” Some of the males 
even admitted that they thought that the other 
person was attempting to cooperate, and yet, 
these subjects had taken advantage of these at- 
tempts. A second explanation for the significant 
sex difference, therefore, is that females may be 
more concerned with interpersonal relations than 
males (ie., more sociable, sympathetic, and sensi- 
tive to the responses of others), and it is plau- 
sible that they might hesitate to take advantage 
of others. Thus, two competing tendencies may 
be operating in this situation, namely, a tendency 
to maximize personal gain and a tendency to be 
considerate, fair, and understanding. Males, evi- 
dently, had less conflict between these competing 
tendencies than females. 

For females, the differential effect of the two 
independent variables presents an interesting 
problem. In all of the experimental conditions, 
the subject received feedback about the choices 
made by her partner through her payoff. There- 
fore, if information about the other person’s 
choice influences the subject’s behavior, both 
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variables should affect response probability. Since 
only one of these variables, P(cp/cp), had a sig- 
nificant effect, one implication is that, for females 
at least, value of payoff interacts with feedback, 
that is, the effects of feedback that the other 
player made a cooperative or competitive choice 
depend upon the outcome or consequence of this 
information. This interpretation of the results is 
consistent with the results found by Atkinson 
(1957) and Feather (1959) indicating that in a 
decision-making situation, expectation or subjec- 
tive probability interacts with the valence or 
utility of the outcome. 


2 


Study IT 


The analysis of variance indicated that the 
main effects of sex (F = 10.258, df = 1/76) and 
of instruction (F = 5.955, df =1/76) were both 
significant at the .05 level. Table 2 shows the 
mean proportion of cooperative responses for the 
four groups and indicates that males were more 
cooperative than females, and that the modified 
instructions, as predicted, increased the tendency 
to cooperate. With regard to the interaction be- 
tween sex and instructions, the data seems to 
suggest that the modified instructions had a 
greater effect on females than on males. How- 
ever, the interaction, with df = 1/76, is not sig- 
nificant at the .05 level (F = 2.718, p < .10). 

The implication of the significant main effect 
of instructions is clear—the initial set or ori- 
entation of subjects plays a major role in induc- 
ing cooperative behavior in the prisoner’s di- 
lemma game. Unless specifically instructed not 
to do so, subjects are likely to perceive the 
situation as a game in which the purpose is to 
‘beat the other player, and it is plausible that 
this initial set accounts for the strong competi- 
tive tendency shown by subjects not only in this 
study but in previous studies as well. 

With regard to the effect of the Lag 1 condi- 
tion, a comparison of the data of Study II with 
that of Study I indicates that, for males at least, 
the Lag 1 condition markedly increased the tend- 


TABLE 2 


MEAN PROPORTION OF COOPERATIVE RESPONSES 
UNDER “Lac 1” CONDITION 


Instructions 
Regular Modified 
Males -483 -563 
Females .228 472 


Note.—Proportions based upon last block of 20 trials. 
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ency to cooperate. For the 12 males of Study I 
under the condition P(cp/cp) = .75, the mean 
proportion of cooperative choice was .13, and 
this is significantly different from the pooled 
mean of .52 for the 40 males of Study II (¢= 
3.660, p< .01). A similar comparison was made 
for female subjects (mean proportions of .29 
versus .35) but this difference was not signifi- 
cant. Thus, the effects of the Lag 1 condition 
affected males more than females. 

Probably the most plausible explanation of 
this result is that males simply maximize gain 
more than females. Under the conditions of 
Study I, competitive behayjor was optimal while 
under the Lag 1 condition in Study II, there was 
no way to take advantage of the other player 
since cooperative choice was the only way to 
maximize gain. Thus, reciprogating cooperative 
choice was optimal in Study II but was non- 
optimal in Study I, and it is plausible that males 
are not as likely to reciprocate unless it maxi- 
mizes gain. 

It should be noted that the Lag 1 condition 
employed in this study is analogous to, the “con- 
dition cooperative” strategy described by Solo- 
mon (1960). However, his procedure involved 
Sequential choice on each trial, the subject re- 
sponding first, and then the simulated partner. 
Over six trials in this condition, Solomon found 
that subjects were more likely to cooperate with 
a conditionally cooperative partner than with an 
unconditionally cooperative partner. Since sub- 
jects in the present study were required to make 
simultaneous choices and 80 trials were adminis- 
tered, the results are not directly comparable. 

Finally, the most interesting and unexpected 
result was obtained from postsession interviews 
with subjects. Some of them were under the im- 
pression that they were “cooperating” with each 


other although, as defined in this study, they had: 


not really cooperated. That is, under the modi- 
fied instructions, 5 of 20 males and 3 of 20 fe- 
males had implicitly believed that they were C0- 
operating with each other by alternating between 
competitive and cooperative responses, sacrificing 
1 point and receiving 5 points on alternate trials. 
This implicit cooperative strategy, however, 
clearly does not maximize joint expected value. 
_ The implication of this result is that conclu- 
sions about the degree to which individuals co- 
operate in the Prisoner’s Dilemma game, or any 
game, is a function of how cooperation is defined. 
Most investigators, including the present investi- 
gator, have defined cooperation as a response Ï 
which joint expected value is maximized. How- 
ever, if a subjective criterion of cooperative be- 
havior were to be employed, it is quite plausible 
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that cooperation may not be such an elusive 
phenomenon as implied by previous investigators. 
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with that position. (b) 


hypotheses were strongly supported. 


k A number of studies conducted during the past 

decade have demonstrated the effect of attitude 
on judgments of statements about a social ob- 
ject (e.g., Hovland & Sherif, 1952; Manis, 1960, 
1961; Weiss, 1959). Most of these studies have 
investigated judgments of the favorableness of 
statements about some social group Or social 
policy. The choice of favorableness as the dimen- 
sion along which statements are judged is related 
historically to attempts to construct attitude 
scales by the method of equal-appearing inter- 
vals (Hinckley, 1932; Thurstone & Chave, 1929). 
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EFFECT OF ATTITUDE ON JUDGMENTS OF PLAUSIBILITY * 


PATRICIA WALY ano STUART W. COOK ? 
New York University 


Ss were asked to rate the plausibility of 40 statements about segregation. Ss’ 
own attitudes were measured by a self-report inventory. Hypotheses: (a) A 
ausible or effective argument by Ss who 
agree with the position which the statement supports than by Ss who disagree 
Despite the difference in absolute rating of the argu- 
ments between judges with differing attitudes, they will agree on the relative 
plausibility of the various arguments supporting a given side of the issue. Both 


Our understanding of the influence of attitude on 
judgment would be broadened if studies were 
conducted on the judgment of attributes other 
than favorableness, Everyday experience sug- 
gests that motivated distortion of judgment is a 
very common occurrence. 

The work reported here deals with judgments 
of plausibility. We were led to the study by ob- 
servations made in the course of another study, 
a replication of a study by Jones and Kohler 
(1958) concerned with the effect of attitude on 
memory. To test their hypotheses these authors 
had constructed a set of six one-sentence argu- 
ments for segregation and another set for inte- 
gration. Half of each set were intended to be 
plausible, the other half, implausible. In the 
course of our pilot work with these statements 
we noted that what seemed plausible to one 
judge sometimes seemed implausible to another. 
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We suspected that the explanation for this was 
to be found in the judges’ own racial attitudes 
and decided to examine this systematically. 

Our hypotheses were as follows: 

al. A statement will be considered a more 
plausible or effective argument by subjects who 
agree with the position which the statement sup- 
ports than by subjects who disagree with that 
position. 

2. Despite the difference in absolute rating of 
the arguments between judges with differing 
attitudes, they will agree on the relative plausi- 
bility of the various arguments supporting a 
given side of the issue. > 


METHOD 
Materials 


Forty statements were devised, 20 of which were 
arguments for segregation and 20 for integration. 
Within each subset, 10 of the items were intended as 
poor or ineffective arguments for the side they sup- 
ported and 10 as good or more effective arguments. 
Inevitably within each such group of 10 items degree 
of effectiveness varied; hence, the result is not a 
dichotomy of effective and ineffective arguments but 
a range of values along a plausibility continuum. 

Examples of the statements of each type are given 
below: 


Ineffective prosegregation: Among the unfortu- 
nate consequences of racial integration is the 
mental deficiency which is often found in children 
of mixed marriages. 

Effective prosegregation; Immediate desegrega- 
tion damages the Negro by giving him responsi- 
bilities for which he does not have the necessary 
background and preparation. 

Ineffective prointegration: If integration is 
adopted race hatred will quickly disappear in 
Southern communities which are now torn apart 
by the issue. 

Effective prointegration: Segregation leads to a 
waste of our human resources by decreasing the 
scientific and professional training given to capa- 
ble Negroes, 


Procedure 


Each of the 40 statements was labeled “prosegre- 
gation” or “prointegration”; the total set was ar- 
ranged in random order. Under each statement was 
an 1l-point graphic rating scale ranging from very 
ineffective (—5) to very effective (+5). 

Subjects were told: 


We are interested in arguments for segregation 
and for integration to be used in the construction 
of a psychological test. We have collected 40 
arguments and need to find out now which of 
these will be the best ones to include in the final 
questionnaire. Therefore, we need to know how a 
large number of people, looking at them inde- 
pendently and objectively, view these statements. 
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Each of the 40 statements has previously b 
labeled according to whether it is an argum 
for segregation or for integration. Your task: is 
follows: Imagine that you are judging a di 
between two teams on the topic of segregation ) 
integration. Acting as an impartial judge, you 
to rate each argument presented by either side 
terms of its effectiveness. 


Subjects were then asked to note the scale w 
each statement and instructed, ‘ 


If you think this statement is a very poor 
argument for the side that it supports and | 
easily be torn down by the opposition, éircle 
— Š to indicate that you think this is one of 
weakest and most ineffective arguments that ¢ 
be given. On the other hand, if you think 
statement is one of the best arguments that 
possibly be advanced, circle the +5. If you 
it is somewhere in between, ecircle one of 
numbers from —4 to +4. 


Following the completion of the rating task 
jects were given a self-report attitude invent 
containing a 57-item scale on attitude toward 
groes. Twenty-five of these items dealt with s 
aspect of segregation or integration. Thee rem 
had to do with attributes of Negroes and person 
acceptance of them in a variety of community 
sitions and social relationships. 

Subjects were tested in groups. om 


Scoring 


The ratings were scored as follows: subjects’ 
ings on the prosegregation items were assigned sco 
from +5=0 to — 5 = 10; that is, the more ie, 
tive the argument was considered to be by the Sl 
ject, the higher his score. Scores for the prointeg 
tion items were reversed, that is, the more effect 
the argument was considered to be by the subj 
the higher his score. Thus, a high score for the to 
set of 40 items resulted from judging integrati 
items as effective and segregation items as inefft 
tive. Such a score, in terms of our basic hypothe: 
should be associated with an equalitarian attit 
or lack of prejudice, Similarly, the self-report mea 
ure was also scored so that a high score represente 
a more favorable attitude and a low score a more 
prejudiced one. 


Subjects 


In the course of the study there were sevel 
revisions of items, instructions, and rating scales. 
total of 431 subjects were tested during the variol 
stages of the study; they included students in cO 
leges and universities in the Northeast, Midy 
and border South. The final form of the judgm 
Measure was administered to 199 undergradi 
students, recruited partly from introductory 
chology courses in fulfillment of a course re 
Ment for research Participation, and partly 


Mary E. Collins and E. Haupt, personal 
munication. 


art 
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TABLE 1 TABLE 2 
Setr-Report ÅTTITUDE INVENTORY PLAUSIBILITY RATINGS 
= 
: Group M SD Reliability Group M SD Reliability 
» Midwestern university | 45.84 | 9.25 92 Midwestern university | 254.77 | 33.92 86 
Border-South college 30.66 | 13.15 95 Border-South college 199.68 | 65.94 92 
Total 39.17 | 13.45 .96 Total 230.56 | 57.45 .93 


Note.—Possible range: 0-57. 


college dormitories. The latter were paid a small fee 
for taking past. Eighty-eight of these subjects were 
attending a private college in the border South; 111 
attended a Midwesterm university. The results for 
these 199 subjects with whom the final form of the 
judgment measure was used are reported here; the 
»findings in the earlier stages of the study were 
quite consistent with those being reported. Since 
the score ranges of the Midwest and border-South 
groups did not entirely overlap on either the attitude 
inventory or the judgment measure, the results for 
the two groups are presented separately as well as 
together. 


* RESULTS 


Table 1 presents the mean, standard deviation, 
and reliability * on the self-report attitude in- 
ventory for the total sample of 199 subjects and 
the two subgroups of which it was made up. As 
was anticipated, the Midwestern university 
yielded a skewed distribution of scores with a 
concentration at the upper (unprejudiced) end. 
The border-South college sample had a mean 
near the center of the possible range of scores 
and a larger standard deviation. The reliability 
(internal consistency) was high in both samples. 

Table 2 presents the comparable findings on 
the plausibility rating scale. The results here 
were similar to those on the self-report inven- 
tory; the Midwestern university mean was higher 
“than that of the border-South college, the stand- 
ard deviation of the latter was larger, and the 
reliability was high in both samples, though 
slightly lower in the Midwestern sample than 
in the Southern. The distribution of scores in 
the Midwestern sample was, however, more 
nearly normal than that of the self-report in- 


All reliabilities were computed by the Tryon 


' variance formula for internal consistency. 


5A copy of this rating scale giving instructions to 
the subject and the items has been deposited with 
the American Documentation Institute. Order Docu- 
ment No. 8499 from ADI Auxiliary Publications 
Project, Photoduplication Service, Library of Con- 
gress, Washington, D. C. 20540. Remit in advance 
$1.25 for microfilm or $1.25 for photocopies and 
make checks payable to: Chief, Photoduplication 
Service, Library of Congress. 


Note.—Possible range: 0-400, 


ventory scores, and there was no concentration 
at the upper end of the range of scores as was 
found on the attitude inventory. 


Hypothesis 1 


To test the hypothesis that a statement will 
be considered a more plausible or effective argu- 
ment by subjects who agree with the position it 
supports than by those who disagree with that 
position, the correlations of the self-report scores 
and the scores based on the plausibility ratings 
were computed. They were .64 at the Midwestern 
university, .88 at the border-South college, and 
.84 for the total sample; all of these correlations 
are significant beyond the .001 level, Thus, the 
hypothesis was strongly supported, 

As a further test of the effect of attitude on 
judgment, the ratings of the plausibility of the 
20 prosegregation and 20 prointegration state- 
ments separately by the two extreme groups, as 
measured by the attitude inventory, were com- 
pared. Only the extreme eighths were used, 
rather than the more conventional extreme quar- 
ters, since the skew of the distribution in the 
prejudiced direction would have made the pro- 
segregation group much less clearly extreme than 
the prointegration group if extreme quarters of 
the distribution were used. Table 3 presents the 
mean ratings by each group of the segregation 
and integration items, As predicted, the pro- 
segregation subjects gave the prosegregation 
arguments a mean rating on the “effective” end 


TABLE 3 


MEAN PLAUSIBILITY RATINGS OF PROSEGREGATION 
AND PROINTEGRATION ITEMS BY 
EXTREME GROUPS 


Items 
Subject 
Prosegregation | Prointegration 
Prosegregation +1.78 —2.01 
(N = 26) 
Prointegration —2.80 +1.60 
(N = 26) 


—— 
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TABLE 4 
CORRELATIONS OF Item SUBSETS WITH SELF-REPORT INVENTORY 
i Plausibl Implausibl Plausible W 

9 Group e segregation integration integration Total 
Midwestern university i53. 49 24 31 64 
Border-South college .16 .70 65 94 88 

Total -76 -16 .49 .88 84 


of the scale and the prointegration items a mean 
rating on the “ineffective” end of the scale, and 
the prointegration subjects did just the opposite. 

In preliminary work, minor but inconsistent 
differences in the correlations between the atti- 
tude inventory and the four subsets of 10 items 
each had been found from sample to sample. 
There was a tendency for the implausible inte- 
gration subset to yield somewhat lower correla- 
tion coefficients than the other three subsets. 
Our attempts to increase this correlation by 
item revision within the subset were not entirely 
successful, However, these items were retained 
since they made some contribution to the reli- 
ability of the total set and to the correlation of 
the total score with the attitude inventory. The 
correlations for the sample being reported here 
are presented in Table 4. 


Hypothesis 2 


The prediction here was that, despite the an- 
ticipated effect of own attitude on absolute rat- 
ings of item plausibility, prosegregation and pro- 
integration judges would show high agreement on 
the relative plausibility of the various arguments 
supporting a given side of the issue, The most 
severe test of this hypothesis is to examine the 
correspondence in the ordering of the items by 
the extreme attitudinal groups. Again the extreme 
eighths of the total sample were used; correla- 
tions between their mean plausibility ratings of 
the 20 segregation items and the 20 integration 
items were computed. These correlations were 
-73 for the segregation items, .81 for the inte- 
gration items. Thus the hypothesis was strongly 
supported; there was considerable agreement 
between the extreme attitudinal groups in their 
ordering of the effectiveness of the arguments 
for a given side of the issue. 


Discussion 


The results obtained with plausibility ratings 
seem promising enough to suggest that they may 
serve as a disguised measure of attitude. The 
need for such measures arises from two consid- 
erations. The first is that self-report measures 
are susceptible to distortion from considerations 


of social desirability. This is especially true for 
controversial social attitudes where the subject 
has reason to believe that there is a proper answer 
or that a particular answer, is viewed as proper 
by the experimenter. The second consideration 
is that self-report measures are of limited value 
in attitude-change experiments. Such experiments 
often require repeated measurement of the same 
subjects. This gives rise to the danger that sub- 
jects will realize the connection between the 
experimental manipulation and the attitude 
measure and, consequently, will cooperate with 
(or resist, as the case may be) the experimenter 
when giving their postexperimental responses to 
self-report measures. 

One of the characteristics we might anticipate 
in a satisfactory disguised measure is that its 
correlation with a self-report measure would’ be 
higher for a sample of subjects who are less sub- 
ject to social desirability influences with respect 
to the attitude in question than for a sample 
more subject to such influences, since one would 
expect that the self-report measure would be a 
truer indicator of the subjects’ own positions in 
the former sample than in the latter. As applied 
to attitudes toward Negroes, this would mean a 
higher correlation in a sample of Southern sub- 
jects than in a non-Southern one, since social 
norms toward equalitarian views on race rela- 
tions would be stronger in the latter case. Our 
results are in accord with this expectation; the 
correlation between the plausibility and self- 
report scores is .88 in the Southern sample as 
against .64 in the non-Southern. A ¢ test shows 
this difference to be significant (p< 0001). 
While the limit set by the test reliabilities is 
somewhat lower for the latter correlation than 
for the former, this would account for only a 
small part of the observed difference. While these 
results are consistent with the possibility that the 
plausibility ratings are less subject to social de- 
sirability influences than are the responses to 
the self-report inventory, further investigation 15, 
of course, needed to demonstrate the validity of 
this interpretation. 

As a disguised attitude measure, plausibility 
ratings should have wide applicability. It should 


: 
a 


a 


F 
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prove possible to construct the necessary argu- 
ments to be rated for opposite sides of any issue 
on which varying attitudes may be anticipated. 
Thus the technique is not limited to the measure- 
ment of attitude toward minority groups. 
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© EFFECT OF CERTAINTY ABOUT A DECISION UPON 
POSTDECISION EXPOSURE TO CONSONANT AND 
DISSONANT INFORMATION * 


JUDSON MILLS? 


University of Missouri 


The effect of certainty about the correctness of a choice upon postdecision 
interest in consonant and dissonant information was investigated experimentally. 
College women ranked different products according to desirability, made a 
choice between 2 of them and then rated their interest in reading ads for each 
of the products. Certainty was manipulated by allowing some to choose be- 
tween products they had ranked about equal in desirability and others between 
products they had ranked very differently. As expected, interest in ads for the 
chosen product (consonant information) was greater when certainty was low 
than when it was high. Contrary to expectation, interest in ads for the rejected 


about the choice. 


This study is concerned with the effect of 
certainty about the correctness of a decision on 
Postdecision exposure to consonant and dissonant 
information, A number of studies stimulated by 
Festinger’s (1957) theory of cognitive dissonance 
(Adams, 1961; Canon, 1964; Ehrlich, Guttman, 


' Schönbach, & Mills, 1957; Mills, Aronson, & 


Robinson, 1959; Rosen, 1961) have found that 
following a choice, people tend to prefer infor- 
mation favoring the chosen alternative (conso- 
nant information) to information favoring re- 
jected alternatives (dissonant information). A 

1 This study was supported by a grant from the 


National Science Foundation. 
* Formerly at Syracuse University. 


product (dissonant information) was not affected by amount of certainty 


recent study (Mills; 1965) measured interest 
in the two kinds of information separately and 
found evidence both of seeking of consonant 
information and avoidance of dissonant informa- 
tion. 

According to dissonance theory, seeking of con- 
sonant information and avoidance of dissonant 
information should be a function of the amount 
of dissonance that exists. However, the evidence 
concerning the effect of amount of dissonance or 
certainty about a decision on selective exposure 
to information is rather meager. Brodbeck (1956) 
found that people who were uncertain about 
their position on an opinion issue after hearing a 
speech opposing it were more interested in com- 
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municating with someone who agreed with their 
position than persons who were certain about 
their position. Mills and Ross (1964) found that 
when persons were highly committed to a posi- 
tidn on an opinion issue, the less certain they 
were their side was the best one, the more they 
preferred information supporting their side to 
information supporting the other side. Both 
these findings are based on correlational data and 
are, therefore, particularly subject to alternative 
interpretations. In both studies, the measure of 
exposure to information involved relative interest 
in consonant versus dissonant information. 

In the present study, certainty about a» deci- 
sion was manipulated experimentally and interest 
in consonant and dissonant information following 
commitment were measured independently of 
each other. It was expected that interest in con- 
sonant information would be greater, the lower 
the certainty about the choice, and that interest 
in dissonant information would be less, the lower 
the certainty. In addition, an attempt was made 
to manipulate the importance of the choice in 
order to test the effect of this variable on ex- 
posure to consonant and dissonant information. 


METHOD 


The basic procedure was similar to that used in 
a study by Mills (1965). Under the guise of 
market research, college women ranked 20 products 
according to desirability, made a choice between 2 
of them and then rated their interest in reading 
advertisements for each of the 20 products, There 
were four experimental conditions: low certainty- 
high importance, low certainty-low importance, high 
certainty-high importance, high certainty-low im- 
portance. In the low-certainty conditions, subjects 
were given a choice between the products they had 
ranked second and third in desirability; in the high- 
certainty conditions, between the products they had 
ranked second and nineteenth. In the high-impor- 
tance conditions, the subjects were told that how 
one chooses is indicative of fundamental personality 
characteristics; this was not mentioned in the low- 
importance conditions. 

The subjects were 80 women attending a university 
summer session. They were interviewed in their 
dormitory rooms. After introducing herself and elicit- 
ing the subject’s cooperation, the experimenter 8 
explained that she was trying to find out how 
Popular some new products for personal care will be. 
The subject was asked to read cards describing 20 
products and to rank them “according to how much 
you personally like them.” The products were: hand 
cream, cleansing cream, deodorant, creme hair rinse, 
hair spray, toothpaste, wave-set lotion, shampoo, 
face soap, mouthwash, talcum powder, liquid eye- 


3 Jan Leask deserves thanks for her assistance in 
collecting data. 
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liner, chap stick, mascara, bubble bath, suntan lotion, | 
liquid nail formula, first-aid cream, nail-polish re- 
mover, face powder. The following is an example of 
one of the descriptions: 


Hand cream: A satiny smooth lotion which is not 
heavy and sticky but light and fresh. This hand “ 
cream leaves hands soft and moisturized without 
oiliness. j 


After the ranking of desirability was completed, 
the experimenter mentioned that for helping with 
the study everyone receives a free gift. The subject 
was told she would be given a choice between two 
of the products. Once she chose it would be impos- 
sible'to change. She was asked to write her choice 
on a card and sign her name. As the experimenter \ 
pretended fo look up the two products between 
which the subject could choose, she mentioned the 
following to subjects in the high-importance condi- 
tions: A 


You know everytime I give people a choice like 
this it reminds me of a study that I read in a 
marketing journal a couple of weeks ago. It seems 
that a very strong relationship was found between — 
how one chooses between products like this and ~ 
one’s fundamental personality charactéristics like 
artistic ability, creative judgment, academic suc- 
cess and social success, As I understand it it’s 
quite certain that one’s choice is indicative of 
these other things. I thought that was quite 
interesting. It’s something I’d never considered. 


No mention of this was made to subjects in the , 
low-importance conditions. Subjects in the low- | 
certainty conditions were given a choice between the 
products the subject had ranked second and third 
in desirability; subjects in the high-certainty condi- 
tions, a choice between the products ranked second 
and nineteenth. An equal number of subjects were 
randomly assigned to each of the four experimental 
conditions. 

Following the subject’s choice, the experimenter 
went on to say that she was also trying to find 
out just how much interest people will have in seeing—- 
ads for the different products so that plans could 
be made for how much each of them should be 
advertised. The subject was given a rating sheet 
and asked to indicate her interest in reading ads 
about each of the 20 products by circling a number 
from O (not at all interested) to 20 (extremely | 
interested). 

When the ratings of interest were completed, the 
subject was asked to indicate how certain she was , 
that the product she chose would be the best of 
the two by picking a number from 0 (not at 
certain) to 20 (extremely certain). She was also 
asked to indicate how important the choice seem 
to her at the time on a scale from 0 (not at & 
important) to 20 (very important). The experi- 
menter then mentioned to the subject that there 
was something more to the study than she hat 
been told and asked if she had any idea what it 
was. One person’s reply indicated suspicion of the 
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instructions; she was not included as a subject. 
Finally the experimenter revealed the deceptions 
and got the subject to promise not to discuss the 
experiment with anyone. 


RESULTS 


Before presenting the results for interest in 
consonant and dissonant information it is ap- 
propriate to consider the evidence concerning 
the success of the attempts to manipulate cer- 
tainty and importance. The mean rating of 
certainty for the two low-certainty conditions 
combined? was’ 14.3; the mean for the two high- 
certainty conditions combined was 17.8. The dif- 
ference between the means’was significant beyond 
the .01 level. The ratings of certainty provide 
independent evidence that the experimental ma- 
nipulation of certainty was successful. 

The mean rating of importance for the two 
high-importance conditions combined was 12.1; 
the mean for the two low-importance conditions 
combined was 13.2. The mean rating of impor- 
tance for the high-importance conditions was 
actually lower than the mean for the low- 
importance conditions, but the difference did not 
approach significance. The evidence from the 
ratings of importance indicates that the attempt 
to manipulate importance was not successful. 

As in the study by Mills (1965), a strong 
relationship was found between the initial rank- 


. ings of the desirability of the various products 


and the ratings of interest in reading ads about 
them. The mean rating of interest over all 80 
subjects was 16.1 for whichever product the sub- 
ject ranked first, 13.0 for whichever product was 
ranked fifth, 11.8 for whichever was ranked 
tenth, 7.9 for whichever was ranked fifteenth, 
and 4.0 for whichever was twentieth. The average 
correlation between the rankings of desirability 
and the ratings of interest for the 18 products 


+ hot involved in the choice was —.62 (N = 80).° 


Seventy-three of the 80 separate correlations 
Were negative. 

A In order to eliminate differences due to the 
initial ranking of tke products, the ratings of 
interest.in ads about the chosen product (con- 
sonant information) and ads about the rejected 
product (dissonant information) were adjusted 


| by means of the same procedure used in the 


earlier study (Mills, 1965). The linear regres- 
sion of the ratings on the rankings for the 18 
Products not involved in the choice was calcu- 
lated separately for each subject. On the basis 
Of the regression equations, predicted ratings 
were calculated for each subject for the rankings 


sAn significance tests by analysis of variance. 
Determined with the use of the z transformation. 
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TABLE 1 


MEANS OF ADJUSTED INTEREST IN CONSONANT | 
AND DISSONANT INFORMATION FOR THE 
EXPERIMENTAL CONDITIONS $ 


Adjusted interest `“ 

Ceunty and N 

opoe, Consonant Dissonant 

information information 

Low Cert 

High Imp. 20 +4.37 

Low Imp. 20 + .45 

Both 40 Tor 
High Cert. 

High Imp. 20 —1.81 

Lôw Imp. 20 — .42 

Both 40 —=1.11 


of the chosen and rejected products. The ad- 
justed ratings were determined by subtracting 
the predicted ratings from the actual ratings. 

Means of the adjusted ratings of interest in 
ads for the chosen product (consonant informa- 
tion) and ads about the rejected product (dis- 
sonant information) for the different experi- 
mental conditions are presented in Table 1. It 
can be seen from Table 1 that, as expected, 
interest in consonant information was greater in 
the low-certainty conditions than in the high- 
certainty conditions. The difference between the 
low- and high-certainty conditions was significant 
at the .05 level. The other differences for inter- 
est in consonant information did not approach 
significance. 

Further examination of Table 1 reveals that 
interest in dissonant information was not lower 
in the low-certainty conditions than in the high- 
certainty conditions, as was expected. The mean 
for the two low-certainty conditions combined 
(—.72) was slightly higher than the mean for 
the two high-certainty conditions combined 
(—.80). None of the differences for interest in 
dissonant information approached significance. 

It should be mentioned that a question can 
be raised about the adjustment of the ratings 
of interest for those subjects who chose the 
alternative which was initially ranked lower. 
However, the results would not be affected if 
those subjects were excluded. Considering just 
the subjects who chose the higher ranked alterna- 
tive, the mean adjusted interest in consonant 
information was +1.73 for those in the low- 


6 The lower ranked alternative was chosen by 21 
of the 80 subjects, 7 in the low certainty-high im- 
portance condition, 10 in the low certainty-low im- 
portance condition, and 2 each in the high certainty- 
high importance and high certainty-low importance 
conditions, 
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certainty conditions (V=23) and —1.70 for 
those in the high-certainty conditions (N = 36). 
The mean adjusted interest in dissonant informa- 
tion was —.83 for those in the low-certainty 
conditions (W = 23) and also —.83 for those in 
the high-certainty conditions (N = 36). 


Discussion 


The results provide experimental evidence 
which has previously been lacking that interest 
in consonant information following a choice is 
greater, the less the certainty about the correct- 
ness of the choice. Such a relationship would 
be expected both from dissonance theory and 
from a theory based on the assumption that 
people want to feel certain when they take an 
action that it is better than the alternatives 
(Mills & Ross, 1964). In this instance, the two 
theories lead to the same prediction but this 
is not always the case, for example, the latter 
makes predictions about behavior prior to a 
choice not made by the former, 

In contrast to the positive results for interest 
in consonant information, interest in dissonant 
information did not differ in the low- and high- 
certainty conditions. The results indicate that 
avoidance of dissonant information is not deter- 
mined by amount of certainty or dissonance as 
had previously been assumed (Festinger, 1957). 
The negative results for interest in dissonant 
information cannot be explained away by as- 
suming that the manipulation of certainty was 
unsuccessful or that the measure of interest in 
dissonant information was inadequate. The dif- 
ference between the certainty conditions for 
interest in consonant information together with 
the evidence from the ratings of certainty demon- 
strates that certainty was successfully manipu- 
lated. Since the procedure used to measure inter- 
est in dissonant information was identical to that 
for the measure of interest in consonant informa- 
tion which yielded positive results, it is rather 
difficult to argue that interest in dissonant 
information was not adequately measured. 

It should be mentioned that the failure to 
find differences in avoidance of dissonant in- 
formation is quite consistent with previous re- 
search. In their comprehensive survey of the 


BRIEF ARTICLES 


evidence concerning dissonance theory, Brehm 
and Cohen (1962) noted that previous studies 


all failed to find evidence confirming the derivation 
that dissonant information will be avoided in propor- 
tion to the amount of dissonance produced [p. 93]. 


Avoidance of dissonant information appears to 
be unrelated to the amount of dissonance that — 
exists. 

The lack of differences between the impor- 
tance conditions for interest in consonant and 
dissonant information can be attributed to the 
failure to manipulate importance indicated by 
the,evidence from the ratings of importance, For — 
this reason it is not possible to draw any con- 
clusions about the effect of importance of a 
decision on selective exposure to information 
following the choice. a 


REFERENCES 


Avams, J. S. Reduction of cognitive dissonance by 
seeking consonant information. Journal of Abnor- 
mal and Social Psychology, 1961, 62, 74-78. ` 

Breum, J. W, & Conen, A. R. Explorations in 
cognitive dissonance. New York: Wiléy, 1962. ‘ 

Bropseck, May. The role of small groups in medi- 
ating the effects of propaganda. Journal of Abnor- — 
mal and Social Psychology, 1956, 52, 166-170. 

Canon, L. C. Self confidence and selective exposure 
to information. In L. Festinger, Conflict, decision 
and dissonance. Stanford: Stanford Univer. Press, 
1964. Pp. 85-95. 

Enrica, Danuta, Gurrman, I., SCHÖNBACH, P. & 
Mutts, J. Postdecision exposure to relevant in- 
formation. Journal of Abnormal and Social Psy- 
chology, 1957, 54, 98-102. 

Festincer, L. A theory of cognitive dissonance. 
Stanford: Stanford Univer. Press, 1957. 

Mus, J. Avoidance of dissonant information. Jour- 
nal of Personality and Social Psychology, 1965, 
2, 589-593. of 

Muss, J., Aronson, E., & Rosmson, H. Selectivity | 
in exposure to information. Journal of Abnormal 
and Social Psychology, 1959, 59, 250-253. 

Muis, J., & Ross, A. Effects of commitment and 
certainty upon interest in supporting information. 
Journal of Abnormal and Social Psychology; 
1964, 68, 552-555. AN 

Rosey, S. Postdecision affinity for incompatible 
information. Journal of Abnormal and So 
Psychology, 1961, 63, 188-190. 


(Received February 11, 1965) 


Journal of Personality and Social Psychology 
1965, Vol. 2, No. 5, 753-756 


REPLICABILITY OF TWO GSR INDICES AS A FUNCTION OF 
STRESS AND COGNITIVE ACTIVITY * 


LYLE H. MILLER anp BARRY M. SHMAVONIAN 


Duke University 


A replication of an experiment by 


Katkin in which 2 GSR indices, non- 


specific activity and basal skin resistance, were found to vary independently 
as a function of experimental conditions was conducted. An attempt to ex- 
plicate the nature of the psychological dimensions underlying the phenomena 
was made by adding a noninterview control group to Katkin’s design. The 4 
® groups in the 2 X 2 factorial design were: stress/interview, nonstress/interview, 
stress/noninterview, and nonstress/noninterview. The divergence of GSR non- 
specifics And basal skin resistance previously feported by Katkin was repro- 


duced for the stress groups. There 


were no significant differences between 


interview and noninterview groups. It was concluded that the changes fol- 


. 

The concept of a two-component GSR has 
been extant in the literature for some time. 
Richter (1929), Darrow (1933), and, more 
recently, Edelberg and Wright (1962) have em- 
ployed simultaneous palm-palm and dorsum- 
dorsum efèctrode placement techniques to effec- 
tively demonstrate the bi-dimensional nature of 
the exosomatic GSR. Wilcott (1959), utilizing 
a single pair of electrodes, has demonstrated bi- 
dimensional features in the endosomatic GSR. 

_ The stimulus specificity of these two dimen- 
sions has been amply demonstrated and various 
arguments have been forwarded in an attempt 
to explicate their functional significance. Darrow 
(1933) argues for a perceptual-motor explana- 
tion, whereas Richter (1929) takes a more 
physiologically oriented position and relates both 
components to temperature regulation. Edelberg 
and Wright (1962) speculate that a perceptual 
component resides in the stratum lucidum of 
the epidermis and that a defensive reaction 
‘relating to anxiety and apprehension is related 
to sweat-gland activity. Wilcott (1959), however, 
offers no speculation regarding the functional 
significance of the two endosomatic components 
he reports. 
_ The GSR findings reported by Katkin (1965) 
indicate not only the possibility of discriminat- 
ing two exosomatic GSR components from a 
single pair of electrodes but also that of ascrib- 
ing some psychological significance to them. A 
cursory inspection of Katkin’s results suggests 
that the two GSR components discriminated, 
number of nonspecifics and basal conductance 
levels, relate somewhat to the cognitive and/or 
1This investigation was supported in part by 
Grants 5-F1-MH-24, 259-02, and MH 06022-03 from 
the National Institute of Mental Health, United 
States Public Health Service. 


lowing the interview could not be attributed solely to the interview. 


affective involvement of the subjects across ex- 
perimental conditions. Closer inspection of the 
data, however, reveals that such a conclusion is, 
at best, premature. The increase in nonspecific 
activity with a concomitant increase in basal 
conductance levels is a predictable result. The 
dramatic discordance of these two measures 
following the interview condition, however, is 
not a predictable result and could be an artifact 
of cigarette smoking during the interview or gross 
postural adjustments accompanying the shift 
from a reclining to a sitting position. 

Two experiments were conducted in the Psy- 
chophysiology Laboratory at Duke University in 
an attempt to replicate Katkin’s general findings 
while controlling for these two potential sources 
of artifact, Experiment I was aimed at estab- 
lishing the effect of cigarette smoking on GSR 
basal conductance and number of nonspecifics, 
Experiment II was a replication of Katkin’s 
experiment with the inclusion of a noninterview 
group as a control for the effects of gross pos- 
tural adjustments. 

EXPERIMENT I 

During Katkin’s interview condition, his sub- 
jects were permitted to smoke cigarettes. Since 
there is little to be found in the literature per- 
taining to the effect of smoking on GSR, an ex- 
periment relating GSR to cigarette smoking was 
conducted. 


Apparatus 

A sound attenuated experimental chamber, Model 
201 Biophysical GSR amplifier, respirometer, and 
Grass Model 5 polygraph were used. Electrodes 
were of the AG-AGCL type. A monopolar place- 
ment was utilized with a current density of 10 
microamps per square centimeter at the active elec- 
trode. 
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Procedure and Subjects 


Ten male subjects, all cigarette smokers, were 
drawn from the introductory psychology classes at 
Duke University. Upon arrival at the Psychophysi- 
ology Lab they were asked to wash their hands 
and were then conducted into the experimental 
chamber where they were seated in a comfortable 
reclining chair. All subjects had been requested not 
to smoke for at least 2 hours prior to the experi- 
ment. In order to insure a common initial level, 
however, subjects rested for 1 hour in the chamber 
before the start of the experiment. At the end of 
the rest hour, the GSR electrodes and the respirom- 
eter were attached to the subject and the following 
instructions were given: 


The purpose of this experiment is to examine 
certain physiological variables while you rest. I 
am interested particularly in the electrical resist- 
ance of your skin and your respiration. These 
[at this point the experimenter pointed to the 
GSR electrodes] measure the electrical resistance 
of your skin. This [at this point the experimenter 
pointed to the respirometer] is to measure your 
breathing. 

There will be three parts to this experiment, 
each part lasting ten minutes. During the first 
part of the experiment, you are simply to relax 
without going to sleep. The beginning of the sec- 
ond period will be marked by an increase in the 
brightness of the chamber lights. When the lights 
brighten, I would like you take the cigarette 
from the chair on your right, light it up with 
the lighter on the chair and read pages 278 and 
279 of the book that is lying on the chair [Maier, 
1955]. When you have finished your cigarette, 
put it out in the ash tray on the chair to your 
right. The beginning of the third period will be 
marked by a dimming of the chamber lights. If 
you have not finished your cigarette by this time 
Please extinguish it in the ash tray, replace the 
book on the chair, and lie back and relax just 
as you did in the first period. 

Do you have any questions? 


MEAN NUMBER NON-SPECIFICS 
COLLAPSED OVER STRESS GROUPS 


4—A Interview 
A="A Non- Interview 


NO. OF NON-SPECIFICS 
A 
[o] 
[e] 


REST STRESS! REST 


Fic. 1. Mean number of nonspecifics for interview 
groups collapsed over stress groups. 
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MEAN DR VALUES COLLAPSED OVER 
STRESS GROUPS 


REST 


Fic, 2. Mean deviation ratio values for interview 
groups collapsed over stress groups. 


STRESS" REST 


Are the instructions quite clear? 

I am going to leave the chamber now. If there 
is any reason to communicate, we can do so over 
the intercom. The lights will dim shortly after 
I leave the chamber. I will return at the end of 
the third period. 


Originally a nonsmoking control group was in- 
cluded in this design making some rough equation 
of the bodily movement involved in taking the 
cigarette from the chair, lighting it, etc., necessary. 
It was hoped that reaching for and reading ‘the 
book would provide some sort of rough equation, 
Data were reduced according to Katkin’s criteria 
and subjected to a £ test for differences. 


Results 


No differences were found in the number of 
nonspecific or basal conductance levels between 
the pre- and postmoking periods. Since no dif- 
ferences were found, it was concluded that smok- 
ing has little or no effect on GSR nonspecifics 
or basal conductance levels and the nonsmoking 
control group was not included in the experiment. 


EXPERIMENT II 


This experiment consisted of a 2 X 2 factorial 
design in which the factors were stress and 
interview conditions. Since Katkin did not in- 
clude a noninterview group in his experiment, 
it is difficult to ascertain whether the dramatic 
shift in basal conductance levels was due to the 
cognitive involvement of the subject during the 
interview period or to the gross postural ad- 
justments accompanying the shift from a re- 
clining to a sitting position. Therefore, a non- 
interview control group was added to what was 
otherwise a precise replication of Katkin’s ex 
periment. 
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Procedure and Subjects 


Subjects were 20 male undergraduates at Duke 
University drawn from the introductory psychology 
classes and randomly assigned to one of the four 
experimental groups: stress-interview, nonstress- 
interview, stress-noninterview, nonstress-noninter- 
view. Upon arrival at the Psychophysiology Lab, 
the subjects were asked to wash their hands and 
then led into the experimental chamber and seated 
in a comfortable reclining chair. 

Experimental conditions for the stress, nonstress, 
and interview groups were as described by Katkin. 
Conditiogs far the noninterview groups were the 
same as for the interview groups except that non- 
interview subjects wene engaged in an innotuous 
conversation during the interview period rather than 
given the intensive interview described by Katkin. 
«Time spent in the chamber by the experimenter 
was the same for all groups. 

Data were reduced according to Katkin’s criteria 
and both number of nonspecifics and deviation ratios 
were subjected to a repeated measures analysis of 
variance as described by Winer (1962). 


Results 


Signifi€int effects were found for Conditions 
and for the Stress X Conditions interaction for 
both nonspecifics and deviation ratios. Statistical 
analysis of the deviation ratios produced Fo go 
values of 11.01 for Conditions and 13.99 for 
the Stress X Conditions interaction which were 
significant at the .01 level. The analysis of non- 
specifics yielded an Fy 39 value of 4.02 for Con- 
ditions which was significant at the .05 level and 
an Fe39 of 5.57 for the Stress X Conditions 
interaction which was significant at the .01 
level. This indicates that stress, threat of elec- 
tric shock, is a significant stimulant to GSR 
activity with regard to both number of non- 
Specifics and basal conductance values. The 


MEAN NUMBER OF NON-SPECIFICS 
COLLAPSED OVER INTERVIEW GROUPS 


ÆA Stress 
A- -ANon-Stress 


NO. OF NON-SPECIFICS 
A 
a 
ce} 


REST STRESS REST 
Fic, 3. Mean number of nonspecifies for stress groups 
collapsed over interview groups. 
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MEAN DR VALUES COLLAPSED OVER 
INTERVIEW GROUPS 


DEVIATION RATIOS 


REST “STRESS © REST 


Fic. 4. Mean deviation ratio values for stress groups 
collapsed over interview groups. 


interview conditions, however, failed to demon- 
strate a significant effect on either GSR measure. 

Figure 1 shows the mean number of non- 
specifics across conditions for the interview and 
noninterview groups collapsed over stress. Fig- 
ure 2 shows the mean deviation ratio values for 
the interview and noninterview groups collapsed 
over stress. 

It is to be noted that nonspecifics increased 
markedly during the stress period and decreased 
after the interview condition, whereas deviation 
ratios showed an increase from initial levels 
during the stress condition with an even greater 
increase following the interview condition. 

Figure 3 shows the mean number of non- 
specifics for the stress and nonstress groups col- 
lapsed over interview conditions. Figure 4 shows 
the mean deviation ratio values for the stress 
and nonstress groups collapsed over interview 
conditions. 

The elevation of both number of nonspecifics 
and deviation ratios during the stress period is 
marked and contrasts with the discordance of 
the two measures following the interview con- 
dition. 

DISCUSSION 


The general replicability of Katkin’s findings 
has been demonstrated in this experiment and 
the argument for the discrimination of two 
separate components of the exosomatic GSR 
from a single set of electrodes has gained added 
weight. Although the evidence is quite strong that 
these two components, which usually act in 
synergy, can be dissociated under appropriate 
conditions, the nature of the psychological phe- 
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nomena underlying these components has yet to 
be defined. 

Data from Experiment I indicate that ciga- 
rette smoking has no effect on either number 
of, nonspecifics or basal conductance levels. 
Therefore, it was concluded to be an inadequate 
explanation of Katkin’s results. 

Data from Experiment II indicate that dur- 
ing stressful situations the number of nonspecif- 
ics and basal conductance levels increase in 
synergistic fashion. After a stress-reducing inter- 
view or an innocuous conversation with the ex- 
perimenter, however, they tend to dissociate with 
basal conductance values remaining at, or climb- 
ing to, relatively high levels and the number of 
nonspecifics decreasing. 

Within the context of this experiment it is 
impossible to determine whether the increase in 
basal conductance levels subsequent to the inter- 
view condition is a function of cognitive activity, 
postural adjustments, or both. The only reason- 
able control for the interview group, the innocu- 
ous conversation group, in Katkin’s paradigm is 
obviously inadequate. No matter how innocuous 
the conversation, there is bound to be considera- 
ble cognitive activity involved which may be 
reflected in basal conductance levels. On the 
basis of the data presented, the increase in basal 
conductance levels following the interview situa- 
tion cannot be attributed solely to the cognitive 
activity elicited by the interview, 
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Perhaps experiments designed for the express 
purpose of determining the nature of the phe- 
nomena underlying these two GSR components 
will provide clearer definitions, Such experiments 
should involve rigorous control of stress and cog- 
nitive variables and should eliminate as much “ 
artifact-producing movement as possible. 
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SOCIAL DESIRABILITY CORRELATES OF SOCIAL RESPONSIBILITY 


LEROY A. STONE 
Mental Health Research Institute, Fort Steilacoom, Washington 


Attempts to measure social responsibility by 
means of objective personality measures are not 
new, Gough, McClosky, and Meehl (1952) de- 
veloped such a scale for adolescents, They de- 
fined the responsible person as “one who shows a 
ready willingness to accept the consequences of 
his own behavior, dependability, trustworthiness, 
and a sense of obligation to the group [p. 74].” 
A few years later, Harris ( 1957) developed an- 
other social responsibility scale (for children) 
which correlated, on the order of .40 with the 
Gough et al. scale. One of the major differences 
between the two scales, according to Harris, was 
that his scale contained a greater proportion of 
items involving social and societal obligations. 
The Harris scale was recently revised by Berko- 


witz and Daniels (1964). The items used in the 
Berkowitz-Daniels Social Responsibility scale 
(SRS) do not appear to be at all subtle and if a 
person (who was answering the items) -wished 
to present a favorable picture of himself, he 
would have no difficulty selecting his answers. y 

The purpose of the present study was to in- 
vestigate possible social desirability correlates of 
the Berkowitz-Daniels SRS. 


METHOD 


The Berkowitz-Daniels SRS was added to the 
battery of tests administered to newly admitted 
patients at Western State Hospital. This battery 
already included the Marlowe-Crowne Social Desira- 
bility scale (Crowne & Marlowe, 1960) and the 


A 


A 


= - 


Brier ARTICLES 757 


MMPI. The MMPI K scale is described by Meehl 
and Hathaway (1956) as a measure of “plus-getting.” 
The Edwards Social Desirability scale is based on 39 
items obtained from the MMPI L, F, and K scales 
and from the Taylor Manifest Anxiety scale. These 
instruments were administered to 49 recently ad- 
mitted patients. 


RESULTS 


Product-moment correlations were computed 
between the Berkowitz-Daniels SRS and the Ed- 
wards (1957) Social Desirability scale, the K 
scale, and the Marlowe-Crowne Social Desira- 
bility &ale The correlation between the SRS 
and the Marlowe-Growne Social Desirability 
scale was the highest (r= .65, p < .001). The 
next highest relationship was with the Edwards 


ə Social Desirability scale (r = .53, p < .001). The 


relationship with, the K scale was lower, how- 
ever, it still was statistically significant (r = .39, 
p<.01). 

Since the observed relationships between the 
Berkowitz-Daniels SRS and measures involving 
social desirability were generally more than 
moderate, the legitimacy of the social responsi- 
bility construct independent from social desira- 
bility confoundings may be questioned. When 
one finds two supposedly distinct constructs 
covarying highly with each other, one is re- 
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minded of William of Occam’s razor: Entia non 
sunt multiplicanda, praeter necessitatem. 
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RESPONSE TO STONE 
LEONARD BERKOWITZ 


University of Wisconsin 


I would have been surprised and disappointed 


. lf the Social Responsibility scale (SRS) em- 


ployed by Berkowitz and Daniels (1964) had not 
been related to the various social desirability 
Measures, Indeed, a few minutes of thought 
should have convinced Stone (1965) that the 
correlations he obtained actually provide addi- 
tional-evidence for the validity of the SRS. 
The social desirability scales assume that high 
Scorers know what are “good” and “bad” item 
responses as defined by societal consensus, and 
are motivated to conform to this consensus. 
Putting it simply, the high scorer knows what is 
Socially proper and wants to appear proper. But 
these are essentially the same tendencies tapped 


by the SRS. The responsible individual, as de- 


fined by this instrument, also is strongly aware 
of the ideals of his particular segment of society 
aad apparently seeks to adhere to these stand- 
ards. 


A recent unpublished study by Berkowitz 
and Lutterman is illustrative. Upon interviewing 
a representative sample of Wisconsin adults, we 
found that the people with the highest scores on 
an abbreviated version of the SRS were most 
likely to report conformity to many of our soci- 
etal ideals. Within both the middle and working 
class proportionally more of these persons said 
they had contributed money to educational and 
religious organizations, had done volunteer work 
for charitable, health, civic, and/or political 
groups, belonged to several organizations or 
clubs, had a strong interest in both local and 
national politics, and had voted in the 1960 and 
1962 state elections. Further, they also knew 
more about politics. It is clear, moreover, from 
the total pattern of findings that the people 
scoring highest on the SRS tend to be fairly 
conventional. For example, more of the high 
than low scorers reported being church members 
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and the former were also more likely to hold 
conventional religious beliefs. 

Stone’s results just broaden the generalizations 
that can be made. The SRS assesses a readiness 
toodo what is socially desirable, including the 
giving of socially desirable responses to opinion 
statements. From this point of view, even a more 
subtle and relatively disguised instrument should 
yield positive correlations with the various social 
desirability scales.1 


1Too many people have assumed too readily that 
measures found to be correlated with social responsi- 
bility scales are necessarily subject to extensive arti- 
facts. In many cases the obtained relationships 
actually demonstrate the validity of the measures. 
Thus, Heilbrun (1964) recently has argued that 
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“the tendency for MMPI scales to load inversely 
to social desirability . . . issa consequent of the 
actual inverse relationship between deviancy and 
social acceptability of behavior [p. 3771.” 


REJOINDER TO BERKOWITZ: 
SOCIAL DESIRABILITY OR SOCIAL RESPONSIBILTY ? 


LEROY A, STONE 
Mental Health Research Institute, Fort Steilacoom, Washington ji 


In responding to my findings and suggestions 
(Stone, 1965), Berkowitz (1965) appears some- 
what to have switched definitions in the middle 
of his argument, Not long ago, Berkowitz and 
Daniels (1963) stated that “socially responsible 
actions may be performed—particularly if the 
costs are not too great—without conscious, ex- 
plicitly felt anticipations of reciprocated rewards 
[pp. 429-430].” Now, Berkowitz regards the 
socially responsible individual as being “strongly 
aware of the ideals of his particular segment of 
society and apparently seeks to adhere to these 
standards [p. 757].” He now even suggests that 
socially responsible persons (as defined by scoring 
high on the SRS) “tend to be fairly conven- 
tional.” It appears that Berkowitz is not inter- 
ested in social responsibility as altruistic be- 
havior, but rather is interested in a somewhat 
different phenomenon, the desire to appear (per- 
haps even to one’s self) to be a socially responsi- 
ble person. 

In my opinion, to use the label of social re- 
sponsibility to describe verbalized conformance 
to societal standards is a misnomer. I fail to 
understand why Berkowitz needed the SRS; all 
he needed do was to utilize an already existing 
social desirability scale. What the SRS apparently 
represents is a brief social desirability scale 


which uses items involving social responsibility 
content., 

It has been suggested by Campbell (1960) 
that new psychological tests (prior to their being 
offered for general use) be examined regarding 
their degree of correlation with social desirabil- 
ity measures. Campbell believes that, “If corre- 
lations are reported with trait-appropriate oF 
criterion measures, then it should be demon- 
strated that the new test predicts these measures 
better than does the general social desirability 
factor [p. 549].” My original bone of contention 
was not the fact that the SRS correlated posi- 
tively with social desirability measures but the 
degree which this 22-item scale did so. 
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ETHNIC AND SOCIAL CLASS DIFFERENCES IN PERFORMANCE 
FOR MATERIAL AND NONMATERIAL REWARDS: 


$ NEW ZEALAND CHILDREN? 


THOMAS STORM 
University of British Columbia 
W. S. ANTHONY anp ROGER D. PORSOLT 


University 


of Auckland 


ə ° An experiment was performed to test the hypothesis that Maori children 
would perform better for material than for nonmaterial rewards in comparison 
with p&keha (New Zealanders of European ethnic origins) children, Pakeha 
children from 2 social classes were used, in the expectation that nonmaterial 
rewards would be more effective in middle- than in lower-class Ss. 60 boys were 
tested in a manipulatory speed task, under both reward conditions, with order 
of gonditions counterbalanced, The hypothesis was confirmed in the 5-6 year 


group, but no significant subcultural 
were obtained from 10-11 year olds. 


The basic or modal personality of the New 
Zealand Maori has been the subject of a series 
of investigations summarized by Beaglehole and 
Ritchie (1961). A number of specific hypotheses 
have been formulated in the course of these 
investigations which are more or less supported 
by®various types of evidence, largely participant 
observation, interview, and projective test results. 
This paper reports an attempt to test a predic- 
tion from one of these hypotheses in an experi- 
mental task specifically designed for the purpose. 

One of the generalizations proposed by Ritchie 
(1956) is that 
the Maori personality structure is oriented around 
safe paths to immediate gratification .. . with a 
consequent emphasis on response in terms of direct 
reinforcements rather than secondary reinforce- 
ments [p. 81]. 


* This generalization suggests that Maori children 


will perform better for material rewards than 
for symbolic or inferred rewards, and will con- 
trast, in this respect, with pakeha * children, 
Differences in the effectiveness of the available 
rewards may help to account for differences in 
school achievement. The present study was de- 
signed, therefore, as a test of this hypothesis. 
Douvan (1956) found that the n Achievement 
scores from projective stories of lower-class sub- 
jects were higher when material reward was pres- 
ent for a series of tasks than when no material 


1This research was supported by a grant from 
the Society for Psychological Study of Social Issues 
to the senior author. 

2 The term “pakeha” is a convenient one to refer 
to New Zealanders of European ethnic origins, and 
will be used in this sense throughout the report. 


differences in response to the conditions 


reward was offered. Reward conditions had no 
significant effect on the n Achievement scores of 
middle-class subjects. 

Terrell, Durkin, and Wiesley (1959) have 
shown an interaction effect of social class and 
reward conditions: the lower-class children in 
their study performed better when a material 
reward was present; the middle-class children 
performed worse. Zigler and Kanzer (1962) have 
suggested that there may be age differences in 
the effects of rewards, with the importance of 
material in comparison with nonmaterial rewards 
decreasing with age. In view of these reports, 
children at two ages were tested, and pakeha 
children from both middle- and lower-class back- 
grounds. Maori children with middle-class back- 
grounds are relatively rare and were not included 
in the present study. 


METHOD 
Subjects 


All subjects were enrolled at a single urban pri- 
mary school in a mixed socioeconomic area, Five 
pairs of Maori boys, each pair matched for age, 
were selected from the 5-6 year old population in 
the school. Two pakeha boys were then selected 
from each social-class group matched in age with 
each other and with one of the Maori pairs. There 
were, then, five sets of six boys all of the same age, 
and a total of 10 Maori, 10 lower-class pakeha, and 
10 middle-class pakeha boys of 5-6 years. This 


8 We wish to express our appreciation to the stu- 
dents of Orakei primary school in Auckland who 
served as subjects, to their parents for their kind 
permission for their sons’ participation, and to the 
principal and staff for their cooperation throughout 
this study. : 
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TABLE 1 TABLE 2 
ANALYSIS OF VARIANCE OF SCORES ON ANALysIS OF VARIANCE OF SCORES ON Z 
SPIRAL TAsK: 5- AND 6-YEAR-OLDs Spmat Task: 10- anp 11-YEAR-OLDS 
? Source a | Ms F Source af MS F 
Between subjects Between subjects 
Subculture (S) 2 9.65 1.33 Subculture (S) z 67 | <1 
Order (0) 1 2.02 | <1.0 Order (0) 1 45.07 10.88** 
sxo 2 3.02 | <1.0 SXO 2 1.47 | <1 
Error (a) 24 7.25 Error (a) 24 4.14 
Within subjects Within subjects : i 
Reward conditions (R) 1 82 2.04 Reward conditions (R) 1 3.27 @3.81 
SXR 2 2.82 7.04** SXR 2 27 | <i 
OXR 1 43.35 | 108.38** OXR 1 123.27 | 143.61** 
SxOXR 2 05 | <1.0 SXOXR 2 67 | <1 
Error (b) 24 40 Error (b) 24 86 
** b <.01. > <.01. 


procedure was repeated with the 10-11 year group. 

A boy was considered a Maori if he was enrolled 
by his parents as a Maori. He was considered lower 
class if his father was a skilled or unskilled manual 
worker, He was considered middle class if his 
father was a professional, in business, or a clerical 
worker, Middle-class Maori boys were omitted from 
the study. The effects of ethnic group and social 
class, therefore, cannot be evaluated independently. 
The three groups of subjects at each age level will 
be referred to as “subcultural groups.” 

The same experimenter tested all subjects. He 
(RDP) was a 19-year-old, middle-class pakeha. 


Apparatus 


Twenty triangular spirals were fixed, 8 inches 
apart, into a long wooden board. The spirals were 
made from bronze rod, .25 inch in diameter. Each 
was 10.5 inches high, including a straight section 3 
inches long and three complete turns. During test- 
ing, the apparatus rested on a row of tables 2 feet 
high in an unused classroom. The apparatus and the 
task, described below, were adapted from Davis, 
McDowell, Deter, and Steele (1956) 4 


Experimental Conditions 


There were two conditions, material and non- 
material reward. Each subject was tested under 
each condition. Order was counterbalanced within 
each pair matched for age, in each subcultural 
group, at the 5-6 and 10-11 year levels. At each 
age level then, there were five boys from each sub- 
cultural group who received the material reward 
condition followed by the nonmaterial reward con- 
dition, and five who received the experimental con- 
ditions in the reverse order. 

Tn the nonmaterial reward condition, all 20 spirals 
were threaded with a perspex bead. In the material- 
reward condition, the second, and every fourth spiral 


*We are grateful to Hubert Sampson, of the 
- TWUniversity of Auckland, Psychology Department, 
for suggesting this task. 


thereafter, plus the twentieth, were threaded with an 
orange-colored, butterscotch Lifesaver. The remain- 
ing spirals were threaded with a perspex bead. 


Procedure 


Subjects were tested individually. They were 
brought into the testing room by the experimenter 
and shown the row of threaded spirals. The subject 
was told his task was to take the beads off jhe 
spirals, and the experimenter demonstrated with the 
first bead, replacing it on the spiral.5 

In the nonmaterial condition, only perspex beads 
were present and the subject was told that the ex- 
perimenter just wanted to see how many beads he 
could get off in the given time. When this condition 
followed the material reward, it was emphasized 
that the subject would get no more Lifesavers. 

In the material condition, the experimenter pointed 
out the Lifesavers and stressed to the subject that 
he could keep all the Lifesavers he got off, and that 
the faster he went, the more Lifesavers he would 
get to keep. 

Subjects were told to work from left to right, and 
to use their right hand and to put the other in their 
pocket. Subjects were given 1 minute and 30 seconds 
under each condition. They were not allowed to eat 
the Lifesavers as they took them off, but, in fact, 
no child attempted to do this, No subject was able 
to unthread all the spirals in the time provided. 


RESULTS 


Two scores were obtained for each subject: 
the number of spirals unthreaded in the material 
and the number in the nonmaterial condition. 

Table 1 shows the result of the analysis of 
variance in the 5-6 year boys. The significant 
interaction of subculture with reward condition 
confirms our hypothesis: both pakeha groups 

5 Verbatim instructions in each condition and 


order can be obtained by writing to the senior 
author, 
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scored higher in the nonmaterial-reward (NMR) 
than in the material-reward (MR) condition; 
the difference was greater for the pakeha middle 
class than for the pakeha lower class. Maoris 
scored higher in the MR than in the NMR con- 
dition. 

The Order X Conditions interaction is best 
described as a practice effect; when the order is 
for MR to follow NMR condition, then the MR 
condition yields higher scores than NMR; when 
the order is for NMR to follow MR condition, 
then the NMR condition yields higher scores 
than the MR condition. 

Table 2 shows the result of analysis of’ vari- 
ance in the 10-11 year boys. The Order reflects 
a superiority of performance under the MR- 
NMR order relative to the reverse order; no 
explanation is offered for this. The Order X Con- 
ditions practice effect appears as for the 5-6 
year group. There is no significant Subculture X 
Conditions interaction. 

Table 3 shows the result of analysis of vari- 
ance in all the boys. The Order effect is in the 
same dfrection as that obtained in the 10-11 
year group. The Age effect reflects a superiority 
of performance in the older age group. The Con- 
ditions effect reflects a superiority of perform- 
ance in the NMR condition relative to the MR 
condition. The Order X Conditions interaction is 
the practice effect noted above. The triple inter- 
action Order X Age X Conditions may be de- 
scribed as an increase in practice effect with in- 
creasing age. The Subculture X Conditions inter- 
action is nearly but not quite significant at the 
.05 level; the relations within this interaction 
table are as described for the 5-6 age group 
except that the Maori group scores equally 
under the two reward conditions. It should be 
noted that although the Subculture X Conditions 


* interaction was highly significant at 5-6 years 


and not at all significant at 10-11 years, there 
is no significant Subculture X Conditions X Age 
interaction. 


Discussion 


The results from the 5-6 year boys support 
the initial hypothesis and thus support Ritchie’s 
hypothesis about Maori personality. The predic- 
tion from the hypothesis is not confirmed, how- 
ever, in the older age group. There is no evidence 
in this study that, at the age of 10-11 years, 
Maori boys are any less motivated than pakeha 
cboys for whatever secondary rewards are avail- 
able in our NMR condition, or are less moti- 
vated by these nonmaterial rewards than by 
direct, material rewards. 

The difference in the findings between the age 
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groups with respect to our hypothesis can be 
interpreted only very cautiously in view of the 
failure to find a significant interaction between 
the three variables of age, subculture, and re- 
ward condition in the overall analysis. Our én- 
terpretations are offered only as suggestions for 
further research. 

It may be that the motivation to work for the 
less direct rewards characteristic of the class- 
room competition with others, or with self- 
imposed standards, is developed earlier in pakeha 
children than in Maori, probably in the home, 
and particularly in the middle-class home. This 
motivation may develop later in Maori children, 
however, as an effect primarily of the school 
environment itself. If performance on our task 
under nonmaterial conditions is taken to reflect 
achievement motivation, the results are consist- 
ent with the results of Ausubel’s (1961) study of 
Maori and pakeha adolescents. He found that 
there were no differences between these groups 
in n Achievement score on the TAT, nor in 
expressed educational or vocational aspirations, 
If it were found on further investigation that 
projective or other measures plausibly related to 
the achievement motive did differentiate Maori 
from pakeha children at early ages while failing 
to differentiate at later ages, this would be sig- 
nificant not only for the study of the Maori 
personality, but for theoretical formulations of 
the origins of achievement motivation. The em- 


TABLE 3 


ANALYSIS OF VARIANCE OF SCORES ON 
SPIRAL Task: ALL SUBJECTS 


Source df MS F 
Between subjects 
Subculture (S) 2 4.76 | <1 
Order (0) 1 33.08 5.81* 
Age (A) 1 715.41 | 125.60** 
sxo 2 4,28 | <1 
SXA 2 5.51 | <1 
OXA 1 14,01 2.46 
SXOXA 2 26 | <1 
Error (a) 48 5.70 
Within subjects 
Reward conditions (R) 1 3,68 5.84* 
SXR 2 1.98 3.14 
OXR 1 156.41 | 248.60** 
AXR 1 41 | <1 
SXOXR 2 .26 | <1 
SXAXR 2 1.16 1.84 
OXAXR 1 10.21 16.23** 
SXOXAXR 2 a1 | <1 
Error (b) 48 63 
*p <.05. 
+p <.01. 
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phasis in these formulations so far (e.g., Mc- 
Clelland, 1961; Winterbottom, 1958) has been 
on early child-training demands of the parents. 
It would be interesting if it could be shown that 
“achievement motivation” can also be established 
through the agency of schools or peer groups. 
The greater difference between MR and NMR 
conditions for middle-class subjects than for 
lower-class subjects replicates in the New Zea- 
land setting the findings of Terrell et al. (1959) 
with American children. Lower-class boys in 
their study, however, did better for material 
than for nonmaterial rewards. This was not the 
case in our study. > 
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GROUP SETTING? | 
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Previous studies have shown that when high- and low-anxious (HA and LA) 
Ss, operationally defined by extreme scores on the Taylor MA scale, judged 
the amount of confusion present in the vocabulary responses of schizophrenics 
there was a consistent though transient lowering in interjudge reliability for 
HA Ss. The 3 experiments reported here duplicated the previous methodology 
except that the judges performed in groups rather than individually. Under 


this condition the previously reported difference between HA and LA Ss was 


not found. A comparison with results obtained under individual conditions 


showed that reliability is facilitated in the group setting. 


In studying clinical judgment, Hunt and his 
associates (Hunt & Blumberg, 1961; Hunt & 
Walker, 1963) have demonstrated a relationship 
between anxiety level, defined by scores on the 
Taylor (1953) Manifest Anxiety (MA) scale, 


1This study is part of a larger project being con- 
ducted at Northwestern University under the direc- 
tion of William A. Hunt through Contract Nonr- 
1228 (18) with the Office of Naval Research, 


and performance on a judgmental task. Low MA 
scale subjects were significantly more reliable, 4 
measured by higher interjudge agreement, than 
high MA scale subjects on the first trial of the 
task and equally reliable thereafter. Interpreta- 
tion of these findings was based on the assumed 
Presence and subsequent dissipation of more 1- 
relevant response tendencies (Sp) in the high 
MA scale subjects (Hunt & Walker, 1963)- 


| 
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In the course of our many experiments on 
judgment we have noticed some apparent lessen- 
ing ef tension in the judgmental situation when 
the judges were working in groups rather than 
individually. Because of the assumed connection 
between tension and irrelevant response tenden- 
cies we were led to the further hypothesis that 
the lower reliability demonstrated for the high- 
anxiety subjects would not be present in a group 
situation. 


PROCEDURE 


All subject’ were Loyola undergraduates enrolled 
in introductory psychology courses. The subjects 
took the MA scale a part of a regular classroom 
exercise and participated in the judgment task 8 
weeks later. The 20 high-anxious (HA) and 20 low- 
anxious (LA) subjects used in Experiment I had 
obtained the most extreme MA scale scores in a class 
of 103 students. The 12 HA and 12 LA subjects in 
Experiment II were the extreme scorers in a class 
of 78. The 20 HA and 20 LA subjects for Experi- 
ment III were chosen from a class of 100 students. 
The range of MA scale scores for both the HA and 
LA groups in all three experiments was comparable 
to that *€ported in previous studies (Hunt & Walker, 
1963). 

The stimuli for Experiments I and II were 100 
schizophrenic vocabulary test responses standardized 
by“Hunt and his associates (Hunt & Jones, 1962) and 
used in previous experiments (Hunt & Walker, 1963). 
These were presented in 10 sets of 10 each with 
each set containing two stimuli at each of the first 
3 scale points and one at the remaining 4 points of 
a 7-point scale. Only 50 stimuli were used for Ex- 
periment III. 
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For each of the three experiments, all subjects 
were seated in a classroom and given a data sheet 
on which they were to record their judgments. The 
instructions were: 


We are going to present you with a numbereof 
responses made by schizophrenic patients to vo- 
cabulary test items taken from an intelligence 
test. One of the ways in which the pathology of 
schizophrenia may express itself is through dis- 
organized thinking which results in atypical, un- 
usual, or “abnormal” responses to the items on 
such a test. The qualitative interpretation by the 
clinician of such test responses is one of the bases 
upon which he may make a clinical or diagnostic 
interpretation. The extent of the disorganization 
exhibited in these responses is not uniform. In 
some of the responses it is minimal and in others 
it is extreme. 

You are asked to rate these responses on a 
7-point scale, from 1 through 7, according to the 
severity of the disorganization exhibited in the 
response, with the low end of the scale represent- 
ing minimal disorganization and the high end of 
the scale representing maximal disorganization. 
In making these ratings we are asking you to 
concentrate upon the severity of the disorganiza- 
tion exhibited in the response. In essence, what 
we are asking you to do is to judge how “schizo- 
phrenic” each response is. Some responses will 
seem quite normal; those you would rate 1. Others 
will be so disorganized as to require a 7 rating. 
The majority will fall somewhere in between. 

We are now going to project onto the screen a 
stimulus word and the response to it. Think out 
your rating carefully, but as soon as you make 
up your mind give your response, 


TABLE 1 


SIGNIFICANCE OF DIFFERENCES BETWEEN Mean ĪNTERJUDGE RELIABILITIES FOR HIGH 
Anxious (HA) anp Low ANXIOUS (LA) JopcEs 


Experiment I Experiment II Experiment III 
ce a ee 
Trial HA LA HA LA HA LA 
t t 
M M M M M M 
1 .282 2.42* 1,535 1.565 0.20 1.587 1.735 1.25 
2° o £ i075 1.57 1.126 1.032 0.61 1.284 1.357 0.61 
3 1.557 1.407 1.26 1.739 1.621 0.76 1.562 1.638 0.64 
A 1.734 1.632 0.86 1.983 1.752 1.50 1.835 1.730 0.88 
5 1.326 1.378 0.44 1.364 1.389 0.16 1.382 1.482 0.84 
6 1.321 1.219 0.86 1.381 1.381 0.00 
7 1.595 1.530 0.55 1.872 1.585 1.86 
8 1.500 1.413 0.73 1.635 1.665 0.20 
9 1.353 1.273 0.67 1.306 1.243 0.41 
10 1.648 | 1.585 0.53 1.474 1.634 1.04 
SDaitr, = 119 SDaitt. = -154 SDaitr, = -119 


Note.—All means are s’ values. 
*p <.05. 


ste 
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TABLE 2 
SIGNIFICANCE OF DIFFERENCES BETWEEN MEAN INTERJUDGE RELIABILITIES UNDER 
INDIVIDUAL AND GROUP CONDITIONS 
> $ HA LA 
M SD of M SD af t 
Individuale 1.30 17 1.25 17 
Grae 1.45 22 18 1.61 1.38 16 18 1.70 
Group 2 1,54 .26 18 2.29* 1.49 -20 18 PAi be. 
Individual? 1.25 19 | 125 24 
d rede es 2.01 d 8 2.58* 
Group 3 1.53 .20 1.59 11 


Fy Hunt and Walker (1963) Experiment 2. 
b From Hunt and Walker i563) first five trials. 
*p <.05. 


First you will be given four practice stimuli. 
Do you have any questions before we begin? You 
will have an opportunity to ask questions after 
the practice stimuli, but once the experiment 
starts you will have to hold all questions until 
the end of the experiment. I shall be glad to 
answer any additional questions at that time. 


The stimuli were then projected onto a white 
screen by a projector controlled by the experimenter, 
The interstimulus interval was 5 seconds; the in- 
terset interval, 1 minute. 


RESULTS AND Discussion 


Reliability was measured in these experiments 
by interjudge agreement as represented by the 
product-moment correlation between each mem- 
ber’s judgments and the mean for his group, for 
example, each HA subject against the mean for 
the HA group. These correlations were then 
transformed into 2’ values and the differences 
between the means of both groups on each trial 
were tested for significance by £ tests. 

The results for the three experiments are in 
Table 1. There was only one significant differ- 
ence (p < .05) between the HA and LA groups, 
As in the earlier anxiety-judgment work, the 
difference appeared on a first trial, but the direc- 
tion of the difference was reversed with the HA 
now demonstrating the higher interjudge relia- 
bility. None of the three experiments replicated 
the results of previous anxiety-judgment investi- 
gations in which LA subjects have consistently 
demonstrated a significantly better first trial 
performance. 

In view of Allport’s (1920) early observations 
on “social facilitation” in the group setting it 
seemed desirable to run a comparison of relia- 
bility in the individual and group setting using 


the individual data from a previous study (Hunt 
& Walker, 1963). Only Experiment II from this 
study was comparable as Experiment I along 
with the original Hunt and Blumberg (1961) 
data used repeated stimulus series while*the three 
experiments reported in the present article used 
stimuli which were only presented once as in the 
Hunt and Walker Experiment 2. se 

Table 2 presents a comparison of the mean 4 
values across trials of the present three experi- 
ments run under group conditions with those 
previously obtained in the individual setting. 
Since Experiment III involved only five trials it — 
was compared with only the first five trials of 
the Hunt-Walker series. Looking at both the HA © 
and LA groups in each of the experiments, we 
found that all comparisons showed a greater 
reliability for the group setting. Three of these 
differences were statistically reliable (p < 05): 
Something in the group setting certainly is facili- 
tating reliability. Whether this something is All- 
Port's “social facilitation,” a factor related to 
Spence’s (1963) “drive,” one associated with 
Child’s (1954) “drive stimulus,” or some other 
variable cannot be answered by the present ex- 
periments. It is obvious, however, that the decre- 
ment in reliability suffered by HA judges disap- 
Pears in a group setting. We see no reason tO 
change our previous belief that this is a drive 
stimulus effect. That it seems to be operating 
independently of the general group facilitation 
Suggests that some further explanation contin- 
gent on subsequent research is necessary. 
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RISK TAKING IN NEGRO AND WHITE ADULTS* 


HERBERT M. LEFCOURT 


University of Waterloo, Ontario 


30 Negro and 30 white adult males performed in a risk-taking experiment. 
Contrary to previous findings in skill-demanding tasks, Negroes demonstrated 
more behavior reflecting an internal control orientation than whites in the 


« gambling situation 
less shifts of bets, 


under study. Negroes chose less low probability bets, made 
and generally took less risks than whites. The perception of 


Negroes that success in conventional tasks is controlled by forces other than 

their own efforts is apparently reversed in games of chance. The inference 
Å follows that environmental changes in the opportunity structure will con- 

tribute toward more of an internal control orientation among Negroes. 


In several recent investigations Negroes, con- 
trasted to whites, have appeared more failure 
avoidant and less confident of their ability to de- 
termine their own reinforcements. In a series of 
studies by Katz and his associates (Katz & Ben- 
jamin, 1960; Katz & Cohen, 1962; Katz, Golds- 
ton, & Benjamin, 1958; Katz & Greenbaum, 
1963) Negroes maintained low estimates of their 
own capacities in comparison with whites and 
„seemed less assertive than whites. In studies by 
Battle and Rotter (1963) and Lefcourt and Lad- 
wig (1965) lower-class Negroes scored higher on 
external control than lower-class whites, both on 
attitude measures and performance task meas- 
ures of the internal-external (I-E) control of 
reinforeement dimension (Rotter, Seeman, & 


1 The research was supported in part by the United 
States Air Force, under Contract No. AF 49(638) -317 
monitored by the Air Force Office of Scientific Re- 
search of the Air Research and Development Com- 
mand. The author wishes to express his appreciation 
to Alvin Scodel, Jerome S. Miller, and Irwin Silver- 
man for their help and interest at different stages of 
this investigation. Appreciation is also expressed to 
the administration and staff at the United States 
Public Health Service Hospital in Lexington, Ken- 
tucky. The opinions expressed are strictly those of 
the author and not of the United States Public 
Health Service, 


Liverant, 1962). External control refers to a 
person’s belief that his efforts and his reinforce- 
ments are unrelated. The latter two investiga- 
tions have attempted to interpret Negroes’ lack 
of achievement-oriented behavior as being a 
function of Negroes’ generalized expectations 
that they are powerless to determine their own 
fates. 

These findings, together with those from so- 
ciological and psychiatric investigations (Brody, 
1961; Dollard, 1937; Finestone, 1960; Rose, 
1956), as well as popular fiction (Ellison, 1947; 
Wright, 1937) and essays (Baldwin, 1961), in- 
dicate that Negroes become failure avoidant and 
behaviorally deviant largely because they have 
low expectancies for obtaining positive reinforce- 
ments for more socially desirable behavior. So- 
cially desirable behavior, in this instance refers 
to efforts toward mastery and toward achieve- 
ment striving. 

Littig (1959) has hypothesized that persons 
with strong motivation to avoid failure would be 
more likely to become involved in a game of 
chance than persons characterized as strongly 
motivated to succeed. Within the framework of 
social learning theory (Rotter, 1954) it might 
be hypothesized that subjects who have low ex- 
pectancies that participation in skill situations 
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will lead to success should become more involved 
in nonskill, or chance situations. In essence, 
chance- or luck-determined situations may pro- 
vide an alternative mode for securing reinforce- 
ments such as money when skill-demanding situa- 
tions (work, business, etc.) are perceived as im- 
probable avenues to success, If this formulation 
has some validity then it might help to explain 
what E. F. Frazier has described as the Ameri- 
can Negroes’ obsessive interest in gambling 
(Frazier, 1962). 

The thesis is that Negroes, deprived of most 
achievement opportunities available to whites, 
have come to perceive reinforcements aseunob- 
tainable when seeking to gain them through con- 
ventional channels, As an alternative, Negroes 
can hope for more success through luck- or 
chance-determined activities as in gambling on 
“numbers,” races, cards, etc. 

In the present study the subject population is 
comprised primarily of lower-class Negroes and 
whites. These racial groups have previously been 
found to perform differently in skilled tasks; 
whites exhibited more internal-control behavior 
as reflected in patterns derived from level of 
aspiration performance; Negroes, on the other 
hand, performed as if reinforcements occurred 
by chance (Lefcourt & Ladwig, 1965). It is pre- 
dicted that in a chance-determined task (gam- 
bling) Negroes will demonstrate more internal- 
control behavior than whites, 

Liverant and Scodel (1960) have described 
certain indices of performance in a gambling 
situation said to reflect the I-E dimension, Spe- 
cifically, they predicted that I subjects would 
select more high probability bets than E sub- 
jects, and that I subjects would be less variable 
in their choice of bets than E subjects. These 
predictions were based on the assumption that 
I subjects would seek control in a chance situa- 
tion by using a cautious and planned selection of 
probabilities, E subjects were expected to select 


TABLE 1 
Payorr Matrix 
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bets more on the basis of hunches and the out- 
come of previous trials (gambler’s fallacy) rather 
than on objective probabilities. ® 
The specific hypothesis of the present study | 
is that Negroes, who generally behave (AS. ex 
ternal-control persons in skill tasks, display more 
internal-control behavior in chance situations as 
reflected in more cautious selection of probabili- 
ties, with less variability, and more choice 1 
i 


high probability bets than a comparable sample 
of white subjects. 


METHOD q 
Subjects 


Sixty male prisoners (30 Negroes and 30 whites) 
from a hospital for narcotic addicts participated in 
this experiment on a voluntary basis, Their average 
age was 31 and they all had scored in the average 


range of intelligence. They had’been removed from 
drug dependence for more than 2 months, and had 
cooperated with hospital routines previously. Each 
group was comprised of lower class-urban indi- 
viduals from eastern cities, There were no differ- 
ences between the racial groups in intelligence or 
social class. 


Procedure 


Subjects were all instructed that they were to par- 
ticipate in an investigation of decision-making be- 
havior which had no connection with hospital pro- 
cedures. They were each tested alone by a white 
examiner, 

Each subject was initially given chips worth $7.50 
and was instructed as follows: 


You are going to play dice. You can’t lose any 
money but you have a very good chance to make 
some. You are going to throw the dice 30 times. 
Before you throw each time, you will tell me two 
things. The first is how much you want to bet, 
and that will be either $.10, 20, 30, or 40. The 
second thing you tell me is your choice of bet. 
There are seven possible bets that you can make, 
[At this point the subject is shown the payoff 
matrix, Table 1, which was left in view through- 
out the session.] 


E F G 
7 9 3 
|__—_—_———_ §j 
-50 .80 1.70 
1.00 1.60 3.40 
1.50 2.40 5.10 
2.00 3.20 6.80 
1in6 1in9 1 in 18 
(1 (11) (.06) 
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TABLE 2 
y FREQUENCY OF BETS FOR NEGRO AND WHITE SUBJECTS 
R Bets 
. 
A B Cc D E F G 
Negroes (V = 30) 
Total 228 382 175 29 36 15 35 
M 7.60 12.73 5.83 97 1.20 -50 1.17 
SD 10.8 9.9 1.8 17 2.9 1.2 4.2 
Whites (N =_30) 129 360 117 65 147 33 49 
u © 4.30 12.00 3.90 2.17 4.90 1.10 1.63 
SD e71 9.2 ARS -42 74 2.5 23 
t 1:38* .04 1.21 05 2.50** 1,18 308 
4 * p = .10, one-tailed test, 
s Dp = 01. 


The payoff matrix was explained in terms of the 
objective probabilities and payoffs for each bet. It 
was stressed that the payoff did not include the 
amount bet. Examples were given to clarify the 
procedures, and all subjects stated they compre- 
hended the game. The subject was then informed: 


At the end of the 30 throws, if you have $7.50 
or less, you will keep 10% of that amount. If you 
end with more than $7.50 you keep $.75, plus half 
of anything over it. For example, if you end with 
$13.00, you will have $3.50 profit. You will then 
be given a pack of cigarettes for each $.25 you 
have earned.? 

The outcome of each throw was determined en- 
tirely by chance and, as can be seen in Table 1, each 
bet paid precisely what it would in a completely 
fair game. 


RESULTS 


The analysis of betting behavior is divided into 
two parts. The first deals with risk-taking choices 
and probability preferences. The second part is 

. concerned with variability in risk-taking behavior. 


Risk-Taking Choices 


Table 2 gives the frequencies of bets among 
the seven alternatives for each group. As in the 
Liverant and Scodel (1960) study, the B and C 
bets were strongly preferred. However, the sub- 
jects frequently chose the A bet also, more often 
than the C bet. 
` Mean differences were computed between racial 
groups for each of the seven categories. It was 
Predicted that Negroes would choose more high- 
Probability bets and less low-probability bets 
than white subjects. Although there is a trend 
in this direction regarding the high-probability 
bets, the differences are barely significant (t= 

2 Hospital regulations forbid giving money directly 
to patients. Cigarettes were the actual currency in 
the institution, 


1.38, p <.10, one-tailed test, on Category A). 
However, there is a decided tendency for whites 
to select more low-probability bets than Negroes 
(t= 2.50, p < .01, one-tailed test, on Category 
E). Because the distributions were markedly 
skewed in several categories, Mann-Whitney 
tests were also computed on the same data. 
While most of the t-test findings were supported 
by the nonparametric analysis, a significant dif- 
ference was found between Negro and white 
choice of the G bet in the nonparametric analy- 
sis but not in its parametric equivalent (Negro 
M rank = 27:10; white M rank = 33.90; Z= 
1.73, p = .04). The discrepancy in analyses was 
found to be due to one extreme case in the Ne- 
gro sample who played G 23 out of 30 trials. 
The bets were grouped into three categories in 
accord with Liverant and Scodel’s (1960) pro- 
cedures: high-probability (A), intermediate-prob- 
ability (B, C, and D), and low-probability (E, 


TABLE 3 


FREQUENCY OF BETS DISTRIBUTED AMONG THREE 
Major CATEGORIES BY THE WHITE AND 
Necro Groups 


Intermediate} 


High Low 
probability probability Sr ANA 
(A) (B+C+D) | (E+F+G6) 
Negroes (W =30) 
Total 228 586 86 
M 7.60 19.53 2.87 
SD 10.77 10.97 6.23 
Whites (V=30) 
Total 129 542 229 
M 4.30 18.07 7.63 
SD 7.10 9.43 8.50 
t 1.38* 54 2.43** 


tres el OES Ae SS eee 
*p = .10, one-tailed test. 
++p = 01, one-tailed test. 


768 
TABLE 4 
VARIABILITY IN BET SELECTIONS FOR 
NEGRO AND WHITE SUBJECTS 
Negro | White 
Number of shifts 
Total 271 | 390 
M 9.03 | 13.00 | #=1.98** 
SD 78 7.5 
Proportion of subjects -66 90 |Z=2.24*** 
making one or more un- 
usual shifts 
Proportion of subjects -60 .80 | Z=1.69* 
making two or more un- k 
usual shifts 


*p = .05 
** b <.03, one-tailed test, 
wp = 01. 


F, and G) bets. These data are presented in 
Table 3. The differences between Negroes and 
whites in the choice of high- and intermediate- 
probability bets are in the predicted directions 
though the differences were again barely signifi- 
cant. In the low-probability category, however, 
whites demonstrate a significantly greater choice 
of such bets than Negroes. This parallels the 
findings of Liverant and Scodel with I and E 
subjects. 

As in prior investigations (Scodel, Ratoosh, 
& Minas, 1959), an index of degree of risk taking 
was computed for each subject by using the 
maximum possible payoff for each series of bets. 
This index is comprised of the Sum of Probabil- 
ity X Amount bet regardless of the outcome of 
the bet. That is, bets were summed as if each of 
the 30 bets had a favorable outcome. If the sub- 
ject, for example, played $.10 on the E bet on 
each of his 30 trials, his maximum possible 
payoff would be $15.00 ($.50 x 30). The mean 
for Negroes was $19.08, and for whites, $30.23. 
A ¢ test demonstrates statistical significance (t 
=2.15, p <.025, one-tailed test). Whites ap- 
peared less cautious, wagering more money on 
less probable outcomes than Negroes. 

In addition Negroes and whites were com- 
pared on their mean amounts wagered and their 
mean probability preferences. The former meas- 
ure involves the choice of amount ($.10 to $.40) 
wagered on each bet regardless of the odds. The 
latter represents the mean probability of any 
subject’s 30 bets. If a subject chose the A bet 
for 30 trials, then his mean probability score 
would be .67 (see Table 1), 

No differences were found in the amounts 
wagered (Negro M = $.35; white M = $.34; t= 
-50). However, whites exhibited a preference for 
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lower probability choices (white M = .39; Negro 
M= 47; t= 2.38, p < .02, one-tailed test). 

In the above measures which provide general 
indices of risk taking, Negroes appear more cat- 
tious than whites on both the mean probability 
measure and the risk-taking index (maximum 
possible payoff). The failure of the amount-bet 
index to differentiate groups may be due to the 
restricted range of choices offered ($.10, .20, .30, 
and .40). 


Variability F 
The major measure of variability in betting 


was obtained by counting the number of shifts 
and the magnitude of such shifts from one bet- 


ting category to another in each subject’s set of R 


30 bets. Second, a measure of unusual shifts was 
used. Both measures derive froin Rotter’s (1954) 
measures used for analysis of performance in 
the level of aspiration task. An unusual shift 
generally consists of a raise in goal level after 
failure, or, a decrease in level after success, In 
this instance, an unusual shift consists of choos- 
ing a less probable bet after losing on a more 
probable bet; or, choosing a more probable bet 
after success on a less probable one. The Ta- 
tionale for use of unusual shifts rests on. its 
ability to depict a gambling, or external-control 
orientation, the greater number of unusual shifts 


characterizing the chance orientation. If the sub- « 


ject perceives each event as totally independent 
of preceding events, then unusual shifts should 
be more frequent. Such behavior would be typi- 
cally elicited in gambling situations. However, if 
the subject believes that he controls the event 
in question, then the events are not independent. 
If the subject wins frequently, and he believes 
that it was not merely a series of fortuitous 0c- 
casions, then he should raise his bets. Briefly, the 
subject who believes that he can control out- 
comes should vary his expectation of outcomes 
with previous outcomes, thus exhibiting fewer 
unusual shifts than the subject who believes that 
chance determines each independent outcome, 
The incidence of unusual shifts has been col 
related with I-E measures (Simmons, 1959) and 
is found more commonly in chance than skill 
situations (Phares, 1957). 

As presented in Table 4, whites are found to 
shift significantly more than Negroes (t= 1.97, 
Ż <.03). In addition, when the magnitude of 
each shift (the number of categories shifted 
across) is compounded with frequency of shifts, 
whites are found more variable than Negroes 
(t= 1.52, p<.10) though not definitively s0- 
Second, more whites are found to make unusu: 
shifts than Negroes (Z= 2.24, p < .01). A sec- 


ond comparison was made between subjects mak- 

ing none or one and those making two or more 

unusual shifts. This was done to minimize the 

effect of the occasional unusual shift subjects 

» make toward the end of the game as a “last 

» fling.” A significantly greater proportion of white 
subjects made unusual shifts in this measure 
(2 = 1.69, p = .05). 


Additional Findings 


In addition to the game situation each subject 
completed the Allport-Vernon-Lindzey Study of 
Values Which’ has been related to risk-taking be- 
havior previously (Scpdel et al., 1959). There 
were no differences evidént in any of the six 
value areas in the Study of Values. However, for 
eeach subject, the deviation of his scores was 
computed so that for the white and Negro groups 
there was a mean of an individual variance index. 
This index was developed to investigate group 
differences in value commitments. Low variance 
would indicate a lack of preference consistency 
in this test so that the scores for each of the six 
values would be approximately the same (ap- 
proaching 40). This measure correlated — .30, $ 
<.05 with Dean’s (1961) Powerlessness scale 
which is similar to the Internal-External Control 
Scale (on Dean’s scale a high score indicates 
powerlessness or external control). In the present 
| samples M ranks were computed (Negro M rank 
= 26.17; white M rank = 34.83) and a Mann- 
Whitney test of differences revealed whites as 
being significantly higher in score than Negroes 
(Z= 1.92, p=.03). The finding suggests that 
among the six socially approved values there is 
less differential preference for Negroes or, in 
other words, these are less meaningful choices 
to a group that has low expectancy of success in 
Socially acceptable channels. 


Discussion 


In this study Negroes behaved in a more cau- 
ious, strategic, or internal-control fashion than 
a comparable group of whites. These findings are 
in direct contrast to those in skill-demanding 
4 situations, in which Negroes more often perform 
in an external-control manner. In motivational 
erms, the Negro seems more highly motivated 
to avoid failure in skill situations and more mo- 
tivated to achieve success in chance situations. 
This has been interpreted in a social learning 
Tamework as follows: through segregation and 
discrimination practices, Negroes have developed 
low expectations that reinforcements available 

and valued in our culture can be gained through 
U socially prescribed means. That, in the schoolroom, 
market place, or job, work or effort will rarely 
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lead to valued goals such as social recognition, 
financial comfort and security, protection from 
malevolent community forces, etc. Consequently, 
many Negroes have adopted less acceptable but 
relatively more risky methods for gaining ce% 
tain goals. Frazier’s (1962) description of Ne- 
groes’ obsessive gambling gives focus to one 
such method. 

In this study, the reinforcements used were 
cigarettes which are highly valued and serve as 
a substitute currency within the institution walls. 
Similarly, in the study by Lefcourt and Ladwig 
(1965), cigarettes were the reinforcements, and 
the sample was again drawn from a captive popu- 
lation, though from a different institution, for 
which cigarettes were a valued currency. The 
differences in the studies then reflect subjects’ 
different view of the tasks and not different 
valuation of the goals, It may be inferred that 
Negroes as compared to whites believe that 
achievement in self-evaluative, skill-demanding 
tasks is less instrumental to success than in ex- 
ternally controlled situations where determinants 
of success are largely luck or fate. 
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ANTI-NEGRO BIAS AMONG NEGRO COLLEGE STUDENTS? 


BRUCE L. MALIVER 2 
City College of the City University of New York 


To test the identification with the aggressor hypothesis among Negroes, 160 


northern and southern Negro males, 


grouped by acceptance or rejection of 


anti-Negro statements, were compared for the presence of 23 personality 


variables, Only the predicted relationshi 
and the variables percept of the father, 


were upheld. Accordingly, 


ip between degree of anti-Negro bias 
fear of rejection, and absence of fears, 
general hypotheses about differences in self- and 


parental percept and methods of coping with hostile interpersonal attack were 
rejected. Actual membership in civil rights organizations and participation in 
antisegregation activities were found to vary with anti-Negro bias. No North- 


South differences were found. 


In recent years, the phenomenon of minority 
group self-hatred has received considerable at- 
tention. Probably in response to the thought- 
provoking work of Lewin (1941) and Bettelheim 
(1943), three separate studies of anti-Semitic 


1This article is based on a dissertation submitted 
in partial fulfillment of the requirements for the 
PhD degree at Yeshiva University in October of 
1963. The writer is indebted to Norman B. Gordon 
and Victor Sanua who served as members of the 
dissertation committee, and to Bernard Kutner, 
chairman of the committee under whose supervision 
the investigation was conducted. A summary of the 
study was presented at the Meeting of the Eastern 
rae Association in Philadelphia, April 18, 
1964. 

2The author wishes to express his thanks to the 
administration for their cooperation in the prepara- 
tion of this paper, while he was employed by Co- 
lumbia University’s College of Physicians and Sur- 
geons at the Department of Psychiatry, Harlem 
Hospital. 


attitudes among Jews are available (Adelson, 


1953; Radke-Yarrow & Lande, 1953; Sarnof, 
1951). Since Sarnoff drew hypotheses from the 


general theory of identification with the aggtes 
sor (Freud, 1946) in an attempt to relate degree 
of Jewish anti-Semitism to a variety of person- 
ality factors, his study is of greatest interest 
here, 3 

Sarnoff discussed at length the relationship 
between the theory of identification with the 
aggressor and the study of Jewish anti-Semitism- 


He argued that there are social situations, anal- J 


ogous to the individual Oedipal situation, 1” 
which identificatory responses are likely to be 
evoked. Among the conditions considered neces 
sary for such responses are those in which 48° 
Sressors are determined to impose hostility upon 
social scapegoats, and where the victims a 
socially dependent upon the aggressors (an 
therefore are likely targets), and where the vit- 
tims cannot escape the sphere of influence of 


e 


i 


aggressors. When the position of the Negro in 
current American society is considered, it seems 
obvious that these conditions are also fulfilled, 
perhaps to an even greater extent than they are 


{> for Jews. This then, leads to the questions: Does 


* the concept of identification with the aggressor 
help explain Negro self-hatred and self-rejection? 
Will a relationship obtain between the degree of 
anti-Negro bias and personality variables derived 
from the theory? 

The devaluation of Negroid characteristics 
among Negrges was noted by Herskovits as 
early as 1928. Other writers such as Johnson 
(1943), Frazier (1940, ,1957a, 1957b), and 
Powdermaker (1939, 1943) have observed and 
interpreted various aspects of self-hatred among 
Negroes from the anthropological point of view. 
Studies of attitudes and stereotypes among Ne- 

> groes have shown that the Negro holds the same 

prejudices and stereotypes towards diverse ethnic 
groups as does the white, even to the point of 
largely accepting the negative stereotypes about 
himself (cf. Bayton, 1941, 1942). Authoritarian- 
ism in relation to other attitudes among Negro 

students has been studied by Steckler (1957) 

who, in the process developed a basic anti-Negro 

(AN) attitude scale. Until this current study, 

how@ver, there has been no reported attempt to 

predict personality differences among Negroes 
on the basis of prejudicial attitudes. 

The research to be reported here was, in part, 
an attempt to determine whether the conclusions 
in support of the theory of identification with 
the aggressor, drawn from Sarnoff’s study of 
Jewish anti-Semitism, might be applicable to the 
phenomenon of anti-Negro bias among Negroes. 
Accordingly, effort has been exerted to make the 
general design and the hypotheses of this study 
as similar as possible to those of Sarnoff. 


Hypotheses 


Three broad areas of personality differences 
between Negro subjects who accept and those 
who reject anti-Negro statements were explored. 
It was hypothesized that Negroes scoring high in 
anti-Negro bias would show a concomitantly high 
incidence of negative attitudes towards the 

_ Parents (Hypothesis 1), of negative attitudes 
towards the self (Hypothesis 2), and would be 
more prone to react passively in the face of hos- 
tile interpersonal attack (Hypothesis 3). On the 
other hand, Negro subjects scoring low in anti- 
Negro bias were expected to show a high inci- 
dénce of positive attitudes towards the parents 
(Hypothesis 4) and towards the self (Hypothesis 
5), and to be more prone to direct retaliation 
When attacked (Hypothesis 6). Twenty-three 
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specific predictions about personality variables 
were derived from these six major personality 
hypotheses.’ 

METHOD 


Subjects ? 

The subjects for the research were 160 Negro, 
male college students. Three southern groups (85 
subjects) and two northern groups (75 subjects) 
were tested in five different college locales through- 
out the country. Faculty members who had pre- 
viously helped in similar research were contacted 
and their cooperation was secured,* 

The total group of 160 students proved to be a 
broad *and heterogeneous sample with respect to 
demographic variables. Using Edward’s (1950, p. 
135) procedure to test the significance of transformed 
z' scores on several populations by way of the chi- 
square distribution, it was determined that these 
five smaller samples were randomly drawn from a 
statistically common population. 


Measurement of Anti-Negro Bias 


Anti-Negro bias was measured with a 7-point 
Likert scale composed of anti-Negro items (Steck- 
ler, 1957) and new pro-Negro items. A variety of 
steps were taken to control for acquiescence response 
set and other extraneous measurement variables. In 
an attempt to control the total scale format for 
possible acquiescence set interference, the anti-Negro 
and pro-Negro items were randomly mixed with 
items from the E Scale (Adorno, Frenkel-Brunswik, 
Levinson, & Sanford, 1950) and the forward and 
reversed F Scales (Christie, Havel, & Seidenberg, 
1958) to make up a 60-item scale. 

In addition, the individual anti- and pro-Negro 
items were subjected to a scoring reversal system 
so that, in order to be placed at either extreme of 
the bias continuum, subjects had to be highly con- 
sistent in their racial attitudes. Thus, subjects ulti- 
mately considered to be high in anti-Negro bias, 
were those who had accepted anti-Negro items 
while rejecting pro-Negro items; similarly, subjects 
designated as low in anti-Negro bias were those who 


3 Sarnoff’s detailed derivation of these general hy- 
potheses and the specific predictions from the con- 
cept of identification with the aggressor will not be 
repeated here, in the interest of brevity. The person- 
ality variables subsumed under each of the hypoth- 
eses as presented in Table 1 are self-explanatory. 

4 The author wishes to express his appreciation to 
the following people, all of whom were generous in 
their expenditure of time, advice, and help in gath- 
ering subjects, or enlisting the aid of Negro graduate 
student confederates to administer the protocols: 
Paul I. Clifford, University of Atlanta; James A. 
Bayton, Howard University; Albert and Bernice 
Lott, University of Kentucky and Kentucky State 
College; Leonora C. Lane, Central State College, 
Wilberforce, Ohio; Robert Bierstedt, New York 
University; and Kenneth B. Clark, City College of 
the City University of New York. 
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both rejected anti-Negro items and accepted pro- 
Negro items. Thus, random responses tended to 
cancel each other out. 

The individual items were subjected to the follow- 

ing procedure: 
° 1. All subjects were ranked by their scores on 
Steckler’s AN scale, and the discriminatory power of 
each item was determined. All of the original items 
still proved satisfactory with this diverse group.5 
(See Table A, in ADI supplement.) 

2. Next the subjects were ranked on the basis of 
their scores on the new pro-Negro items, and again 
the discriminatory power of each item was deter- 
mined. (See Table B, in ADI supplement.) 

3. After dropping the one new item that did not 
adequately discriminate between high and low 
quartiles at this stage, the subjects were ranked by 
the sum of the remaining items in both scales. 

4, Finally, the discriminatory power of each item 
in the merged distribution vis-a-vis that distribution 
was determined, and items which did not sufficiently 
discriminate between high and low quartiles were 
dropped. (See Table C in ADI supplement.) 

The final result was a scale of 20 positively and 
negatively phrased items with a split-half reliability 
of +.750, corrected by the Spearman-Brown proph- 
ecy formula. 


Measurement of the Personality Variables 


The specific personality variables subsumed under 
each of the six general hypotheses were rated present 
or absent for each subject on the basis of paper- 
and-pencil test material. Five Thematic Appercep- 
tion Test (TAT) cards (Cards 1, 6BM, 7BM, 3BM, 
and 12M), and 50 items from the Michigan Sen- 
tence Completion Test were rated according to 
strictly defined criteria by two raters working with- 
out knowledge of each other’s judgments, In cases 
of disagreement, the responses in question were 
submitted to a judge who, knowing only that the 
raters had disagreed, made a final judgment using 
the same criteria. 

The projective instruments in this research were 
handled in a manner that is different from the com- 
mon practice in which the subject’s words Provide a 
stimulus for the interpretive skill of the clinician 
who attends to the symbolic meaning (assumed to 
be) underlying the actual words of the response. 
Rather, in this study exact criteria for the presence 


5A 5-page supplement containing the scale items 
and detailed statistical data (quartile means and 
discriminatory power values, overall mean values, 
etc.) about this item analysis procedure has been 
deposited with the American Documentation Insti- 
tute. Order Document No. 8527 from ADI Auxiliary 
Publications Project, Photoduplication Service, Li- 
brary of Congress, Washington, D. C. 20540. Remit 
in advance $1.25 for microfilm or $1.25 for photo- 
copies and make checks payable to: Chief, Photo- 
duplication Service, Library of Congress. 

€ An unpublished test developed at the University 
of Michigan, and used through the courtesy of Irv- 
ing Sarnoff, personal communication, 1961. 
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of each response were specified in an attempt to 
make the coding and rating of the variables as ob. 
jective as possible. An example of this can de seen 
most easily in the use of the TAT in which o 
direct fantasies of death of a parent or sui 
the hero met the criteria for the unconscious oa 
pression of negative parental or self-attitudes, This 
procedure led to an overall agreement of 97.36% 
between the two raters.7 


Summary of Test Battery and Experimental 
Design 


In the first pages of the final test booklet, each 
subject was presented with the 60-item Likert atti- 
tude scale described abovs. Fifty sentence-comple. 
tion items and 5 TAT pictures followed, with ade 
quate space for written responses. Finally, a per 
sonal data sheet was appended in order to elicit 
information about each subject’s age, residence, 
socioeconomic status, etc, Included were questions 
designed to elicit specific information about the 
subject’s attitudes towards “sit-ins,” and his actual 
participation in desegregation activities and organiza- l 
tions. 

Upon return to the investigator, the completed 
protocol was separated into its component parts and 
each part was subsequently handled separately. 

The subjects were ranked on the basis of theit 
final scores on the revised measure of anti-Negtd 
bias, and the highest and lowest thirds of the dis- 
tribution were designated as high- and low-bias 
groups. Except to determine that a curvilinear dis- 
tribution did not occur, the middle third was not 
considered for any subsequent statistical operations. 

Finally, for each personality variable, a count 
was made to determine the number of high- and 
low-scoring subjects who were rated as having that | 
variable present, The significance of these differences 
was determined through the use of the chi-squaté 
distribution, 

„At all stages, independent statistical determina- 
tions were made for the northern, southern, aml 
total groups, 


Major Hypotheses 


As only four of the series of specific sub 
hypotheses were significant beyond the .05 level, 
all major hypotheses (1-6) were rejected (se 
Table 1). : 

These data did not support the expectation” 
that broad personality differences could be pres 
dicted on the basis of the degree of anti-Negt® 
prejudice held by Negroes, 


1 Thanks are here expressed to Murray Garfinkel 7 
and Vera Henriquez who served as rater and in | 
respectively. Perhaps it should be added that all 0%- 
the raters felt uncomfortable laboring under 
constraints and were often driven to comment É, 
even though a given response did not meet i 
stated criteria, they knew clinically (i.e, intuitively) 
that the variable was Present. 


| 
| 
ReEsutts | 


Brier ARTICLES 


It may be noted that there is little likelihood 
that Type II (beta) error was committed here. 
The pdwer efficiency of a statistical test is, in 
part; a function of a sample size (Cohen, 1962). 
With an N as large as 160, and using one-tailed, 
“fourfold chi-square tests with p = .05, regardless 
of the magnitude of the effect in the population, 
these null hypotheses may be accepted with con- 
fidence. 

The hypothesized relationship between atti- 
tudes towards the father and degree of anti- 
Negro bias was sustained in these Negro sub- 
jects. That is, subjects low in anti-Negro bias 
were rated as demonsteating a positive percépt 
of the father more frequently than subjects high 
in anti-Negro bias (x? = 4.48, p = .05). Further, 
the opposite relationship obtained when negative 
attitudes towards tle father were independently 
rated,® that is, subjects high in anti-Negro bias 
expressed negative attitudes about their fathers 
with significantly greater frequency than did 
subjects who scored low in anti-Negro prejudice 
(a= 3:76) p =.05): 

The predicted relationship was also sustained 
with respect to the variable, fear of rejection; 
subjects high in anti-Negro bias were rated as 
having expressed such fear with significantly 
greater frequency than were subjects low in such 
bias (x? = 4.10, p = .05). 

Finally, subjects scoring low in bias were rated 
as having expressed optimism and an absence of 
vague fears of the future more frequently than 
were subjects high in anti-Negro bias (x° = 
6.62, p = .01). 


Incidental Findings 


No significant relationships were found between 
scores on the modified anti-Negro bias measure, 
and the following demographic variables: age, 
college class level, family income, social status 
scale rankings, college grades or major subject, 
and number of siblings. 

Significant North-South differences did not 
occur in either the tests of the major hypotheses 
or in scores on the anti-Negro bias measure. 


8 Although the same sentence-completion items 


, Were often used to rate specific personality variables 


which might have either a positive or negative 
valence, the raters were instructed to examine all 


' the protocols for the presence of one variable before 


going on to another variable that might be consid- 
ered the opposite of the first. Thus, for example, 
absence of negative attitudes towards the father 
did not mean that a subject was credited with posi- 
tive attitudes towards the father. Before being 
credited in any category a response had to inde- 
Pendently meet the scoring criteria for that category. 
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TABLE 1 


DIFFERENCE BETWEEN THE NuMBER OF HIGH- AND 
Low-Scorinc SUBJECTS RATED As HAVING THE 
PERSONALITY VARIABLES PRESENT 


Anti-Negro bias 
‘ou. 
Hypotheses (1-6)* RRA x? 
High | Low 
1, Negative attitudes, parents 
Death of mother (TAT)* 2 1 004 
Death of father (TAT) 8 6 03 
Death of parents (TAT) 1 1 53 
Death of all parent figures 
(TAT) 11 8 09 
Negative percept of mother 7 8 134 
Negative percept of father 11 3 3.76* 
Fear of parental disapproval 11 6 1.12 
Negative percept of home 6 7 135 
2. Negative attitudes, self 
Self-negation 18 13 72 
Psychic stress 22 20 04 
Self-derogatory remarks 11 9 103 
Fear of rejection 14 5 4,10* 
Fear of future 19 18 00 
3. Passive response to aggression 
Passive response, aggression 30 21 2.41 
Suicide of hero (TAT) 3 4 614 
Suppression, retaliatory 
desire 12 8 32 
4, Positive attitudes, parents 
Positive percept, mother 23 22 00 
Positive percept, father 5 12 4.48* 
Positive percept, home 26 28 +33 
5, Positive attitudes, self 
Self-assertion 5 8 .20 
Absence of fears 1 7 6.62%" 
Favorable remarks about 
self 14 14 «00 
6. Active response to aggression 
Direct retaliatory action 30 30 «00 


aFor Hypotheses 1, 2, and 3, high group frequency pre- 
dicted to exceed low group frequency; for Hypotheses 4, 5, and 
6, low group frequency predicted to exceed high group frequency. 

bN = 53 in each group (high and low thirds of 160 ranked 
subjects). 

e Unless TAT is indicated, variable was rated from sentence- 
completion items. 

Also not significant with Fisher's exact test. 
*p< .05. 
wep = 01. 


Further, northern and southern samples did not 
differ in authoritarianism or ethnocentrism scores, 

The reports of organizational membership and 
participation in antisegregation activities were 
subjected to chi-squares in much the same way 
as were the personality variables. In the general 
context of the major hypotheses, it would be 
expected that subjects low in anti-Negro bias 
would be more likely to be members of such 
groups as the National Association for the Ad- 
vancement of Colored People (NAACP) or the 
Congress of Racial Equality (CORE) than would 
Negro subjects high in anti-Negro bias. In the 
case of each organization the results were in the 
expected direction, and significantly so when 
CORE membership was considered (x? = 5.48, 
p=.01). 

Further, it would be expected that low-scoring 
subjects would be more likely to have partici- 
pated in demonstrations. This expectation was 
sustained (x? = 3.24, p = .04). 
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It should be noted that these data concerning 
organizational memberships and participation in 
demonstrations (and the associated probability 
levels) may have only limited validity, as it was 

enot possible to determine the extent to which 
these subjects concealed their memberships or 
activities through fear of possible retaliation, 
even though they were told that the booklets 
were to remain anonymous. 

One final incidental finding: In the analysis of 
the sentence-completion and TAT protocols di- 
rect and obvious racial statements were occa- 
sionally noticed. In the hope that relative fre- 
quencies of these comments might be of interest, 
all projective protocols were rated for the pres- 
ence of responses indicative of positive or nega- 
tive attitudes towards Negroes, as well as towards 
whites. 

There were no positive remarks about whites 
expressed in the protocols of the 160 subjects in 
the total sample. Low-scoring subjects made 
significantly more spontaneous antiwhite remarks 
(x? = 3.36, p = .04), and significantly more pro- 
Negro remarks (x? = 5.50, p=.01); both fnd- 
ings are consistent with the overall theoretical 
position taken in this study. The reverse Position, 
that is, that subjects scoring high in anti-Negro 
bias would be more prone to make anti-Negro 
remarks, does not find support in the data. How- 
ever, this finding may be a reflection of the 
apparent greater freedom with which low-scoring 
subjects express any racial comments (x? = 7.99, 
p= .005).° 


Discussion 


Three broad areas of personality differ- 
ences between Negroes high in anti-Negro bias 
and Negroes low in such bias were hypothesized 
on the basis of the theory of identification 
with the aggressor. The theory was extended 
by Sarnoff from the realm of individual 
psychopathology to the study of Jewish anti- 
Semitism, and he reported acceptance of hy- 
potheses which stipulated that Jews high in anti- 
Semitism differ from Jews low in anti-Semitism 
in attitudes towards the parents, attitudes to- 
wards the self, and methods of dealing with 
hostility directed against the self by others. In 
this research, similar hypotheses did not yield 


® The significance associated with this contingency 
table must be considered an approximation as the 
cell entries may violate the principle of independent 
observations. This is so because in one or two cases 
the same subject made racial comments scorable in 
different categories; thus, when the cell entries are 
summed over all categories, the same subject is 
counted more than once. 


a 
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similar results with Negro subjects and all major 
hypotheses were rejected. Í 

The rejection of these hypotheses indicates 
that, within the limits of the design used, i 
cannot be said that the theory of identification 
with the aggressor is supported by data dram 
from a diverse sample of male American Ne 
groes. Further, the conclusions drawn from the 
study of Jewish anti-Semitism have been shown 
not to be directly applicable to another minority 
group. 

While real personality differences between Ne- 
gro and Jewish ethnic groups may exist, it is 
most parsimonious to explain the differences be- 
tween the current findings and those of Sarnoff 
in terms of research design and execution. A diss 
cussion of the methodological differences be 
tween the two studies, especially with regard to 
sampling procedures, the measurement of preju: 
dice, and statistical procedures, is available and 
need not to be repeated here (Maliver, 1963), 
To the extent that the hypothesized reat 
between personality variables and minority group. 
attitudes depend solely on statisticai relation 
ships between measured verbal factors, it seems 
evident that the method does not afford a good 
test of the theory. Pa 

The positive findings of this study, combining 
as they do a pattern of verbal and behavioral 
elements, do offer a measure of support for the 
concept which lies at the heart of the theory. 

It will be recalled that a high degree of at 
ceptance of anti-Negro statements was associate 
with a negative view of the father and general- 
ized fear of rejection by adult figures. This them 
may be seen as affecting behavioral spheres in 
a lack of positive identification with those social 
forces which are critical of the Negro’s low 
estate and willing to take positive strides to 
change the environment. On the other end of 
the bias continuum, a low degree of acceptance 
of anti-Negro statements is associated with 4 
Positive percept of the father, an absence 0 
conscious fears of the future, and more active 
identification with the current movement t0 
effect social changes. Stated conversely (and 
keeping in mind all the dangers of reasoning 
from a converse), the validation of the predic- | 
tions with respect to the father suggests that 
for these Negro males, strong positive or nega- h 
tive feelings towards the father may result i 
identification with the Negro in general, or 
unwillingness to be so identified. 
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PERSONALITY FACTORS ASSOCIATED WITH THE ACQUISITION 
OF THE CONDITIONED EYELID RESPONSE 


E. WAYNE HERRON + 


University of Wisconsin-Milwaukee 


The Holtzman Inkblot Technique (HIT) was administered to 16 Ss who did 


not acquire the eyelid conditioned response (CR), 


and to 16 matched Ss 


who did acquire the eyelid CR, Results showed that HIT Anxiety and HIT 


Integration scores were sig 


nificantly higher for those Ss who acquired the CR. 


These results extend to a projective measure of anxiety the previous research 
showing conditionability to be related to paper-and-pencil measures of anxiety. 
The relationship of HIT Integration to conditionability was interpreted as 
further evidence that cognitive factors are relevant in the eyelid conditioning 


E situation, 


Spence and Spence (1964) emphasize the ef- 


. fects of the personality variables of Manifest 


Anxiety and Neuroticism on performance in the 


_ eyelid conditioning situation, while rejecting the 


hypothesis that conditionability is related to the 
Personality variable of Extrayersion-Introversion 
(Franks, 1961). However, virtually all the re- 
Search investigating the relationship of person- 


| Appreciation is expressed to John J. Porter for 
his cooperation in making these subjects available. 


ality variables to performance in the eyelid 
conditioning situation have utilized paper-and- 
pencil measures of personality. 

The recent development of the psychometri- 
cally sound Holtzman Inkblot Technique (HIT) 
now makes it possible to get a reliable projective 
measure of six orthogonal perceptual-cognitive 
personality factors. These six factors have been 
shown to be present in 14 different subject pop- 
ulations (Holtzman, Thorpe, Swartz, & Herron, 


1961). 
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TABLE 1 
Means, STANDARD DEVIATIONS, AND INTERCORRELATIONS (NV = 32) ; : 
Variable 1 2 3 4 5 6 7 8 9 10 | 11 | 12 | 13 | 44 i 
ra ee 
M: 20.6 | .9 | 46.2 | 48.5 | 188 | 3.6 | 4.8 | 10.8 | 30.3 | 21.7 | 2.7 | 66 52) 
SD: 2.7 | 21 | 146) 4.7 | 97] 36 | 49 | 59 | 82] 6.2 | 26 | 42 30 
1. Conditionability i 
2. Sex 00 
3. Age 230 | —.25 
4, Rejection 09 | —.13 | —.02 
5. Location —.22| —.05 | —.27| —.20 
6. Form Appro- 
priateness 02) —14) .21| .29] 53 
7. Color «416| .08| —.15 | —:22 | —l04] .02 k 
8. Shading 27! —.05| .09| —:07| :o8| (07| .16 
9, Pathognomic 
Verbalization -25| 10) —.06 | —.01 | 5.12] —.28 | .20 "4 
10, Integration «42| 21| .00| 02| 260| —154| 12 -60 
11. Human «418| 09 | —.11 | —.20 | —:27 | —:37 | —115 28 | .63 
12. Animal =01| 10| 23) 08] 131] (28 | =:14 09 | —10} —.08 
13, Anatomy —25| .23| .06| —.28 | —01] ‘05 | =.05 03 | —.11 | —08| —.04 ji 
14. Anxiety «26 | —.17 | —.22 | —.25| <06 | —:22 | [14 25 | .26| 13| .16| .02 
15, Penetration +08) —.07 | .00| —.29 | —:21 | —:32 | ‘02 23 | 28| 32| —132| :32 | 15 
> 


Note.—Correlations involving conditionability and sex are point biserial: nonconditioners and males coded 0; conditioners and 


females coded 1. 


The purpose of the study reported here is to 
see if a projective measure of “neuroticism” 
(HIT Factor III) is related to the acquisition of 
the eyelid conditioned reflex, 


PROCEDURE 


Sixteen subjects who failed to acquire an eyelid 
conditioned response (CR) (less than 15% CRs in 
60 trials) were matched as to sex and experimental 
condition imposed in the eyelid conditioning situa- 
tion with 16 subjects who did show acquisition of 
the eyelid CR (voluntary responders eliminated). 

Subjects were selected and tested as follows. All 
subjects had taken part in a study by Porter, Engel, 


TABLE 2 


PARTIAL CORRELATION COEFFICIENTS BETWEEN HIT 
FACTOR MARKER VARIABLES AND CONDITIONABILITY 


Factor Marker variables r123 
I Integration .44* 

Human .22 

Color .22 

Hal Shading 26 

1 Pathognomic Verbalization 30 
xiety 36* 

Iv Location =s 

Form Appropriateness —.05 

Rejection 10 

V | Animal <06 

Penetration .08 

VI Anatomy —.28 


Note Holding age constant, Conditioners coded 1, 


Brady, and Kropp (1964) into the effects of condi- 
tioned stimulus (CS) duration and task instructions 
on extinction of the eyelid CR. Using Porter's list 
of subjects, 84 students in the general psychology 
course at the University of Wisconsin-Milwauket 
were invited to take the HIT as part of a normality 
study of UW-M students, Of the 84 subjects int- 
vited to sign up to take the HIT (they thought 
they were a random sample of psychology students); 
66 did so. Of the 66 scheduled to be tested at ont 
of six different testing sessions, 60 actually appeared 
and were administered the HIT by the standardized 
group method (Swartz & Holtzman, 1963). Of the 
84 subjects invited to be tested, 24 (29%) had 
been classified as nonconditioners by Porter a 
Of the 60 subjects who actually were tested, 1 
(27%) were classified as nonconditioners. Since ag 
is also considered to be a factor related to tht 
acquisition of the eyelid CR (Kimble & Pen 
packer, 1963), age and date of birth were also ou 
lected at the time the HIT was administered. Ti 
HIT protocols were scored by a research as 
who was ignorant of the subjects’ classification 8$ 
to conditionability. tie 

The marker variables for HIT Factors I-VI a | 
lized for correlational analysis were selected on i 
basis of the factor matching studies of Ho 
et al. (1961, pp. 146-167). However, certain mat! ; 
variables were not used in the analysis. HIT Fac 
I marker variables, Movement and Form Defni oa 
ness were not used because they load on pe, 
than one factor; and Popular was not used be 
of its generally low split-half reliability. Hostility | 
was dropped as a marker variable for Factor a 
because of the changes in magnitude of loading f 
Factor III for the different populations used in | : 
factor matching studies (Holtzman et al, 1961, P 
152). One of the marker variables for Factor i ; 
Reaction Time, is lost as a variable in the gt! 
method of administration. 
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RESULTS 
Table 1 shows the means, standard deviations, 


L and int&rcorrelations of the 12 HIT marker vari- 


ables and subjects’ age, sex, and conditionability. 
‘The correlations involving sex and condition- 
ability are point-biserial with males coded 0 
and females coded 1, and nonconditioners coded 
0 and conditioners coded 1. Since age and con- 
ditionability correlate .30, Table 2 gives the 
partial correlation coefficients, holding age con- 
stant, between the HIT variables and condition- 
ability. Tgblee2 shows that conditioners have 
higher scores (p< .05) on HIT Integration, a 
marker variable for Fa€toreI, the cognitive fac- 
tor; and that conditioners have higher scores 
(p < .05) on HIT Anxiety, a marker variable for 
Factor III, the factor indicative of emotional 
disturbance, although not necessarily of psy- 


* chotic proportions. No marker variable for Fac- 


tors II, IV, V, or VI shows a significant differ- 
ence between conditioners and nonconditioners. 
However, there does seem to be a trend for 
conditioners to have higher scores on the marker 
variables for Factor II, the factor indicating 
sensitivity to the stimulus properties of the blots. 


is Discussion 


The results show that the hypothesis that the 
marker variables for Factor III, the Neuroticism 
factor, are positively related to conditionability 
was confirmed. Anxiety is significantly related 
(b<.05) to conditionability and Pathognomic 
Verbalization nearly reaches statistical signifi- 
cance in its relationship. It is interesting to 
note that while HIT Anxiety is positively related 
to eyelid conditionability, it does not seem to 
be related to Manifest Anxiety (Holtzman et al., 
1961, p. 180) which also has been shown to be 
Positively related to eyelid conditionability. This 
result is consonant with the idea that two vari- 
ables may correlate 0, or even negatively, and 
still show a positive relationship to the same 
Psychological construct. For instance, the con- 
sumption of rice may be negatively related to 
the consumption of wheat in a particular indi- 
vidual, yet the consumption of both would be 
Positively related to the degree of hunger, when 


. 4 number of individuals are considered. 


The significant relationship of Integration, a 
marker variable for Factor I, to conditionability 
Was not predicted. However, the relationship is 
compatible with the finding by Spence (1963) 
and Porter et al. (1964) that differences in in- 
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structional set are related to the rate of extinc- 
tion of the eyelid CR. HIT Integration is a` 
measure of a person’s success in organizing the 
elements of an inkblot into some meaningful 
relationship; this success often requires that, 
elements of the inkblot which are incompatible 
to that organization, hence distracting, be ig- 
nored. Presumably, the effect of the instructional 
sets of Spence (1963) and Porter et al. was to 
facilitate the elimination of distracting elements 
from the eyelid conditioning situation. The re- 
sults of Porter et al. show that the elimination 
of distracting elements in the eyelid conditioning 
situation significantly speeds up the rate of ac- 
quisition of the CR and slows down the extinc- 
tion of the CR. Hence the conclusion seems 
tenable that success in achieving a percept based 
on organized ideation, a personality characteris- 
tic that is relatively stable over a period of at 
least 1 year (Holtzman et al., 1961, p. 137), 
can influence performance in the eyelid condi- 
tioning situation. Therefore, one more person- 
ality variable, in addition to Manifest Anxiety, 
Neuroticism, and possibly Extraversion—Intro- 
version, should be added to the list of personality 
characteristics influencing performance in the 
eyelid conditioning situation. 
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CONFIDENCE, UTILITY, AND SELECTIVE EXPOSURE: 
A PARTIAL REPLICATION 1 


JONATHAN L. 


Stanford University 


The study attempted to replicate an experiment by Canon in which more use- 
ful information was preferred to less useful, and high-confidence Ss preferred 
dissonant information while low-confidence Ss preferred consonant, The present 
results replicate the utility finding but do not replicate the finding with respect 


to confidence. 


Several recent papers have suggested a’ some- 
what new way of considering the general prob- 
lem of whether people prefer supportive or non- 
supportive information (Festinger, 1964; Freed- 
man, 1965; Freedman & Sears, 1963). These 
authors have pointed out that in most cases in 
which selectivity of exposure has been studied 
the subject is not avoiding dissonance, but is 
attempting to reduce it. In the typical study the 
subject takes a position on an issue and is then 
asked to indicate his preference among a number 
of articles some of which support his position 
and some of which attack it. Under these circum- 
stances dissonance would ordinarily be aroused 
merely by the knowledge that discrepant articles 
exist, and the subject must decide how to reduce 
this dissonance, He may seek out the dissonant 
information and attempt to refute it; or he can 
avoid the dissonant information and attempt to 
use other means of reducing the dissonance. 

Festinger (1964) has suggested that the sub- 
ject’s confidence in his initial position will be a 
major determinant of his choice between these 
alternative courses of action, If the subject is 
highly confident that he is correct, he should 
seek out the dissonant information because he 
will expect to be able to refute it. On the other 
hand, if he has little confidence in his Position, 
he should avoid the dissonant information be- 
cause he will expect it to be stronger than the 
counterarguments he might be able to muster. 
Thus, it is predicted that the subject with high 
confidence will show a greater preference for dis- 
sonant information than will the subject with low 
confidence. 

A study by Canon (1964) was designed to test 
this, as well as the hypothesis that the greater 
the perceived utility of the information the more 
it would be preferred. Subjects took a position on 
an issue under conditions which were supposed to 
make them feel either confident or not confident; 


1The study was supported in part by Grant GS- 
. 196 from the National Science Foundation. 
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and they then indicated"how much they 
like to read each of five articles dealing 
issue. Two of the articles were conson 
their position, two were dissonant, and 
neutral. In addition, the inStructions 
the subject’s task in the study were des 
make either the consonant articles or the 
nant articles more useful. There were 
conditions: high and low confidence, di 
more-useful and consonant-more-useful in 
2 factorial design. 

The results supported the predictions: 
useful articles were preferred to the less 
and the high-confidence subjects prefel 
sonant articles while the low-confidence s 
preferred consonant. All of these dif 
were significant. The only lack of perfe 
the results was in the check on the coi 
manipulation which did not demonstrate 
nificant difference between the high 
condition. 

The present study, which is a som 
tended replication of the one just descril 
conducted for two reasons. The first sl 
cluded only conditions in which either c 
or dissonant information was more 
was felt that the whole issue would be 
if the confidence manipulation could 
shown to operate when the two types of 
tion were equated in usefulness. The 
study included a high- and low-confidene 
for which dissonant and consonant info 
were equally useful. Second, it was felt 
results of the Canon (1964) study were 
portant and had such broad implication: 
whole problem area that it would be 
to provide a replication. 


METHOD 


The procedure was identical to that of the 
study in every detail except that two eq 
conditions were added and subjects were 
for participating. 

The subjects were 116 male students al 
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Junior College.2 They were run in groups from 
4 to 8 with each group containing high- and low- 
confidence subjects in one of the utility conditions, 
y This is a slight deviation from Canon’s procedure 
since he ran completely unmixed groups. 
, The study was described as an investigation of 
the possibility of setting up undergraduate courses 
in business. A fairly lengthy exposition of the pur- 
pose of the study and the history of the problem 
was presented, and the subjects’ task was explained. 
Each subject read four case histories in business 
and indicated which of two or three alternative 
courses of action he thought was the best solution 
to the prahlen? posed in the case. After the first 
three cases, the subject was informed whether or not 
he was correct and how the others in the group 
had responded. In the high-confidence condition 
_ subjects were told that their choices were right on 
three cases and that most of the other subjects had 
gotten them wrong.,In the low-confidence condi- 
tion, subjects were told that their choices were cor- 
rect on only the first case and that most other sub- 
jects had gotten them all right. 

After the fourth case, subjects were not told how 
they had done. Instead the experimenter announced 
that on this case he wanted the subjects to write a 
brief essay? The utility manipulation was based on 
the kind of essay subjects were supposed to write. 
In the consonant-more-useful condition, subjects 
were to present their position on the last case; in the 
dissormnt-more-useful condition subjects had to en- 
gage in a “written debate” in which they would be 
given a sheet containing a number of arguments 

. from the opposing point of view and would be 
required to answer these arguments; and in the 
information-equally-useful condition subjects had 
merely to give their reactions to other evaluations 
of the case. It was reasoned that consonant infor- 
mation would be more useful if the subjects had to 
Present arguments favoring their position; disso- 
nant would be more useful if they had to rebut 
arguments opposed to them since the dissonant in- 
formation would in essence give them advance 
knowledge of these counterarguments; and little of 
the information would be useful if all they had to 
do was evaluate other papers. 

At this point, the experimenter said that before 
the subjects wrote the essays it might be helpful for 
them to see other people’s reactions to the case, and 
that a number of papers on the case by students and 
faculty were available. Since there were a limited 
number of some of these papers, the subject was 
asked to indicate on a sheet how much he would 

_ like to read each of five papers. The titles of these 
five papers indicated clearly that two of them sup- 
Ported the subject's position, two of them opposed 
it, and one was neutral. The subject rated them on 
a scale ranging from 0, not at all interested, to 100, 
maximally interested; and then ranked them from 
1 to 5. After this, the subjects filled out a brief 
poe Er 


* The author is grateful to Ervin Roth for his 
a in arranging for subjects to participate in the 
y. 
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TABLE 1 


MEAN PREFERENCE FOR CONSONANT OVER 
DISSONANT ARTICLES 


Be Consonant | Dissonant | Equal e 
Condition | spore useful | more useful | utility 
High confidence | +16.89 ae —12.66 63 +1.90 (20) 
Low confidence +16,39 (19; —13.66 (20) | +1.33 (19) 
Total +16.64* —13.17* +1.62 


a Figure in parentheses is number of subjects in cell. 
* Different from no preference, p < .05. 
** Different from no preference, p < .01. 


questionnaire containing a question on confidence in 
their decision on the fourth case and various other 
questions, The experiment was then terminated, the 
purpose explained, and the subjects cautioned not 
to discuss the study with anyone for several weeks, 
For a somewhat more detailed description of the 
method see Canon (1964). 


RESULTS AND DISCUSSION 


The major results are presented in Table 1 
which gives the difference in ratings between 
consonant and dissonant articles for each of the 
six experimental groups. As in Canon’s study the 
figures given are based on the deviation of the 
subject’s ratings from the overall mean ratings. 
A positive number indicates that consonant arti- 
cles were rated higher; a negative number that 
dissonant were preferred. As may be seen, there 
was a strong tendency for the subjects to prefer 
more useful information regardless of the posi- 
tion it favored. In the consonant-more-useful 
condition, consonant information was preferred; 
in the dissonant-more-useful condition, dissonant 
information was preferred. Both of these effects 
are significant, and of course the difference be- 
tween the two conditions is also significant (¢ = 
3.74, p < 01). When the information was equated 
on utility, no preference was shown. 

In dramatic contrast to the strong effect of 
perceived utility, the confidence manipulation 
had no discernible effect on the subjects’ pref- 
erences among the various articles. There were 
no appreciable differences between any corre- 
sponding high- and low-confidence groups, nor 
between the total means for the two confidence 
levels. Thus, although the findings with respect 
to utility agree very well with those reported by 
Canon, the earlier results with respect to confi- 
dence did not replicate at all. 

When a highly significant result fails to repli- 
cate, it is essential to consider the possibility 
that the studies were not comparable. The most 
important aspect in this case is the strength of 
the confidence manipulation. Although the identi- 
cal procedure was employed in both experiments, 
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minor differences could have crept in which may 
have weakened the manipulation. This possibil- 
ity may probably. be discounted on the basis of 
the subjects’ ratings of their own confidence. 
Although this question did not produce significant 
differences in the first study, it did show a sig- 
nificant difference between high- and low-confi- 
dence subjects in the present study (¢= 2.72, p 
<.02). Thus, if anything, it must be assumed 
that the confidence manipulation was more suc- 
cessful in the second experiment than in the first. 

The only explicit differences between the two 
studies were that in the second study subjects 
were from Foothill Junior College rather than 
San Jose State, they were paid for participating, 
high- and low-confidence subjects were run to- 
gether, and of course the experimenter was dif- 
ferent. It is difficult to think of any logical rea- 
son why any of these differences would eliminate 
the effect of confidence while not affecting the 
utility manipulation. 

Although a study which finds a significant 
effect is usually, and with good reason, given 
more weight than a study which fails to find an 
effect, the present negative result must cast 
serious doubt on the previous finding. This was 
areal attempt at a replication. As far as possible 
nothing was changed—the same materials, the 
same instructions, the same case histories, the 
same rating scales, the same time intervals, etc., 
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were employed. The confidence manipulation 
appears to have been successful, and perhaps 
most important of all, the utility effect repli: 
cated perfectly. It is not at all clear why ‘the 
two studies produced such strikingly different re- 
sults with respect to the effect of confidence, but! 
it is equally uncertain which result will replicate 
in the future. The analysis in terms of confidence 
offered by Festinger is very logical and appeal- 
ing and it would be very nice if it turned out to 
be correct. At the moment, however, the find- 
ings are contradictory and it remains for some- 
one else to attempt another replication and per. 
haps resolve the discrepancy. 
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The findings of the APA’s Project on Scientific Information Exchange in Psychology call 
attention to the difficulties of the scientist in establishing and maintaining contact with current 
worke«elevant to his interests. Accordingly, the project is sponsoring a series of changes during 
the next 12-15 months which are designed to enhance scientific communication. These changes 

* are planned as limited trials. Moreover, they are wholly supported by the Project and hencg 
* do not cut into our page allotment. 

The Editor has agreed to make possible a test of one of the changes, that is, the titles and 
authors of accepted papers will be listed following the last articles in the journal. Such listing 
will make it possible for our readers to become aware of research many months in the advance 
of its journal publication. At the end of the year’s trial the results of the trial will be evaluated 
and further consideration will be given at that time as to whether the service is worth continuing. 

’ This listing plus those published in the preceding 1965 issues are the entire backlog of manu- 
scripts a@cepted by this journal through July 1965. 
e s: . 


i 1965 (June-July) Manuscripts Accepted for Publication in the 
he Journal of Personality and Social Psychology 


a 

Psychological Balance in Situations of Negative Interpersonal Attitudes: Kendall O. Price, Ernest Harburg, and 
Theodore M. Newcomb*: Department of Psychology, University of Michigan, Ann Arbor, Michigan 48104. 

Connotations of Racial Concepts and Color Names: John E. Williams*: Department of Psychology, Wake Forest 
College, Winston-Salem, North Carolina 27106. 

Extension of Personal Time, Affective States, and Expectation of Personal Death: Paul Wohlford*: Department 
of Psychology, University of Miami, Coral Gables, Florida 33124. 

Assimiliation and Contrast in Interpersonal Prediction with Control for the Interaction of Real Similarity and 
Differential Accuracy: William A. Blanchard*: Department of Psychology, University of Alberta, Edmonton, 


Alberta, Canada. 
Subliminal Perception or Perception of Partial Cue with Pictorial Stimuli: Gerald Guthrie* and Morton Wiener: 


partment of Psychology, Clark University, 950 Main Street, Worcester, Massachusetts 01610. 

Aggression toward Outgroups as a Function of Authoritarianism and Imitation of Aggressive Models: Ralph 
Epstein*: Department of Psychology, Wayne State University, Detroit, Michigan 48202. 

Influence of Fantasy Ability on Attitude Change through Role Playing: Alan C. Elms*; Department of Psychology, 
Southern Methodist University, Dallas, Texas 75222, 

Psychological Correlates of Plateau Hypnotizability in a Special Volunteer Sample: Ronald E. Shor, Martin T. 
Orne,* and Donald N. O’Connell: Department of Psychiatry, University of Pennsylvania, 111 North 49th 
Street, Philadelphia, Pennsylvania 19139. 

Dissonance and the Revision of Choice Criteria: Donald D. Penner, Gordon Fitch, and Karl E. Weick*: Depart- 
ment of Psychology, University of Minnesota, Minneapolis, Minnesota 55455. $ 

Observational Learning of Fear in Monkeys: Judith L. Crooks*: Department of Psychology, University of Penn- 
sylvania, Philadelphia, Pennsylvania 19104. 

_ Aggression Themes in a Binocular Rivalry Situation: Marv Moore*: Department of Psychology, 

l University, East Lansing, Michigan 48823. 

n 


j 


Michigan State 


Analysis of Associative Behavior and Creativity: Klaus E: Riegel,* Ruth M. Riegel, and Robert S. Levine: 
Department of Psychology, University of Michigan, Ann Arbor, Michigan 48104. A 
Effects of Paternal Absence on Sex-Typed Behaviors in Negro and White Preadolescent Males: E. Mavis Hether- 
ington*: Department of Psychology, University of Wisconsin, Madison, Wisconsin 53706. 
Anticipated Effort and Subjective Probability: A. John Arrowood* and Lee Ross: Department of Psychology, 
d University of Toronto, Toronto 5, Ontario, Canada. 
Comment on Waterman and Ford’s Dissonance Reduction or Differential Recall: 
i Psychology, University of Missouri, Columbia, Missouri 65202. 
> Instrumental Affiliative Functions of Facial and Gestural Expressions: 
Psychology, University of Kansas, Lawrence, Kansas 66045. 
, Personality Integration in College Women: Julius Seeman*: Department of Psychology, George Peabody College, 
Nashville, Tennessee 37202. 
Studies in Forced Compliance: I. The Effect of Pressure 
. Face Role Playing and Anonymous Essay Writing: J. 
Helmreich: Department of Psychology, Yale University, 


eee 
i “* Asterisk indicates author for whom address is supplied. 
781 


Clyde Hendrick*: Department of 


Howard M. Rosenfeld*: Department of 


for Compliance on Attitude Change Produced by Face-to- 
Merrill Carlsmith, Barry E. Collins,* and Robert L. 
333 Cedar Street, New Haven, Connecticut. 


Affect Arousal and the Positioning of Recommendations in Persuasive Communications: Howard Leventhal* 
and Robert P. Singer: Department of Psychology, Yale University, 333 Cedar Street, New Haven, Con. 
necticut. 

Inducing Belief in False Confessions: Daryl J. Bem*: Department of Psychology, Carnegie Institute of Technology, 
Schenley Park, Pittsburgh, Pennsylvania 15213. ‘ 

Need and Expectancy as Determinants of Affiliative Behavior in Small Groups: Daniel B. Fishman*: Center for i 
Research in Personality, Harvard University, Cambridge, Massachusetts 02138. 

Attitude and Involvement in the Absolute Judgment of Attitude Statements: Charles D. Ward*: Psychology 


782 List or Manuscripts ACCEPTED [ 
4 


Department, University of Maryland, College Park, Maryland 20742. 

Preference for Information about an Uncertain but Unavoidable Outcome: John T. Lanzetta* and James M. Dris- 
coll: Department of Psychology, Dartmouth College, Hanover, New Hampshire 03755. f 

Personality Characteristics and Motor Activity: Some Empirical Evidence: Curtis D. Hardyck*: Institute of Hu _ 
man Learning, University of California, 2241 College Avenue, Berkeley, California. 

Effects of Social Class and Race on Responsiveness to Approval and Disapproval: David L. Rosenhan*: Educa- 
tional Testing Service, Princeton, New Jersey 08540. i 
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ACHIEVEMENT MOTIVATION TRAINING FOR UNDER- 


ACHIEVING HIGH-SCHOOL BOYS’ 


DAVID A, KOLB? 


Harvard University 


An experiment testing the effect of a training program in achievement motiva- 
tion on the academic performance of underachieving high-school boys. 20 boys 
with IQs above 120 and school grades below C received the training program 
designed to teach characteristics of the person with high n Achievement in 
eaddition to an academic summer-school program. They were compared to a 
control group of 37 similar boys who received only the academic program. A 
6-mo. follow-up revealed no significant differerces between the groups in im- 
provement in school grade average. In a 1.5 year follow-up, however, the 
total grade average of experimental Ss improved significantly more than the 
grades of controls (p< .05). Large social class differences were found. The 
grades of high social class (SES) experimentals improved significantly more 
than those of high SES controls (p<.005) while low SES experimentals did 
not increase more than low SES controls. In the experimental group, im- 
provement in school grades was significantly correlated with participation 
variables—change in n Achievement and a Business Game performance score. 


A substantial body of research using widely 
differing measurement techniques has indi- 
cated that underachieving students are low in 
their motivation and/or concern for achieve- 
mer (Burgess, 1956; Garrett, 1949; Geb- 
hart & Hoyt, 1958). In addition, a number of 


_. studies have demonstrated a small but posi- 


tive significant relationship between achieve- 
ment motivation and academic performance 
in high school (Atkinson, 1958; McClelland, 
Atkinson, Clark, & Lowell, 1953; Uhlinger & 
Stephens, 1960). The present study tests the 
effect of a training program designed to in- 
crease concern for academic achievement in a 
group of underachieving high-school boys. 
‘The content of the training program derives 
from the research on achievement motivation 
carried on by McClelland and his associates 
(Atkinson, 1958; McClelland, 1961; McClel- 
land et al., 1953). A study by Burris (1958) 

1 This research was carried out with funds granted 
by the Carnegie Foundation of New York to David 
C. McClelland of Harvard University, under whose 
auspices this study was conducted. Thanks are also 


due to the Department of Education and officials of 
Brown University whose cooperation and support 


‘made this study possible. George Litwin, Stephen 


Seigel, and Anthony Davids all made significant 
contributions in the development, execution, and 
ahalysis of this program. The statements made and 
ae expressed are solely the responsibility of the 
uthor, 


? Now at Massachusetts Institute of Technology. 


suggests that counseling centered around a 
student’s n Achievement score can produce 
significant changes in his academic perform- 
ance. Burris found that the grades of college 
underachievers who received n Achievement 
counseling improved significantly more than 
those of similar students who received non- 
directive counseling. 

The training techniques used in this pro- 
gram stem from four different experimental 
backgrounds. 

1. Identification: The identification models 
of Kagan (1958) and Hill (1960) maintain 
that learning takes place through emulation 
of effective role models, this learning being 
reinforced through vicarious affective experi- 
ence. Goldberg (1959) has found that identi- 
fication with a positive role model seems to 
be associated with academic improvement in 
underachieving boys. 

2. Expectation: There is a growing body of 
research which indicates that the expectations 
held by the experimenter and subject (Orne, 
1962; Rosenthal, 1963) or the therapist and 
patient (Goldstein, 1962) can measurably af- 
fect the outcome of the experiment or therapy. 
In this study the research team model devel- 
oped by Schwitzgebel and Slack (1960) was 
adapted. Schwitzgebel (1962) has found the 
research team model useful in reducing juve- 
nile delinquency. 
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3. Ideomotor response: The hypothesis that 
thought determines action has a long history 
in psychology. Braid maintained that an idea 
firmly implanted in the mind during hypnosis 
ldter issues into behavior. Through the influ- 
ence of Herbart, nineteenth century educa- 
tional psychology promoted the copybook to 
implant in children’s minds the “right ideas” 
which would lead to “right actions” (Allport, 
1954). William James formulated a think- 
talk-act model recently elaborated by Leary 
(1961b). Perhaps the most sophisticated mod- 
ern statement of this theory is George Kèlly’s 
(1955) role construct theory. He maintains 
that behavior is determined in large measure 
by the way a person construes the world. The 
Burris (1958) study suggests that teaching 
underachievers achievement constructs and 
encouraging them to think in achievement 
terms can lead to better performance in school. 
In the current study the technique is to im- 
plant the idea of achievement by teaching the 
students the n Achievement scoring system 
and to observe the results in action, that is, 
academic performance. 

4. Games: The use of games to simulate 
life situations is becoming more and more 
prevalent in the training of managers and ad- 
ministrators, military leaders, diplomats, and 
others requiring complex skills (Bensen, 
1962; Kibbe, Croft, & Nanus, 1961; Sprowls, 
1962). These games provide a well-defined 
psychologically safe situation where men can 
try out new ways of thinking and behaving. 
In this study the game concept was used in 
two ways: in actual games designed to teach 
achievement skills (eg., Litwin & Ciarlo, 
1961) and as an analytic device in counsel- 
ing to aid in understanding real-life problems 
(Leary, 1961a). 


METHOD 
Setting 


and exciting subject matter, In addition, the plan- 
ners of the program hoped that exposure to college 
living and positive college role models (counselors 


Dav A. Kors 


The project was 6 weeks long, beginning in late 
June and ending in the first week of August, The 
schedule was a 5-day week of classes in history, 
English, and mathematics, with weekend recrea- 
tion at various parks and beaches in the area. The 
boys lived together on three floors of a college , 
dormitory on the campus and ate in the college 
dining hall. 


Subjects 


Fifty-seven boys drawn mainly from public and 
private schools in New England were enrolled in 
the project. These were selected from -95 applicants 
on the basis of IQ (120 or higher) and school 
gradé average (C or lower). Boys with serious 
reading defects were generally not admitted, At- 
tempts were made to screen out applicants with 
serious psychiatric problems. Two general classes of; 
students were recruited: boys from homes where 
sufficient interest, parental background, and finances | 
favored college entrance and boys from homes | 
where these were lacking. Parents of the first type | 
were requested to pay tuition, room, and board, 
Lower-class boys were granted scholarships to cover 
their expenses. About half of the participants 
received scholarships. mo 

A breakdown by social class based on occupation 
(Warner, Meeker, & Eells, 1949) shows 14 boys 
in Class 1, 11 in Class 2, 11 in Class 3, 5 in Class 4, 
6 in Class 5, 2 in Class 6, and 1 in Class 7, ‘The 
mean IQ score (WISC) of the boys was 126 (range: 
109-149) and their mean school grade average wal 
D to D+ (range: F-C+). The average year in 1 
school was 9.0 (range: 7-11) and their average age 
was 14 years (range: 12-16). Stanford Achievement 
Test scores showed them to be at an average grade 
level of 10.9 (range: 8.8-12.7). 


Experimental Design 


Twenty of the 57 students were randomly as- 
signed to the fourth floor of the dormitory. This i 
group received the AMTP in addition to their 
regular summer school schedule, The remaining 37 
students received only the regular summer school 
program. 

Subjects were tested in the following manner. 

Pretesting. On the second day of the summer 
school, 57 subjects were given the following tests: 
the Test of Insight, Form A (this test was an adapta- 
tion of the form used by French, 1958, and was 
scored for n Achievement according to the published 
scoring manuals—Atkinson, 1958); the Stanfor 
Achievement Test; the Taylor Manifest Anxiety 
(MA) scale; and the Mandler-Sarason Test Anxiety 
Scale (TAS). In addition the following data were 
collected for each subject: age, year in school, n 
(WISC), 1961 school grade average, and parents 
Socioeconomic status based on father’s occupation 
(Warner et al., 1949). í 

August 1961 posttest. On the next-to-last day of 
the project, subjects were given the following tests: 
the Test of Insight, Form B, a parallel form; P 
Stanford Achievement Test; the MA scale; an 
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the TAS. In addition all subjects received grades in 
each of the three subjects taught in the summer 
school. AMTP members were ranked on their 
participation in the AMTP and assessed in moderate 
risk-taking ability using the Litwin-Ciarlo (1961) 
Business Game. 

January 1962 follow-up. The midyear school 
grades were collected for all boys who. completed 
the project. 

April 1962 follow-up. At this time 33 of the sum- 
mer-school participants were selected on the basis of 
proximity to Brown University to return for a 
testing session. Twenty-seven boys returned (experi- 
mentals = 10,e controls =17) and were given the 
Test of Insight, Form A. 

January 1963 followSup. School grades weré col- 
lected for: the Spring 1962 semester. Unfortunately 
these were available for only 37 boys (experi- 
*mentals = 12, controls = 25). 

The initial, posttest, and follow-up Insight tests 
scored for n Achievement were included to deter- 
mine whether experimental subjects would learn 
and retain the achievement scoring categories better 
than controls, School grades and Stanford Achieve- 
ment Test scores were chosen as indicators of 
achievement-oriented performance. WISC scores, 
social class, MA scale, and TAS scores were in- 
cluded as potential predictors of change, for ex- 
ample, high MA scale subjects might change more 
than low MA scale subjects, though no specific 
hyp8theses were made. 

To check for n Achievement scoring reliability, 
a sample of 30 Tests of Insight (10 from each 
testing period) was scored by a trained scorer not 
connected with the training program. Scores thusly 
obtained correlated (rho) .85 with the experimenter’s 
Scores, 

School grades were all converted to a numerical 
scale: A= 95, B= 85, C=75, D= 65, E or F= 55. 


Experimental Condition: The Achievement 
Motivation Training Program 


The experimenter lived in the dormitory with 
the AMTP boys and served as their counselor, The 
experimenter behaved in a manner consistent with 
the behavior of a person with high n Achievement, 
50 that the subjects would have a visible high n 
Achievement role model to imitate. 

The procedure changed from the use of external 
rules and discipline to internal control and personal 
responsibility: Structured classes and required meet- 
Ings gradually gave way to individual appointments 
made at a subject’s request and optional group 
Meetings. Responsibility for running the fourth 
„floor and the AMTP was gradually given to the 
boys themselves. After a few initial required meet- 
ings the boys were free to choose how much they 
Wanted to participate in the program. This policy 
reflected the research team contract (see below) and 
was further encouraged by the time limitations of 
the experimenter. It was felt that time would be 
More profitably spent working with interested 
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students than trying to involve those who were not 
interested. 

Initially the boys met with the experimenter in a 
classroom twice a week for 1 hour. As the weeks 
progressed this meeting time was changed to better 
fit the summer-school schedule and meet the neefs 
of the boys. The program is described below in 
detail. 

First session. The first session was concerned 
with negotiating a contract. The role relationship 
emulated was that of a team doing collaborative 
research on the problem of underachieving. The aim 
was to involve the students in the hypothesis 
that the course would improve academic performance 
and achievement motivation thereby creating in 
them *the expectation that the course would work, 
The essence of the contract was this: 


We at Harvard have some ideas about how to 
help the underachiever play the school game 
better. Our goals is to try out these ideas to see 
if they work, and to discover new ways to prevent 
the problems associated with underachievement. 
If you will work with us we think that we can 
in return, introduce you to some strategies which 
will help you in school and, more generally, in 
life. Whether you want to do this or not is en- 
tirely your decision. Your involvement with this 
program is not required; you can participate to 
the extent to which you feel it is useful to you. 


After negotiating the contract, the characteristics 
of a person with high achievement motivation were 
described. Subjects were told that the person with 
high n Achievement has three major characteristics: 
he likes and chooses to take personal responsibility 
for his actions, he takes moderate risks, and he 
likes and attempts to obtain knowledge of the 
results of his actions (Atkinson, 1958; McClelland, 
1961; McClelland et al, 1953). The remainder of 
the session was spent describing the racing car 
game the subjects would play in the next session 
and suggesting how the characteristics of the 
person with high achievement motivation would be 
valuable to a racing driver. 

Second session. The entire second session was 
spent playing the race game. The game consisted 
of a miniature race track around which small 
electric cars could be driven. The “racers” each had 
a transformer which controlled the speed of his car, 
The boys raced against the clock two at a time, 
The boy in each pair with the fastest time was 
declared the winner. If by going too fast the boy 
left the track three times he was disqualified. As 
they raced the boys were asked to keep in mind the 
characteristics of the achieving personality in rela- 
tion to the race, for example, (a) personal re- 
sponsibility—How involved am I in the race? Do 
I care if I win or not? (b) moderate risk taking— 
How much of the time did I take too much of a 
risk? How often was I too cautious? (c) using 
knowledge of results—How well did I use my 
practice trials to judge my ability? 

Third session. This session began with an introduc- 
tion to the process of assessing motivation by 
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analysis of thought (Atkinson, 1958). The boys 
were then shown how their n Achievement scores 
related to their performance in the racing game 

(boys with low n Achievement tended to be dis- 
qualified). 

The discussion then turned to time orientation, 
During the previous evening the boys had estimated 
a 30-second interval on a stopwatch. Subjects with 
high n Achievement tended to see time as passing 
more quickly than subjects with low n Achievement. 
Further discussion centered around future orienta- 
tion, delay of gratification, and ability to control 
impulses, 

Fourth session. This session marked the beginning 
of the shift from structured lecture meetings to 
a program oriented more toward the individdal. In 
essence the session was a renegotiation of the con- 
tract. The group’s commitment to the “research 
team” model was reaffirmed, emphasizing the boy’s 
role in determining the nature of the course. The 
follow-up and the experimenter’s interest in knowing 
the boys’ future grades were discussed. 

The last half of the session was spent discussing 
how thought influences action. The think-talk-act 
model was explained: If you think and talk in a 
certain way you will act that way. In particular the 
direct relationship of achievement thinking to 
achievement behaviors was discussed. At the end 
of this session the boys were given n Achievement 
scoring manuals. 

Fifth session. This session was devoted to a group 
discussion on the analysis of Test of Insight 
stories. Example stories from the protocols of con- 
trol students were read to the class and discussed, 
Following this discussion the students were given 
their own Insight test protocols, They were offered 
help in scoring their protocols for n Achievement, 
The expert scoring was given to them upon request. 
The remainder of the hour was spent in teaching 
the n Achievement scoring system. 

Sixth session. A major theme that occurs over 
and over in the Insight Tests is a conflict between 
achievement and affiliation goals (see Parsons, 1959). 
One of these stories is quoted below. (Written to 
Test of Insight, Form A, Story 4, “Bill may not be 
the best student in his class, but he is the friendliest 
and the best-liked.”) 


He fools around a lot to make friends and 
doesn’t do his school work the best he can, He 
didn’t take school seriously and just had a lot of 
fun. He thinks more about making friends than 
about getting his homework done. He probably 
wants to be a mechanic. His marks will keep 
slipping down and he won't get a good job when 
he gets out of school. 


The session was focused on this conflict and on 
learning to take realistic risks, An article on how 
to assess risks was given to the class members. 
The group discussed risk taking in terms of 
planning goals and strategies in life situations, 
making conscious decisions, taking action to attain 
desired goals, and knowing one’s own ability. The 
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Litwin-Ciarlo Business Game (discussed below) 
which they would all play later that week was 
described. : 

Informal sessions. Beginning on Thursday of the 
fifth week, the group began meeting every evening 
on the lawn on an informal basis. Attendance at 
these sessions was not required and subjects could 
come and go as they pleased. The boys wanted this 
arrangement so they could discuss how the techniques 
they had learned could be applied to specific prob- 
lems in their lives. These discussions were intense 
and seemed to be of considerable benefit to those 
who attended. Approximately 12 out of the 20 
boys attended regularly. The remainder had more 
or less dropped by the wayside. 

Individual counseling sessions. Counseling on an 
individual basis was done informally. The problems 
discussed in individual sessions were usually con- 
cerned with the application of principles talked about’ 
in class to the life of the boy.»These ranged from 


using the principles of moderate risk taking to 


play a better tennis game, to setting realistic goals 
in study tasks, to staying out of arguments at home 

Litwin-Ciarlo Business Game. This training device 
was perhaps the most popular aspect of the whole 
project. It is designed to train a person. to take 
moderate risks and use knowledge of results, and 
to assess his capacity to do so. The subject in this 
game plays the role of a manufacturer and actually 
builds the products he contracts for with tinker- 
toys. He is told that he is the head of a company 
just starting to manufacture three new products—a 
missile, an atomic cannon, and an airplane, He 
must order sufficient parts to build each of these 
products and actually try to assemble them in 
three separate 5-minute production periods. Only 
completed units are purchased, so if he orders too 
many he loses what is invested in the leftover parts, 
and if he orders too few his profit is lower than it 
should have been. The subject makes his decisions 
and assesses risks using two types of information: 
printed statistics giving prices, profits, and ave 
erage construction times for each product and his 
own timed practice construction trials for each 
product. The game takes an hour to play and gives 
Scores on risk taking and use of past performance 
as an indicator of future performance. A couple 
of boys learned to run the game and did a large 
part of the administration for the other boys. 


RESULTS 


Eight of the original 57 boys were not in- 
cluded in the final data analysis. Five of these 
boys (4 controls and 1 experimental) wet 
dropped from the summer school for either 
disciplinary or emotional problems. The othe! 
3 boys (2 experimentals and 1 control) com 
pleted the school but were so withdrawn 
rebellious that they were rated unsatisfactory 
in all three of their courses. 
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‘ Initial Comparison of Experimental and Con- 
f” Groups 


To determine the success of random assign- 

, ments, the experimental and control groups 

swere compared on all variables in the study. 

Using the Mann-Whitney U test, p values 

were greater than .20 (two-tailed) on the fol- 

lowing variables: year in school, age, IQ 

(WISC), n Achievement, 1961 school grade 

| average, average Stanford Achievement Test 

j score, MA,scale, and TAS. A chi-square com- 

parison of the two groups by social’ class 

_ (Classes 1 and 2 versus Classes 3-7) yielded 
| ap value of .50. 


Group Changes 


In Table 1 the n Achievement change scores 
are reported for experimental and control sub- 
jects, pretest to August posttest and pretest 
to April posttest for the 27 subjects who at- 
tended that follow-up. It is assumed that this 
latter group represents an unbiased sampling 

: of the total summer-school population since 
they were selected on nonacademic criteria— 
: proximity of their home to the testing site. 


2 of Experimental and Control 


Note that the experimental group shows sig- 
nificantly greater increases in n Achievement 
Score in both periods, indicating that experi- 
Mentals both learned and retained the n 
Achievement scoring system better than the 
untrained controls. 

Comparison of experimental and control 
changes on the Stanford Achievement Test 
| from the pretest to August posttest indicated 

nó significant differences either for total score 
r any subtest score although in most cases 
` €xperimentals tended to improve more than 
Controls. The two anxiety scales (TAS and 
| MA) also showed no significant differential 

changes. An analysis of the final grades that 
boys received in the summer school 
showed no significant differences. 

Tn comparing changes in school grades, each 
boy received, whenever possible, a change 
ore for five subject areas: English, science, 
foreign language, mathematics, and history/ 
Civics, plus a change score for his total semes- 
ter grade average. Change scores for the sub- 
4 ae areas were the average grade that the 
9OY received in the courses he took in that 
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TABLE 1 
CHANGE IN n ACHIEVEMENT 


M change 
Testing intervals n 4 ish 
Experi | Control A 
group group 
Pretest to August 1961 | 6.72 —.34 | <.005 
posttest (n=18) | (n=32) 
Pretest to April 1962 4.4 —3.8 | <.025 
follow-up (n=10) | (n=17) 


a ManmWhitney JU test, one-tailed probability. 


area. Many times these subject change scores 
were not available because the boy would not 
have taken a given course in both the pretest 
and the follow-up periods. In Table 2, change 
scores for experimental and control groups 
are compared for the January 1962 follow-up 
and the January 1963 follow-up. Note that 
while in the January 1962 follow-up experi- 
mental boys had not improved significantly 
more in grade average than the control boys, 
by the January 1963 follow-up the experi- 
mental group’s improvements in grade aver- 
age were significantly greater than the con- 
trol group’s improvements. In addition, the 
mean experimental improvement increased 
from 1962 to 1963 while the mean control 
group improvement decreased in this period. 
In the subject areas, experimentals improved 
significantly more than controls in English 
while controls improved significantly more 
than experimentals in science in the January 
1962 follow-up (contrary to prediction). 
None of these differences held in the January 
1963 follow-up. 

In the 1963 follow-up, grades were avail- 
able for only 73% of the subjects in the 1962 
follow-up. One might question the validity of 
the 1963 change scores since ordinarily one 
would expect students who improved least to 
drop out in later follow-ups. An analysis of 
the 13 dropouts’ change scores in the 1962 
follow-up showed a nonsignificant trend in 
this direction—8 were below the 1962 total 
mean change and 5 were above. These boys, 
however, were evenly divided between the ex- 
perimental and control groups (experimental 
= 2/4, control = 3/4). Thus we would sus- 
pect that any artificial increase in average 
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TABLE 2 
CHANGE IN SCHOOL Grapes: ALL Boys 
M change 
ə 
School subjects Experi R p 
oe Cade 
Pretest to January 1962 follow-up 
Total grade 6.3 4.1 
average (n = 18) | (n = 31) 
English 8.3 3.1 <.03 
(n = 18) | (n = 30) 
Science —4. <.02 
(n = 10) | (n = 21) í 
Foreign language 
(n = 12) | (n = 14) 
Mathematics 
(n = 17) | (n = 28) 
History/civics $ .9 
(n=7) | (n=20) 
Pretest to January 1963 follow-up 
Total grade 7.1 i <.05 
average (n = 12) | (n = 24) 
English 4.7 
(n= 12) | (n = 23) 
Science 
(n = 6) (n = 15) 
Foreign language 7.8 —0.9 
(x = 7) (n = 10) 
Mathematics 
(n = 12) | (n = 20) 
History/civics 9 „9 
(n= 8) (n = 15) 


* Mann-Whitney U test, one-tailed probability. 
b Contrary to prediction; two-tailed probability. 


change scores would not affect the experi- 
mental and control groups differentially. 


Effect of Social Class 


As described in the section on method, the 
boys in the project were of two types—boys 


TABLE 3 
CHANGE IN n ACHIEVEMENT BY SoctAL Crass 


M change 
Testing interval a 
Experi- | Control |? 
mental | “Group 
High social class (1 and 2) 
Pretest to August 84 —0.3 <.05 
posttest (n= 9) | (n = 9) 
Pretest to April 5.7 —5.8 <.025 
follow-up (n=7)| (n= 5) 
Low social class (3 through 7) 
Pretest to August 5.0 —.6 <.01 
posttest (n = 9) | (n = 20) 
Pretest to April .. -3.2 <.10 
follow-up (n = 3) |n = 12) 


* Mann-Whitney U test, one-tailed probability, 


from homes where interest, parental back 
ground, and finances favored college entran e 
(i.e., high SES boys) and boys from ‘homes 
where these were lacking (low SES boys), 
Because of these differences, all of the change 
comparisons were made by social class (hig 
SES = Classes 1 and 2, low SES= classes 
3-7). In Table 3 are reported the changes in 
n Achievement score divided by social class, 


TABLE 4 
CHANGE IN SCHOOL Grapes: Hicu Socrat Crass Bors 
M change 
School subjects Eel cone car 
Feats group 
| 
Pretest to January 1962 follow-up 
Total grade % 9.5 1.8 <.005 
average (n = 10) | (n = 13) f 
English 9.4 0.7 | <01 
(n = 10) | (n = 13) 
Science 
(n = 3) (n = 8) 
Foreign language 10.5 10,0 
(= 8) | @=3) 
Mathematics 13.0 —12.0 | <01 
(n=10) | (n=11) | @ 
History/civics 7.2 4.6 
ay a 4) (n = 7) | 
Pretest to January 1963 follow-up 
iotalypratieaat: 11.9 19 | <05 
average (n = 6) (n = 11) 
English 9.5 3.7 
(n = 6) (n = 11) 
Science 8.8 7.1 
(n = 5) (n = 8) 
Foreign language 9.8 —2.0 | 
(n = 5) (n = 8) 
Mathematics 11.2 ; 
(n = 6) (n = 10) 
History/civics 6.0 5.2 | 
(n=4) (n = 6) 
^ Mann-Whitney U test, one-tailed probability. 


In both lower- and upper-class groups, expel i 
mental subjects increase significantly mo 
than control subjects. f A 
While Mann-Whitney U tests indicated 2°) 
significant differential changes in Stante 
Achievement Test scores, MA scale, TAS, k 
summer-school grades, there were differen! 
changes in school grades in both the Ja 
1962 and January 1963 follow-up perio% 
These effects are reported in Tables 4 an 4 
for the high SES and low SES grou | 
respectively. Pi 
The total grade average of high SES expe 
mentals improved significantly more than 60” 
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trols in the January 1962 follow-up and in the 
January 1963 follow-up. In addition, high 
SES experimentals improved significantly 
more than controls in English and mathe- 
matics in 1962. No subject areas were signifi- 
cant in the 1963 follow-up. 

The trend reported for high SES experi- 
mentals was reversed for low SES experi- 
mentals. In most cases they improved less 
than control subjects although these improve- 
ments gnly approached significance in science 
and mathematics in the 1963 follow-up, Low 
SES subjects showed no significant differential 
changes in total grade average in either 
follow-up period. 

In Figure 1 jhe total grade averages are 
plotted for high SES and low SES experi- 
mentals and controls for the initial pretest and 
the two follow-up periods. The data reported 
in Figure 1 do not correspond exactly with 
that presented in Tables 4 and 5 since the 
tables use only subjects for whom change 
scores were available whereas the figure uses 
all subjects in each testing period to obtain 
the mean score. The high SES experimental 
group is the only group to show an improve- 
ment in grade average in the 1963 follow-up. 
This group shows a 12-point increase in grade 
average in the 1963 follow-up, moving from a 
D— to a C average, 

Similar change comparisons were made with 
the data divided into high MA and low MA, 
high TAS and low TAS, high IQ and low IQ, 
but no significant differences were found. 


, Analysis of Change within the Experimental 
Group 


To analyze patterns of change within the 
experimental group, seven variables were 
intercorrelated: three predictor variables— 
social. class, initial n Achievement score, and 
WISC IQ score; three participation variables 
—ranking on course participation by the 
counselor, a moderate risk-taking score from 
the Business Game, and the change in n 
. Achievement score from the pretest to the 
August posttest; and the criterion variable— 
change in school grades. Changes in grades 
“in the January 1962 follow-up were used 
since using 1963 changes would too much re- 
duce the already small sample. These correla- 
tions are reported in Table 6. 
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TABLE 5 
CHANGE IN ScHoor Grapes: Low Socar Crass Boys 
M change 
. 
School subjects F a 
Kapel Control 4 
mes group 
Pretest to January 1962 follow-up 
Total grade 2.4 fe 
average (n = 8) (n = 18) 
English 7.0 8 
(n = 17) 
Science 8 <.10 
° (n = 14) 
Foreign language el 
(n = 11) 
Mathematics 6 <.10 
(n= 17) 
History/civics A 
(n= 11) 
Pretest to January 1963 follow-up 
Total grade 2.3 2 
average (n = 6) (n = 13) 
English 7.8 
(n=6) | (n = 13) 
Sea a 
Foreign language! 
NNER —7.6 <.05 
(n = 6) (n = 10) 
History/civics 12.0 4,7 
(n= 4) | (W=9) 


a Mann-Whitney U test, two-tailed probability (contrary 
to prediction). 
b Not enough cases for comparison. 


None of the three predictor variables 
showed significant correlations with change in 
school grades although when SES was divided 
into high and low groups as was done for 


L HIGH SES 
EXPERIMENTAL 
Pale As Low SES 
K / << controt 
Tap ~ 


HIGH SES 
CONTROL 


Ciia LOW SES 


EXPERIMENTAL 


GRADE AVERAGES 
x 
ò 
T 


64 x 
(ea AER 
INITIAL JANUARY JANUARY 
PRETEST 1962 1963 
JUNE 1961 FOLLOW-UP FOLLOW-UP 


Fic, 1. School grade average in pretest and 
follow-up periods. 
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TABLE 6 
EXPERIMENTAL GROUP INTERCORRELATIONS (n = 18) 


Bae Initial icipa- | Business |" Achieve- | Change 
ə class |” pe IQ eos Game Sange in grades 
Predictor variables 
Social class* — 
Initial n Achievement 22 — 
WISC IQ —32 —31 = 
Participation variables 
Course participation> —03 —02 —01 — 
Business Game score —15 06 34 15 — 
Change in n Achievement® 11 02 35 58+ 25 = a 
Criterion variable x ; 
Change in grades? 32 —30 20 27 63% 42% — 


* Scores inverted for ease of interpretation: 7 = high SES, 1 = low SES, 


» All rank-order correlations. 
o Change to August posttest. 
4 Change to January 1962 follow-up. 
* p <.10, one-tailed. 
** p <,02, one-tailed. 
*** b <.01, one-tailed. 


control group comparisons, a median test 
showed that high SES boys tended to im- 
prove more than low SES boys (x? = 7.29, 
$ < .01). None of the predictor variables 
were significantly related to participation 
variables, although there was a weak relation- 
ship between IQ and Business Game and IQ 
and n Achievement change. 

Two of the participation variables can be 
seen as measures of how well the boys learned 
major themes of the course. The Business 
Game score gives an indication of the boy’s 
ability to take moderate risks and use the 
results of his previous experience (i.e., feed- 
back) at the end of the course, This score 
correlated .63 with change in school grades, 
The change in n Achievement score gives an 
indication of how well the boy learned to 
think like a person with high need for 
achievement. This score correlated .42 with 
change in grades, The multiple correlation 
coefficient of Business Game score and change 
in n Achievement with change in school 
grades was .68 (p< -025, one-tailed). The 
third participation variable, the Participation 
ranking by the counselor, did not correlate 
significantly with the criterion, although it 
lid with change in n Achievement. 


Discussion 


The fact that improvement in experimental 
toup grades differs significantly from control 
toup improvement in the 1963 but not the 


1962 follow-up seems to be based mainly on 
a decrease in the control group improvement 
score in 1963. This would suggest that while 
the summer school alone gave some boost to 
the grades, there is a tendency for this effect 
to decay over time, while the addition of the 
AMTP to the summer program seems to pro- 
mote a more permanent and perhaps increas- 
ing improvement in school grade average. 
This effect is seen most strikingly when 
the data are analyzed by social class (see 
Figure 1). 

The significant differences in subject areas, 
especially when controls improve more than 
experimentals, are difficult to explain with 
the data available. i 

The strong differential effect that social 
class had on change in grades requires some 
examination. The fact that only high social 
class experimental boys seemed to benefit sig- 
nificantly from the AMTP might be explained 
in two ways. 

One might reason that the course was de- 
signed to appeal more to boys from the upper 
classes where education is valued more than 
to lower-class boys (Kahl, 1952). The course 
was, in fact, pitched at a rather intellectual 
level and expounded high educational values 
—Harvard, research, and good grades. But if 


this argument were so, one would expect ` 


social class to be positively correlated with 
the participation variables—course participa- 
tion ranking, Business Game score, and 


i 
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change in n Achievement. As is indicated in 
Table 6, this is not the case. In addition, both 
high® and low SES experimentals increased 
significantly more than controls in n Achieve- 
ment score. 

The alternative explanation is not based on 
the differential reaction of high and low SES 
boys to the AMTP but on the different en- 
vironments to which they returned. An under- 
achieving upper-class boy finds himself in 
sharp djssonance with his subculture. There 
is constant pressure: for him to achieve and 
do well in school. ‘The underachiever from 
the lower classes is not pushed by such strong 
values on achievement and hence he is more 
likely to be in harmony with his subculture. 
Hence when the boys returned home after 
learning new techniques for achieving, the 
high SES boys used these techniques to al- 
leviate the tensions their failure to achieve 
had created. Although we would have to 
assume that the low SES boy also felt some 
of these tensions (his parents did care enough 
to send him to the summer school), the aspi- 
rations that the subculture holds for the boy 
are not so high (Kahl, 1952) and thus he 
need not improve his performance as much 
to bring himself in harmony with his sub- 
culture, 

The experimental group’s improvement 
over controls in school grades lends encour- 
aging support to the hypothesis that teaching 
underachieving boys the characteristics of the 
person with high achievement motivation can 
lead to better academic performance. The 


. hypothesis is further supported by the high 


correlation of change in n Achievement and 
Business Game score with improvement in 
school grades in the experimental group. 
This experiment, however, does not allow 
any conclusions about what techniques pro- 
duced the experimental group’s improved 
academic performance. The improvements 
could theoretically be a result of any or all 
of the following: the experimenter’s particu- 


_ lar personality (identification theory), learn- 


ing n Achievement thought categories (ideo- 
motor response theory and n Achievement 
theory), learning to take moderate risks and 
use feedback (n Achievement theory), learn- 
ing to take personal responsibility for actions 
(n Achievement theory), and expectations of 
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improvement created by participation in a 
research project and by the research team 
contract (expectation theory). Further re- 
search should attempt to measure and/or 
isolate the differential effects of these factors. 


REFERENCES 


ALLPORT, G. W. The historical background of 
modern social psychology. In G. Murphy (Ed.), 
The handbook of social psychology. Vol. 1 Read- 
ing, Mass.: Addison-Wesley, 1954. Pp. 3-56. 

ATKINSON, J. W. (Ed.) Motives in fantasy, action, 
ang society. Princeton, N. J.: Van Nostrand, 1958. 

Bensen, O. Simulation of international relations and 
diplomacy. In H. Borko (Ed.), Computer ap- 
plications in the behavioral sciences. New York: 
Prentice-Hall, 1962. Pp. 574-595. 

Burcess, Erva. Personality factors of over- and 
under-achievers in engineering. Journal of Educa- 
tional Psychology, 1956, 47, 89-99. 

Burris, R. The effect of counseling on achievement 
motivation. Unpublished doctoral dissertation, 
Indiana University, 1958. 

Frencu, Exizasetu G. Development of a measure of 
complex motivation. In J. W. Atkinson (Ed.), 
Motives in fantasy, action, and society, Princeton, 
N. J.: Van Nostrand, 1958. Pp. 242-248. 

Garrett, H. F. A review and interpretation of 
investigations of factors related to scholastic 
success in colleges of arts and sciences and teachers 
colleges. Journal of Experimental Education, 1949, 
18, 91-158. 

Gepxart, G. G., & Hoyt, D. P. Personality needs of 
under- and overachieving freshmen. Journal of 
Applied Psychology, 1958, 42, 125-128, 

Gorpserc, Mrr1am. A three year experimental pro- 
gram at Dewitt Clinton High School to help 
bright underachievers. High Points, Jan. 1959, 
5-35. 

Gorpstery, A. Therapist-patient expectancies in 
psychotherapy. New York: Pergamon Press, 1962. 

Hır, W. F. Learning theory and the acquisition of 
values. Psychological Review, 1960, 67, 317-331. 

Kacan, J. The concept of identification. Psycholog- 
ical Review, 1958, 65, 296-305. 

Kant, J. A. Adolescent ambition, Unpublished 
doctoral dissertation, Harvard University, 1952. 
Kerry, G. The psychology of personal constructs. 

Vol. 1. New York: Norton, 1955. 

Kese, M., Crort, C, & Nanus, B. Management 
games: A new technique for executive develop- 
ment. New York: Reinhold, 1961. 

Leary, T. How to change behavior. Paper read at 
the XIVth International Congress of Applied 
Psychology, Copenhagen, 1961. (a) 

Leary, T. Thinking, talking, and doing. Unpublished 
manuscript, Harvard University, 1961. (b) 

Lirwiy, G. H., & Crarto, J. A. Achievement motiva- 
tion and risk-taking in a business setting, tech- 
nical report. Ossining, N. Y.: General Electric 
Company, Behavioral Research Service, 1961. 


792 


McCtettanp, D. C. The achieving society, Princeton, 
N, J.: Van Nostrand, 1961. 

McCtettanp, D. C., ATKrnson, J. W., CLARK, R. A., 
& Lower, E. L. The achievement motive. New 
York: Appleton-Century-Crofts, 1953. 

ORNE, M. T. On the social psychology of the psy- 
chological experiment: With particular reference 
to demand characteristics and their implications. 
American Psychologist, 1962, 17, 776-783. 

Parsons, T. The school class as a social system— 
Some of its functions in American society, Harvard 
Educational Review, 1959, 29, 297-316. 

RosentHat, R. On the social psychology of the 
psychological experiment: The experimenter’s 
hypothesis as the unintended determinant of ex- 
perimental results. American Scientist, 1963, 51, 
268-283. 

Scuwirzceser, R. Analysis and evaluation of the 
experimenter-subject role relationship in the re- 


Davip A. KoLB 


duction of known male adolescent crime. Un- 
published doctoral dissertation, Harvard University, 
1962. a 

SCHWITZGEBEL, R. & Strack, C. Á handbook: 
Reducing adolescent crime in your community, 
Cambridge, Mass.: Authors, 1960. 

Serowrs, R. C. Business simulation. In H. Borko 
(Ed.), Computer applications in the behavioral 
sciences. New York: Prentice-Hall, 1962. Pp, 556- 
573. 

UHLINGER, CAROLYN A. & Srepnens, M. W. Re- 
lation of achievement motivation to academic 
achievement in students of superior ability. Journal 
of Educational Psychology, 1960, 51, 259-266. 

Warver, W. L., MEEKER, M., & Extrs, K. Social 
class in America. Chicago: Science Research As- 
sociates, 1949. 


(Received July 16, 1964) 


Journal of Personality and Social Psychology 
1965, Vol. 2, No. 6, 793-808 


ZERO-POINT SCALING AS AFFECTED BY SOCIAL OBJECT, 


SCALING METHOD, AND CONTEXT* 


JACK M. HICKS? ann DONALD T. CAMPBELL 


Northwestern University 


482 college students served as judges in the exploration of a method of 
determining a rational origin on an interval scale developed by Thurstone and 
Jones. This method is based on Thurstone’s law of comparative judgment. It 
not only requires the judgment of single stimuli, but pairs as well. A basic as- 
sumption is that the scale values of pairs will equal the summation of scale 
values of corresponding components, The stimulus dimensions of birthday gifts, 
traffic violations, and bizarre behavior symptams were studied. Only gifts and 
violations were predicted to bé additive. Each dimension was presented in 
high, neutral, and low context sets. Paired-comparisons and categorical-assign- 
ments methods were employed. Each judge made 210 judgments involving 
instances of all 3 contexts, each represented by a different dimension. Support 
for fhe zero-point model was found in all 3 dimensions. In line with prediction, 
gifts were most satisfactorily additive. Evidence of additivity was found for 
behavior symptoms, contrary to prediction. Strong context effects were found 
under all conditions and modes of analysis. While a small proportion of this 
context effect could be interpreted as a random-error artifact, in bulk it seems 


a psychological effect. The simplest of 4 modes of analysis used, mean ratings, 


s was as effective as an; 
and minimizing context effects. 


The desire to achieve a meaningful zero 
point for psychological measures is a recur- 
rent one. Recently Thurstone and Jones 
(1959) have provided a promising method 
within the tradition of the law of comparative 
judgment, This paper extends the exploration 
of their method by using a wider range of 
social objects scaled, by using several dif- 
ferent scaling methods, and most impor- 
tantly, by examining the extent to which the 
zero-based scale values for a given object 
remain the same when the scaling is done 


“in the context of sets of objects varying in 


scale value. 

Fundamental to the technique of Thurstone 
and Jones is the demonstration that the scale 
value of two stimuli when judged as a com- 
bination be consistently greater than the 
scale values of either stimulus when judged 
alone. This established, it is possible to apply 
the mathematical concept of additivity and 
infer a rational zero point. By assuming that 


‘the scale value of a combination of two 


„ 1 Supported in part by the United States Office 
óf Education Project C-998, Contract 3-20-01, 
under provisions of Title VII of the National De- 
fense Education Act. 
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y of the more complex methods in maximizing additivity 


stimuli, AB, is equivalent to the sum of the 
scale values of the individual stimuli, A and 
B, they take the combined distances of A 
and B from AB as the distance of AB to the 
zero point. That is, if AB is two units above 
A and three units above B, the rational zero 
point may be deduced to be five units below 
AB. With N basic stimuli, N(N — 1)/2 
separate zero-point estimations are possible. 
The crucial test of additivity concerns the 
variability among these zero-point determina. 
tions. The less the variability, the more con- 
vincing is the mean of the zero-point estima- 
tions as representative of a rational origin 
Thurstone and Jones (1959) and a follow-up 
by Shuford, Jones, and Bock (1960) have 
demonstrated the applicability of this re- 
quirement using the method of paired com- 
parisons. 

With regard to the choice of social objects 
scaled, Thurstone and Jones (1959) provide 
a limited and atypical sampling. They asked 
for paired comparisons of five well-known 
consumer articles in terms of preferences as 
“birthday gifts which may not be sold.” To 
achieve subjectivity of judgment, the re- 
spondents were instructed to ignore the ob- 
jective judgmental criterion of resale value, 
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and they did indeed obtain a negligible rela- 
tionship between subjective scale values and 
monetary values of the items. While it would 
seem, therefore, that the objective criterion 
of’monetary value was essentially ruled out, 
it could be argued that the mere availability 
of well-known objective standards disqualifies 
consumer articles as adequate tests of “sub- 
jective additivity.” Therefore, in the present 
study two stimulus dimensions with lesser 
degrees of association with objective “values” 
have been chosen. These are the judged seri- 
ousness of traffic violations and the judged 
bizarreness of behavior symptoms, Birthday 
gifts were also included for replicative 
purposes, 

It was hypothesized that traffic violations 
should be subjectively additive. Due to the 
commonplace occurrence of multiple viola- 
tions committed by the same person, it be- 
comes necessary for one to “add up” the 
component violations in order to estimate the 
total seriousness of the offenses. In this re- 
spect, violations are seen as similar to birth- 
day gifts, which also occur in multiples for 
a single individual. Violations differ from 
gifts, however, in the absence of a single, 
relatively unambiguous objective criterion, in 
that the public is less generally exposed to 
consistent objective outcomes associated with 
traffic violations than with birthday gifts, 
Practically all consumer articles are associ- 
ated with a purchase according to a relatively 
stable retail price, whereas few traffic viola- 
tions result in fines or even apprehension. 
Even when fines do occur, they may vary 
radically according to conditions, such as 
degree of traffic hazard involved. Thus the 
“objective” criteria of amount of fine or 
traffic hazard might be less readily brought 
to bear upon the judgment of the seriousness 
of a traffic violation than the criterion of 
resale price in the case of gifts. In the ab- 
sence of a clear-cut objective criterion, past 
social experience is expected to provide a 
basis for judgments of a predominantly sub- 
jective nature. Assuming stimuli with objec- 
tive judgmental criteria to be more readily 
added than subjective stimuli, somewhat less 
evidence of additivity is expected for traffic 
violations than for gifts. 

The stimulus dimension of “bizarreness of 
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behavior symptoms” was not expected to pro- 
vide subjective additivity. This dimension ap- 
peared to have neither socially anchored 
subjective additive properties nor clear-cut 
objective criteria. In lieu of additivity, an 
“averaging” effect was predicted for the 


behavioral bizarreness dimension. Pairs of 


behavior statements might be judged as small 


samples of a total universe of potential in- 
formation about an individual, rather than 
as exhaustive. Thus an “extremely 
statement combined with. a “mildly bizarre” 
statement might be judgéd as representative 
of a “moderately bizarre” personality. The 
phenomenon of averaging of stimulus com- 
binations has also been suggested by Osgood 
and Sebeok (1954) with regard to adjective- 
noun combinations in their discussion of 
“laws of word mixture.” 

A second major objective of this study 
concerned the effects of stimulus context upon 
zero-point scaling. This was essentially an 
attempt to determine the relative positions 


piar l 


of zero points independently derived from | 


“high” and “low” contexts of a common di- 
mension, Ideally, such zero points would be 
the same, and the resultant absolute scale 
values for a given stimulus would be inde- 
Pendent of the context in which it was 
judged. In general, however, the experimental 
literature suggests a context effect. The 
dependence of specific scale values upon 
stimulus context has been a universal finding 
in psychophysics as well as in the areas of 
affective, esthetic, and social judgments (€2, 


Helson, 1947, 1948, 1959). A “tracer” stimu- 


lus common to two contexts will have a 
lower scale value when judged among stimuli 
higher on the dimension than when judge 
among lower stimuli (e.g., Campbell, Lewis, 
& Hunt, 1958; Krantz & Campbell, 1961). 
Therefore, in terms of the present investiga- 
tion, a given tracer stimulus should decreasë 
in scale value (i.e., scale distance from zero) 
as the context increases in mean magnitude. 
In anticipation, it may be noted that we 
do find such context effects, and in vey 
strong degree. On the one hand, these effects 
may reflect genuine changes in subjective 
valuation as a function of context. On $ 
other hand, they may be but methodological 
artifacts due to the untenability of a basic 


| 


r 
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assumption of all applicable versions of Thur- 
stone’s law of comparative judgment. In 
moving from Thurstone’s Case 1, which holds 
for multiple judgments of a single person, 
to Cases 2, 3, 4, and 5, which apply to pooled 
individuals, one presumably assumes that 
these individuals are homogeneous in their 
discriminatory performance. For those studies 
in which relative metric stimulus values are 
all that is needed, or all that can be checked, 
violations of the assumption introduce very 
minor problems. In, zero-point scaling they 
have differential effects, in differing contexts, 
producing a spurious context effect. In pur- 
,suance of this problem, a synthetic illustra- 
tion of random-error artifacts is provided, as 
well as a subsidfary analysis dividing judges 
into high and low discriminatory efficiency. 

A third objective of this investigation was 
to determine the feasibility of categorical 
judgments methods for the Thurstone-Jones 
zero-pdint model, Although Rimoldi (1956) 
has used combined stimuli in the method of 
successive intervals, his study was not de- 
sigged specifically to solve for a rational 
origin, In view of the greatly reduced labor 
involved in categorical assignments as op- 
posed to paired comparisons, and the prom- 
ising results reported by Jones (1957) in his 
reanalysis of the Rimoldi data for a rational 
origin, this additional exploration was under- 
taken. Moving to still less laborious pro- 
cedures, approaching Thurstone’s method of 
equal-appearing intervals, an absolute judg- 
ments analysis or method of mean ratings 
Was investigated. 


METHOD 
Subjects 


Participating as judges were 482 freshman and 
sophomore college students of both sexes; 317 of 
these judges were students at Wright Junior Col- 
lege, Chicago, and 165 subjects were enrolled at 
Northwestern University. 


Materials 


Stimulus materials were verbal statements of three 
types of objects: consumer articles, trafñc violations, 
and behavior symptoms. One item represented each 
of nine scale values of each stimulus group, and is 
identified in analysis by capital letters A through I. 
The letter A represents the “high” end of each di- 
mension, and I the “low” end. 
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Consumer articles ranged along a dimension of 
“desirability as a birthday gift,” as follows: (A) TV 
set, (B) record player, (C) camera, (D) typewriter, 
(E) transistor radio, (F) leather binder, (G) desk 
lamp, (H) pen and pencil set, (I) dictionary. Items 
E, G, H, and I were borrowed from the Thurstofe- 
Jones investigation and were ranked in accordance 
with those findings. The remaining articles were 
introduced to extend the range of stimuli, and are 
ranked according to relative retail price range. 

Traffic violations ranged along a dimension of 
“seriousness of traffic offense,” as follows: (A) 
driving under the influence of alcohol, (B) leaving 
the scene of an accident, (C) driving after license 
has been revoked, (D) driving 50 miles per hour 
in a 25-mile zone, (E) running a red light, (F) run- 
ning a stop sign, (G) going the wrong way on a 
one-way street, (H) following a car too closely, and 
(I) failing to signal before turning, These items 
were borrowed from an unpublished study by Law- 
rence Ross. The behavior symptoms were taken 
from unpublished work by Jack Gilchrest, who had 
scaled them as to degree of “bizarreness and eccen- 
tricity.” The following statements were used: (A) 
He tried to squeeze his head between the bars of his 
bed. (B) He kept hiding in dark corners and crying. 
(C) He said the green walls of his room were really 
purple. (D) He went on a number of hunger strikes. 
(E) He often said, “I wish I were dead.” (F) One 
day he brushed his teeth six times. (G) He seldom 
smiled. (H) He was found walking in his sleep, 
(I) He often complained of a backache. 


Design 


There were three basic experimental conditions: 
high context, low context, and neutral context. Items 
A through E were classified as high-context values, 
and E through I as low-context. The neutral condi- 
tion consisted of alternate stimuli A, C, E, G, and 
I. Stimulus E was used as a tracer, appearing in 
all three contexts, Each of the three stimulus dimen- 
sions were divided in this manner into high, neutral, 
and low contexts, resulting in nine item sets. These 
nine sets were presented by both categorical assign- 
ment and paired-comparison methods. 

The first stimuli of any one context were not only 
judged individually, but in all possible combinations 
of two, Since there are 10 possible pairs, there were 
15 stimuli in all. In the categorical-assignments con- 
dition 15 judgments were required. However, for the 
paired-comparisons task, not all pairings among the 
15 stimuli were used. Following Thurstone and Jones 
(1959), those pairs in which both members con- 
tained a common element were omitted (e.g., a com- 
parison between A versus AB, or AB versus AC). 
All other comparisons were utilized (e.g, A versus 
B, A versus BC, AB versus CD). There were 55 
pairs resulting. 

Task order effects were controlled by distributing 
contexts and dimensions through a Latin-square de- 
sign. Judgment booklets were assembled in the 
following three sequences: behavior-violations-gifts, 
violations-gifts-behavior, and  gifts-behavior-viola- 


796 


tions. The three contexts of these dimensions were 
rotated similarly, each appearing equally often 
first, second, and third in order of presentation. For 
example, a complete booklet might consist of low- 
context behavior statements, high-context traffic 
vidlations, and neutral-context gift judgments, Each 
judge got one set of each context and one set of each 
type of social object, in a Latin-square arrangement. 
Nine such schedule types result. Paired comparison 
and categorical assignment items were identical for 
each subject. That is, each subject was redundantly 
exposed to each stimulus, but by different methods. 
Methods were alternated in sequence with half of the 
judges making paired comparisons followed by cate- 
gorical assignments, and half giving categorical as- 
signments followed by paired comparisons. The two 
methods of exposing each topic context were juxta- 
posed. This produced 18 different booklet assemblies. 
The stimuli for any given task were arranged in 
a random order, and three different random orders 
were employed. For the paired-comparison tasks, 
Ross’ (1934) method of optimizing orders for the 
presentation of pairs was used. The final product of 
all booklet and stimulus arrangement devices was 
108 (3X6X2X3) types of judgment booklets, 
each calling for 210 judgments involving the three 
stimulus materials and two judgment methods. 


Procedure 


Judgment booklets were prearranged for distribu- 
tion in a restricted random order, Judgment booklets 
contained self-administering instructions and were 
Presented to groups ranging from 11 to 34. Asa 
sample, here are the instructions for the categorical 
judgments of the behavioral stimuli: 


The purpose of this schedule is to investigate 
ratings of statements about human behavior. 
There are five such statements which you are to 
rate individually and in various combinations with 
one another. In the case of both single and com- 
bined statements, assume that they represent be- 
havior protocols of different adult individuals. 
Your task is to rate individuals on the basis of 
their protocols. At the left of the next page next 
to each of the protocols will appear a space where 
you are to write the number of the rating which 
you feel best depicts the individual represented, 
The scale provided below is to be used in making 
your ratings: 

10 
Normal behavior with no signs of eccentricity 
50 


Moderately abnormal behavior, with moderate 
amounts of eccentricity and bizarreness 


60 


Extremely abnormal behavior, with extreme 
degrees of bizarreness 


If you think a statement or statements about an 
individual represent normal behavior, you should 
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place a number in the neighborhood of “40” on 
the line to the left of the protocol. If you feel 
that the behavior is extremely bizarre, then a “60” 
would be more appropriate. Feel free to’ use 
numbers above, below, and in between those indi- 
cated above. The above scale is intended only as 
a guide. Thus, if you have already used a “40” 
and come to a statement still more normal in your 
estimation, then you should use some lower num- 
ber to accurately represent your judgment, Like- 
wise, if you have already used a “60” and come to 
behavior which you consider still more bizarre, 
then you should use some still higher number, 
Please be sure to rate all the response statements 
and in the order presented. You may now turn 
the page and begin. , 


Paired-comparisons instructions for gifts read as 
follows: 


Instructions. In this task you are to make 
paired comparisons of articles which students 
might receive as birthday gifts. There are 5 such — 
gifts which you are to compare individually 
and in various combinations with one another. 
Assume that you do not already possess any of — 
the articles. Consider them as gifts for your own 
personal use, which may not be sold. For each 
comparison, you are to check the article or articles 
which you would, in general, prefer to own. 


Data Analysis 


Analyses of the data involve two discrete steps. 
First is the determination of the scale values of the 
15 points by the ordinary means. These will be 4 
described first for all three methods. Following this, — 
the mode of zero-point determination will be de- 
scribed. 

Paired-comparisons data. The Thurstone and Jones 
(1959) modification of Thurstone’s (1927) law of 
comparative judgment to be used for composite as 
well as single stimuli was adopted in simplified form. 
A detailed report of this technique is given by 
Thurstone and Jones (1959, pp. 203-204). This pro- 
cedure was simplified in the present investigation 
by assuming the same discriminal dispersion regard- 
less of whether or not a paired comparison involved 
composite or single stimuli. A standard deviation of 
differences of 1.00 was used, allowing scale separa- 
tions to be determined directly from unit normal 
deviates corresponding to the proportion of the time 
one stimulus was judged greater than another. 
(Alternative analyses are also explored, as will be 
discussed below.) 

A special technique devised by Thurstone and 
Jones (1959) was required for estimating z-score 
values for empty cells resulting from the restriction 
that no stimulus be compared with itself. This 
method requires the averaging of scale separations 
for all possible comparisons according to the genera: 
formula: 


(eS) = £205, — 5,) = (4-59) 
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TABLE 1 
SCALING METHOD INTERCORRELATIONS 


Multiple | Successive| Raw 
standards | intervals mon 
ratings 

High-context behavior 

Paired comparisons 964 962 961 

Multiple standards 999 997 

Successive intervals -998 
Neutral-context behavior 

Paired comparisons 983 982 980 

Multiple standards 999 +999 

Successive intervals 999 
Low-context behavior 

Paired éemparisons 961 961 -963 

Multiple standards . 998 995 

Successive intervals le $98 

© 

High-context violations 

Paired comparisons 979 980 986 

Multiple standards 999 999 

Successive intervals «999 
Neutral-context violations 

Paired comparisons 975 972 986 

Multiple standards 999 997 

Successive intervals 995 
Low-context violations 

Paired comparisons 978 975 +983 

Multiple standards 999 997 

Successive intervals +994 
High-context gifts 

Paired comparisons -981 983 983 

Multiple standards 999 996 

Successive intervals 994 
Neutral-context gifts 

Faired comparisons 996 995 +989 

Multiple standards 999 996 

Successive intervals 996 
Low-context gifts 

Paired comparisons 990 991 £992 

Multiple standards 999 998 

Successive intervals 998 


where 


(Sy — Sn) refers 
any two stimuli 
= (S,— Sy) — (Sn — Sy) symbolizes the summa- 
tion of z-score differences between any pair of 
stimuli 

q represents the number of columns in which 
entries appear for both Stimuli g and h, a value 
which may change for different combinations of 
g and h. 


These computations adjust directly obtained z-score 
differences as well as filling empty cells. From the 
resulting table of z-score values, the scale values are 
determined according to the usual procedures (eg. 
Guilford, 1954; Torgerson, 1958). In the present 
instances, the values were expressed in terms of the 
standard error of the difference rather than in dis- 


to the scale distance between 


, Criminal dispersion units, The value of the lowest 


stimulus is used as the computational zero in this 
Procedure, 

. Categorical judgments, For analysis by the method 
of categorical judgments, the responses were grouped 
into categories as follows: 39 and below, 40-44, 
45-49, 50-54, 55-59, 60 and above. A table was 
Prepared giving for each stimulus the percentage of 
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judgments in that category or a lower one. These 
percentages were then turned into z values, and the 
differences between stimuli in terms of these z values 
averaged for the five lower categories (all percent- 
ages for 60 and above being 100). This procedure is 
superior in obviating the necessity of computing the 
scale values for category boundaries and in avoiding 
interpolating. It can be called the method of multiple 
standards. It proves to be essentially identical to the 
more usual method of successive intervals, also used. 
These scale values use the lowest of the 15 stimuli 
as the computational zero, and are expressed in 
units of the standard error of the comparison. 

Mean ratings. The third metric used treats the 
categorical judgment data as numerical ratings, 
averaging the two digit numbers to obtain the 
mean rating for each of the 15 stimuli. These values 
are of course expressed in the units of the original 
judgment assignment, with 50 typically an average 
value, 40 low, 60 high, and with freedom to move 
outside of these limits, 

Zero-point computation. Rational origins were 
obtained for all of the above analyses. The first step 
was to obtain the 10 estimates of the additive 
constant, c, for each set of data by solving the 
equation, 


Sis — (Sit Sy) = 
where 


Sı; refers to any composite stimulus whose com- 
ponent stimuli are S; and Sy. 


The mean of each such set of estimates is then added 
to each scale value of the particular set of items 
from which it was derived. The resulting values are 
regarded as scale distances from the rational zero 
point pertaining to that particular set of data, A 
complete description of this procedure is given by 
Thurstone and Jones (1959). 

It was suggested earlier that the acceptance of 
additivity depended upon some degree of agreement 
or clustering of zero-point determinations, These 
clusterings are graphically displayed in Figures 1, 
2, and 3. As a quantitative description of the degree 
of clustering, the ratio between the standard devia- 
tions of zero-point determinations and the standard 
deviations of corresponding composite stimuli has 
been computed. Such ratios, referred to subsequently 
as “additivity indices,” were computed for each 
stimulus set. The greater the additivity index, the 
less the dispersion of zero points relative to the dis- 
persion of composite stimuli. 

The Thurstone and Jones (1959) method of 
describing additivity was a scatterplot of scale values 
(after addition of the additive constant) with 
summated component values on the abscissa and 
corresponding composite values on the ordinate. 
That is, composite stimuli, Sı, were plotted against 
their respective summated component stimuli, S+ + 
S;. Presumably, the higher this correlation, the 
closer the fit to the additivity model. However, such 
an analysis was used only for comparative and sup- 
plementary purposes in this investigation. A high 
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correlation is possible even when scale values of 
composite stimuli fall between or below those of 
their component stimuli, and the scattering of zero- 
point inferences is not directly examined. 

A further index of the degree of context effect 
will be presented in the Results section subsequent 
to examination of the graphic presentation which 
it seeks to summarize. 


RESULTS AND DISCUSSION 
Agreement among Scaling Procedures 


Table 1 presents the degree of agreement 
within contexts and stimulus types for four 
scaling methods. Agreement is essentially per- 
fect among the three methods (multiple 
standards, successive intervals, and mean rat- 
ings) making use of the same data. Agree- 
ment between these and paired-comparison 
values is almost perfect, although going as 
low as .96, Although multiple standards and 
successive intervals methods share with paired 
comparisons the procedures and logic of 
Thurstone’s law of comparative judgment, 
while mean ratings do not, mean ratings agree 
as well or better with paired comparisons, 
There is nothing in these results to justify 
the complex rationale and greater effort of 
the comparative judgment procedures. Note, 
however, that the rating procedure was par- 
ticularly designed to avoid the end effects 
which plague such methods, including the 
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method of equal-appearing intervals, (Be- 
cause of the computational overlap with the 
method of multiple standards, the methed of 
successive intervals values is not reported 
in what follows.) 


Additivity Requirements 


Figures 1, 2, and 3 present the paired- 
comparisons results for the three types of 
objects and for the three contexts of each, 
For purposes of illustrating the context effect, 
the tracer stimulus is plotted at the safne level 
for all three contexts, the other values being 
located in relation to it. Scale distances from 
the mean zero are indicated numerically for 


each E stimulus. Identification letters of stim- 


uli are indicated in uppercase letters to the 
right of the vertical line. Zero-point locations 
are indicated to the left of the line in lower- 
case letters corresponding to the stimuli on 


which they are based. In addition to the 


three plotted, essentially identical results ob- 
tain for the multiple standards and the mean 
ratings computations. In terms of additivity 
requirements, we are thus discussing 27 zero- 
point studies (3 Object Types X 3 Contexts 
X 3 Scaling Methods). In only five minor 
instances did a higher scale position obtain 
for a singlet than for a doublet of which it 
was a component. One of these is shown in 
Figure 1, for the A stimulus, neutral context. 


TABLE 2 
Appitivity INDICES 


Paired compariso: Multipl bined 
eh EASA m] ns ultiple standards Mean ratings ven ame, 
a 
SD: | SDs | Index | sp, | sp. | index | sp, | sp, | inder | indices 
Bryer 
ig 212 | 241 | 113 | .286 | .302 | 95 | 22 |1 
Neutral 862 | 459 | 1.44 | ‘770 | i348 | 221 | Sis | 26 245 
ow 380 209 162 | 424 | 131 | 324 | 2.77 | 141 | 1.96 
Viton F : é 493 | .260 | 1.90 | 3.35 | 172 | 195 1.77 
ig 992 | 352 | 1.57 | .532 | 356 | 1.49 | 444 | 262 | 1.60 
Neutral 653 | 360 | 181 | [557 | ‘60 | cot | $35 | 402 1.20 
ow See | 333 | 1.47 | 388 | 1366 | 1:06 | 290 | ate | £2 
giu 5651.348 | 1.62 | 492 | “aug | 111 | 423 3.08 | 1.37 1.37 
igh -245 | 156 | 1.57 | 288 | 153 
4 $ ; x 1.88 | 2.38 | 1. 2.14 
Neutral 456 | 317 | 144 | [529 | [300 | rt | aoa | gan | e 
ow $49 215 2.85 “$68 | 203 | 280 | 433 | 1184 | 235 a 
A : Š 3 :222 | 2.08 | 3.65 | 1.80 | 2.03 1. 
Grand M 473 | 294 | 1.64 | ‘480 | [306 | 170 | 374 | 22 1.72 1.70 
Thurstone-Jones gifts | .497 220 | 2.26 (unweighted) 
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Fic. 1. Scale values and zero-point determinations of paired-comparisons behavior statements, 
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Fic. 2. Scale values and zero-point determinations of paired-comparisons traffic violations. 
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Fic. 3. Scale values and zero-point determinations of paired-comparisons gifts. 
801 


802 


LO l2 


$ \ 
Fic. 4. Relationship of composite stimuli and summated components, high context 


Jack M. Hicks anp Donatp T. CAMPBELL 


2.8“ 


2.6 


l4 16 18 2.0 2.2 2.4 


Sı +3; 


multiple standards behavior statements, 


In 25 of the 27 context set analyses, additiv- 
ity indices exceeded unity as shown in Table 
2. Assuming the probability of SD, being less 
than SD, on the basis of chance alone to be 
50, the obtained .93 would be highly signifi- 
cant (p < .001), if one could assume inde- 
pendence, As predicted, birthday gifts yielded 
the highest combined additivity index of 1.98. 
Bizarre behavior statements, expected to be 
least additive, showed an additivity index of 
1.77, whereas the index for traffic violations 
was only 1.37. This surprise is traceable to 
categorical judgments data where a combined 
average behavior index for multiple stand- 
ards and mean ratings analyses was 1.92 and 
the combined average violations index was a 
low 1.24, For the paired-comparisons data, 
additivity indices ranked according to predic- 
tion with 1.82, 1.62, and 1.45 representing 
gifts, violations, and behavior, respectively. 
All but three stimulus sets fell short of an 
additivity index of 2.26 computed for the 
Thurstone-Jones (1959) data, The highest 


single additivity index produced by the pres- 
ent study was a 3.24 from low context multi- 
ple standards behavior data. As expected, 
additivity indices for mean ratings corre- 
spond more closely to multiple standards than 
paired-comparisons indices, 

Note that the agreement among methods 
on additivity is not as high as for scale valu j 
within contexts. There is no uniform superior 
ity for any one scaling method, multiple 
standards showing both the highest and low- 
est values, ‘ 

Thurstone and Jones’ assessment of the 
tenability of the additivity assumption was 


based upon scatter diagrams and correlation | 


coefficients relating the composite sion 
with the sum of the two component a 
Such values have been computed for ie 
Present data. Figures 4 and 5 present tt 
worst and the best of these. The correlato 
of .84 and .99 correspond to additivity indice 
of .95 and 2.80. The average correlation w4 
95. 
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Fic, 5 Relationship of composite stimuli and summated components, low context 
multiple standards gifts. 


Context Effects 


Figures 1, 2, and 3 show striking evidence 

; of Context effects. These effects are equally 
striking when the data from multiple stand- 
„ards and mean ratings are graphed (not 
shown here). While the graphic displays are 
Convincing, it is difficult to use them for mak- 
Ing comparisons as to degree of context effect. 

» For this’ reason, an index of context effect 
has been computed, in which a standard 
deviation based solely upon the three Tracer 
Stimulus E values has been divided by an 
average of the standard deviations within 
cach of the three contexts of its 15 stimuli. 
That is, the variability of Stimulus E values 
across contexts has been expressed relative to 
the average spread of values within contexts. 
The larger this index, the greater the effects 
of context. These context effect indices are 


shown in Table 3. From this table the sur- 
prising result appears that gifts show the 
largest context effect, by all scaling proce- 


TABLE 3 
INDICES or Context EFFECT 


Behavior | Violation] Gifts 
Paired comparisons 96 1.10 
Multiple standards 90 1.08 
Mean ratings 79 1.09 

Unweighted | Weighted] Case III 

Paired-comparisons gifts 94 1.05 

Good judges Poor judges 
Multiple standards gifts 97 1.21 
Mean ratings gifts 88 1,29 


Note.—The above indices have a minimum possible value of 
0.00, indicating complete absence of context effect. 
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dures, This is not immediately apparent from 
comparing Figures 1, 2, and 3 because the 
graphs are not comparable in spread of values 
within contexts. This is surprising in that the 
habits of relating dollar values to these ob- 
jects ‘might have been expected to provide a 
stable anchoring of values independent of 
the transient context of judgment. Behavior, 
which might have been expected to have the 
least quantitative external value anchoring, 
shows the least context effect. For methods, 
paired comparisons is slightly the most sus- 
ceptible to context effects in all three topics, 
while mean ratings tends to show the least. 
Again, if susceptibility to context effects is a 
weakness, the rank order of quality is the 
reverse expected. 

The context effects are so strong and so 
uniform that one suspects a methodological 
artifact rather than a pure psychological 
process, For this reason, a number of 
methodological explorations were undertaken, 
As was noted in the introduction, Thurstone 
and Jones assigned to simple stimuli the dis- 
criminal dispersion of 1.00, and to compound 
stimuli the discriminal dispersion of v2, 
whereas we have treated simple and compound 
stimuli the same. When their weighting sys- 
tem is used for paired comparisons, gifts, the 
context effect remains strong, being reduced 
only slightly, from 1.10 to .94. The additivity 
index within contexts is, however, greatly in- 
creased: for high, neutral, and low contexts 
the values are, Tespectively, 3.14, 2.26, and 
5.47, averaging 3.29 compared to the average 
value of 1.82 for the uniform weighting pro- 
cedure, shown in Table 2, 

The Thurstone and Jones procedure as- 
sumes greater variability in appearance, 
greater discriminal dispersions, for compound 
stimuli, to an amount based on the formula 
for the standard deviation of a sum. If we 
assume that the variability in question is 
solely due to the fluctuating appearance to a 
single observer, it may seem reasonable that 
the variability of the composite be greater 
than that of a singlet. However, for the actual 
situation of application of the law of com- 
parative judgment (for Thurstone’s Cases 2 
3, 4, and 5) individuals are pooled. Here the 
basic variability undoubtedly comes more 
from the stable differences across individuals 
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stimuli, than from the fluctuating appearance 
to any one individual. In this instance it 
probably is the case that stable individual 
differences in hierarchy of values are smaller 
for compounds of objects than for single! 
objects, and hence discriminal dispersions | 
smaller rather than greater. Weiss (1963) has } 
found such an effect. Whatever the case may | 
be, a Thurstone Case 3 analysis that infers 
stimulus differences in discriminal dispersions 
would be preferable to either a priori assump- 
tion. Such an analysis, following Burros 
(1951) as described by Guilford (1954), has 
been done for paired comparisons, gifts, For | 
this, the index of context effect is 1.05, but a 
slight improvement on the 1.10 values pre- 
viously found. For this analysis, the additiv- 
ity indices for high, neutral, and low con: | 
texts are 1.72, 1.63, and 2.78, averaging 2.01. | 
Random-error artifacts might also con- 
ceivably produce the context effect. If in a 
paired-comparisons task, Stimulus A is judged 
greater than Stimulus B 90% of the time, the 
10% of exceptions can be regarded as in’patt 
due to the fluctuations in the appearance of 
A and B (Thurstone’s discriminal dispersions) | 
and in part due to blind error, including fail- 
ure to read the items, misreading, mistakes in 
responding, etc. Such blind error would pro- | 
vide on the average a 50-50 split no mattet 
what stimuli were being compared. The super- 
imposition of such error moves all compati 4 
sons closer to the 50% point, and thus brings 
the stimuli being compared closer together it 
resulting scale values, This shift is greater it 
terms of the corresponding 2’s for those per- 
centages nearest 0 and 100. The shrinkage 1$; | 
of course, always toward the stimuli which 4 
given stimulus is being compared with, If 
is being scaled with higher valued A and B 
in one context and with lower valued D an 
E in another, it is distorted up in the firs 
context and down in the other, with corte- 
sponding effects upon the inferred zero point: 
Such a mixing of a group of judges who a 
homogeneously good discriminators with 4 
group who respond purely randomly can H 
regarded as an extreme case of variability ™ j 
discriminatory efficiency (or discriminal r 
persions for judges, if such a concept 
allowed). This, as has been noted abové 


in their stable preference hierarchies of me 


Z=ERO-POINT SCALING 805 


presumably violates the assumptions of Thur- 
stone’s Cases 2, 3, 4, and 5. Because of the 
complexity of the problem, an illustrative 
synthetic example was worked out in which 
it was assumed that half of the judgments 
were totally blind. The details of this analy- 
sis are available from the authors. Suffice it 
to say here that while some artifactual con- 
text effect resulted, the amount was surpris- 
ingly small, generating an index of only .07. 


Separating Good aud Poor Judges 


In spite of the trivial amount of context 
effect generated by the random-error illustra- 
tion, a related analysis was made for gifts, 
separating good’and poor judges. Each judge’s 
responses on the categorical judgment ratings 
were correlated with the paired-comparisons 
scale values of these same 15 stimuli. On the 
basis of these correlations, the judges were 
divided into a good half and a poor half for 
each context, The low group was expected to 
contain the bulk of the random error per- 
formance, and hence to show the greater con- 
text effect. The correlation range for the good 
judges was, for high, neutral, and low con- 
texts, .94 to .575, .99 to .7975, and .99 to 
-753. For poor judges, these ranges were: 
574 to —.48, .7974 to —.66, and .752 to 
—.40. The multiple standards and mean rat- 
ings zero-point scalings were then done. As 
expected, the poor judges showed great con- 
traction of scale values. The average standard 
deviation of scaled values for the poor judges 
was but two-fifths that of the good judges. 
As expected, the index of context effects, re- 
ported in Table 3, was less for the good judges 
than the poor, consistently for both scaling 
methods. However, a very substantial con- 
text effect remains for the good judges, leav- 
ing data quality inadequate to explain the 
basic context effect findings. 


Context Effects in Terms of Response-Scale 
Values 


The usual statement of context effects is in 


*terms of the judges’ response language, 


whether stated as the value of a point of 
subjective equality or as the mean judgment 
given to a tracer stimulus. It should both 


provide continuity with that tradition, and 
help clarify the psychological meaningfulness 
of the context effects found, to reanalyze 
some of our findings in these terms. Cate- 
gorical judgments data for all three contexts 
were reanalyzed with raw mean stimulus 
judgments as scale values. Figure 6 typifies 
these results. The most direct representation 
of the context effect is in the unbroken line 
showing the locations of the Tracer Stimulus 
E. This shows the higher value for the lower 
context, the typical result. 

Were the context effect to have been com- 
plete, then the mean stimulus value across 
contexts would have been identical. Such is 
clearly not the case as indicated in Figure 6 
by the downward slope of the broken line con- 
necting mean scale values from high to low 
context, Complete context effect has indeed 
been found to be the case in some studies 
(e.g., Krantz & Campbell, 1961). Here, how- 
ever, the mean ratings reflect in part and in 
rank order the differences in the stimuli being 
scaled. This may be due to the external re- 
ferents in the definition of the response lan- 
guage terms. For example, the value 50 was 
defined for behavior “Moderately abnormal 
behavior, with moderate amounts of eccen- 
tricity and bizarreness.” For traffic offense, 
50 was “A moderately serious offense.” For 
gifts, 50 was defined as “Quite desirable, 
would enjoy receiving as a gift.” That the 
independence from context was only partial, 
is shown by the shifting values for the Tracer 
E. The shifting values for Stimulus E show 
the typical context effects of the method of 
absolute judgment (e.g., Campbell et al., 
1958; Helson, 1947; Krantz & Campbell, 
1961). Note, however, that were this effect 
to operate as an additive constant, as it may 
be regarded as operating in most studies, 
there would be mo net context effect resulting 
in the zero-point computations. The obtained 
context effect in zero points (shown by the 
sloping line connecting mean zero points in 
Figure 6) is thus a subtler effect, manifesting 
itself in the relation between perceived values 
for simple and for complex stimuli rather 
than in any simple additive or multiplicative 
effect on the absolute response values. 

In general, this analysis is reassuring that 
the context effect found is in considerable de- 


60 


oO 
a 


ORIGINAL JUDGMENTAL VALUES 
a a 
a [e] 


40 


HIGH NEUTRAL LOW 
CONTEXT CONTEXT CONTEXT 
Zeros |Stimuli Zeros |Stimuli Zeros | Stimuli 
AE 
AB AC 
EL 
AD EG 
BD Al 
„AE AG EH 
KAS CE Stimulus EF 
\ac A E (54.7) 
Meon Value =, 
C 
(53.5) SA 
N Mean Value 
-CE Stimulus a ` 
`D E (50.7 aS 
ean Y% 
Value Is 
(49.6) | __FI 
“HI 
: Fs 
Stimulus E 
(46.2) bd se GI G 
ad~ ac I 
ab ce 
Mean Zero? 
(43.9) EA eg H 
be 7, ein FI eh- 
ae Mean Zero- E 
ce (41.8) ai G gi 
ae Mean Zero F 
Sts (40.4) F7 
cg fg” 
gi 
fi, fh 


n 


( 


ZeERO-PoInT SCALING 


gree a psychological phenomenon, rather than 
a methodological artifact. 


CONCLUSIONS 


In extending the Thurstone-Jones zero- 
point analysis to a variety of novel stimulus 
objects, the requirements of the model were 
found reasonably well met. This was so not 
only for the topic of gift items, comparable 
to those of the initial study, not only for 
traffic wiolåtions, where additivity was also 
expected, but also for statements describing 
bizarre behavior, where something more like 
an averaging of evidence than a psychological 
"summation was expected. If the zero-point 
analysis and the*concept of additivity are to 
be psychologically useful, then some set- 
tings where they do not work should be 
found. For example, if the singlet stimuli 
were grades on one exam, and the doublet 
stimuliswere grades on two exams, an averag- 
ing should result, with the composite scaling 
between the two singlets which compose it 
rather than being more extreme than either. 
Comments in letters of recommendations 
might, for example, combine in this way. 

It was argued earlier that the phenomenon 
of additivity of a stimulus class was governed 
entirely by two conditions: objective criteria 
associated with the stimulus class, and/or 
socially anchored subjective additive prop- 
erties of the stimulus class. It appears, there- 
fore, that either behavior statements do con- 
tain one or both of the above attributes, or a 
„third variable must be considered; viz., the 
Capacity to éransfer learned additive skills to 
novel stimuli. In this latter case, additivity 
could be achieved in the absence of both of 
the ‘above conditions, or in conjunction with 
one or both of them, A corollary possibility 
may be an interaction between additivity 
transfer and the presence of demand charac- 
teristics of the experiment suggested by 
judges’ instructions and nature of task pres- 
entation. That is, a judge may add or average, 


' ‘depending upon possible desired outcomes 


implied to him by the experimental arrange- 
ment. Our instructions, and the joint presence 
of other tasks in which additivity was ap- 
Propriate, may have worked in this direction. 

A very desirable feature in a meaningful 
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psychological zero point would be stability, 
so that the same zero point would be inferred 
from different sets of stimuli. Indeed, the 
zero point would hardly seem meaningful 
without showing invariance over some multi- 
ple methods of ascertainment. On the other 
hand, if psychological valuations do genuinely 
change with context, then the zero point 
should perhaps change accordingly. What- 
ever the interpretation, the zero points in 
the present study were markedly affected by 
context. While a trivial amount of this effect 
maybe attributed to a random-error artifact, 
in the main it seems a psychological phe- 
nomenon, 

Several methods of scaling were employed, 
all producing equivalent results. A simple 
mean rating procedure provided excellent 
zero-point determinations, and correlated .96 
or better with other methods. This procedure 
was likewise as effective as the others in 
maximizing additivity and minimizing con- 
text effects. The methods of successive inter- 
vals and multiple standards were essentially 
equivalent to each other and to paired com- 
parisons, 
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NEED FOR APPROVAL AND THE EXPRESSION OF 
. AGGRESSION UNDER VARYING CONDITIONS 
OF FRUSTRATION * 


CLAIRE G. FISHMAN 


Harvard University 


The relationship between need for approval, as measured by the Marlowe- 
Crowne Social Desirability scale, and aggression, was investigated in 60 college 
a Women, 2 types of frustrations were devised to elicit aggression, arbitrary and 
. nonarbitrary. Subsequent to the frustration procedures, Ss had the opportunity 
to aggress against the E-frustrater on an “eyaluation questionnaire.” Blood 
pressure was ‘recorded to measure arousal following the frustration, and fol- 
lowing expression of aggression. As predicted, high n Approval Ss expressed 
less aggression against the frustrater than low n Approval Ss. (There was no 
difference between them on frustration arousal.) Moreover, among high- 
aggressive Ss, high n Approval Ss manifested a continued elevation in blood 
pressure while low n Approval Ss manifested a decline in blood pressure, 
following aggression. Ss under arbitrary frustration expressed more aggression 


than Ss under nonarbitrary frustration, regardless of n Approval. 


Research on the Marlowe-Crowne Social 
Desirability (MC-SD) scale indicates that 
the social desirability response set reflects a 
general disposition to behave in a socially de- 
sirable manner in order to gain approval, and 
that the MC-SD may therefore be considered 
_ an indirect measure of a general need for 

approval (Crowne & Marlowe, 1960). Spe- 
cifically, studies have found that high scorers 
on the MC-SD were significantly higher than 
low scorers on: the tendency to respond to 
“perceived situational demands” (Marlowe & 
Crowne, 1961); conformity in an Asch-type 
situation (Strickland & Crowne, 1962); con- 
ditioning to minimal verbal reinforcements 
(Crowne & Strickland, 1961; Marlowe, 
1962); “fear of rejection” imagery on the 
TAT (Crowne & Marlowe, 1964); rating of 
defensiveness by psychiatrists (Strickland & 
Crowne, 1963); and “defensiveness against 
the overt expression of hostility” (Conn, 
1963). 


1The research reported here is from Fishman’s 
` (1964) doctoral dissertation submitted to the De- 
partment of Social Relations, Harvard University. 
The investigation was conducted while the author 
was a Public Health Service Predoctoral Fellow. 
The research was also supported, in part, by NSF 
Grant G-25113, David Marlowe, principal investi- 
gator. The author is indebted to David Marlowe 
for his continued advice during the preparation of 
this manuscript, and to Robert Rosenthal for his 
valuable suggestions, 


The present study was designed to further 
explore the need for approval construct, spe- 
cifically, by investigating the relationship be- 
tween the need for approval (n Approval), as 
measured by the MC-SD scale, and the ex- 
pression of aggression under varying condi- 
tions of experimentally induced frustration. 
The direct expression of aggression against 
others has strong negative sanctions in Amer- 
ican middle-class culture and tends to pro- 
voke disapproval and rejection. We might 
therefore expect that individuals strongly 
motivated toward gaining approval would 
tend to inhibit such unacceptable behavior. 
Thus, the main prediction in the present 
study was that, following an experimentally 
induced frustration, individuals high on n 
Approval will respond with significantly less 
direct aggression against the frustrater than 
individuals low on n Approval. 

In order to investigate the generality 
(across frustration situations) of the relation- 
ship between n Approval and aggression, 
individuals were subjected to one of two 
kinds of frustration situations, an arbitrary 
or a nonarbitrary one, The dimension of arbi- 
trariness as an important mediator of the 
frustration-aggression hypothesis was origi- 
nally investigated by Pastore (1952), and 
explored further by Cohen (1955) and 
Rothaus and Worchel (1960), all using ques- 


809 


810 CLAIRE G. 
tionnaire descriptions of hypothetical frustra- 
tions to elicit aggressive responses. As pre- 
dicted, they all found more aggression elicited 
by arbitrary than by nonarbitrary frustra- 
tion situations. Kregarman and Worchel 
(1961) investigated this variable by experi- 
mentally manipulating arbitrary and non- 
arbitrary frustrations. They found that when 
arbitrariness was operationally defined in 
terms of expectancy, the arbitrary (unex- 
pected) frustration elicited more aggression 
than the nonarbitrary (unexpected) frustra- 
tion; when it was defined in terms of reàson- 
ableness, however, there was no difference 
in the amount of aggression elicited. This, as 
the authors suggest, might have been due to 
inadequate differentiation of the two arousal 
procedures. The design of the present study 
allowed for the further experimental investi- 
gation of the hypothesis that more aggression 
would be expressed by subjects frustrated 
arbitrarily than by subjects frustrated non- 
arbitrarily with arbitrariness defined spe- 
cifically in terms of reasonableness. 

The hypothesis that more aggression will 
be expressed in the arbitrary frustration situ- 
ation is based on the theory (partially sup- 
ported by the findings of Rothaus & Worchel, 
1960, and Kregarman & Worchel, 1961) that 
there is a difference in the perceived appro- 
priateness of expressing aggression—and thus 
a difference in the amount of inhibition of 
aggression—in these two situations. (This 
explanation assumes equal instigation to ag- 
gress in both types of situations.) Similarly, 
the hypothesis that less aggression will be 
expressed by individuals high on n Approval 
is based on the theory that these individuals 
inhibit socially unacceptable behavior, How- 
ever, it is possible that the differences in 
elicited aggression (between subjects in the 
two frustration situations, and between sub- 
jects high and low on n Approval) is due 
to differential arousal, that is, to differential 
instigation to aggress, rather than to a dif- 
ference in strength of inhibition. Still another 
Possibility is that differences in amount of 
aggression are due to differences in the per- 
ceived source of the frustratio ; that is, sub- 
Jects may blame themselves for the frus- 
tration and thus direct aggression against 
themselves. In the present study, these vari- 


FISHMAN 


TABLE 1 
MC-SD MEANS AND STANDARD DEVIATIONS FOR Ecm 
AND LOW n APPROVAL SUBJECTS IN EACH cr ° 
THE THREE CONDITIONS 


Arbitrary | Nonarbitrary | Control 
n Approval | — se 

M SD | M | SD | M | sp 
High 18.45 | 3.26 | 17.70 | 3.10, | 17.80 | 3.12 : 
Low 10.20 | 1.60 | 10.30 | 2.28 | 10.75 | 1.94 
Total 14.33 | 4.85 


14,00 | 4.60 | 14.28 | 4.40. i 


Note.—Grand mean = 14.20, SD = 4.62, 


ous explanations were investigated by offering” 
subjects the opportunity to aggress against 
themselves (as well as against the experi- | 
menter-frustrater) and by taking physiologi- | 
cal measures of arousal.’ 

Physiological measurements of arousal were | 
also recorded following the opportunity to | 
aggress in order to explore the physiological | 
concomitants of expressing aggression, spt | 
cifically in relation to n Approval and to the ` 
arbitrariness of frustration. Increase in physi- 
logical arousal following aggression might be | 
interpreted in terms of Berkowitz’ “aggves- 
sion-anxiety” hypothesis (Berkowitz, 1958, 
1960); tension reduction, on the other hand, | 
might be interpreted in terms of the “ca: 
tharsis” hypothesis (Dollard, Doob, Miller, 
Mowrer, & Sears, 1939; Feshbach, 1955; 
Hokanson & Shetler, 1961). N 

METHOD 

Subjects 

The subjects were female freshmen and sopho- 
mores at a small, 4-year teachers college for women. 
They were selected from a group of 98 students 
for whom scores on the MC-SD were available. The 
original plan was to run 60 subjects, 20 in each of i 
the three conditions. However, because 19 subjects 
suspected the real nature of the experiment and hal 
to be eliminated, a total of 79 subjects participated 
before the desired total of 60 was obtained. 


Need for Approval (MC-SD) 


The median MC-SD score for the 98 students w% 
14; thus, subjects with a score of 14 or below were 


*In line with the work of Hokanson and bis 
associates (e.g, Hokanson & Shetler, 1961) we em 
ployed physiological arousal as our operatio 
definition of the instigation to aggress. This appt 
to be the best available method of measuring 4 
instigation to aggress. However, it must be n0! a 
that arousal may also reflect such other emotions 


anxiety and fear. " 
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classified as low on n Approval; and subjects with 
a score above 14 were classified as high on n Ap- 
proval, Subjects were then selected and assigned to 
one of three conditions according to two design 
requirements: first, that each condition contain 10 
low and 10 high n Approval subjects; and second, 
that the three conditions be matched on the n 
Approval variable. The results of the selection and 
matching procedures are presented in Table 1. 


Treatment Conditions 


Subjects in the arbitrary frustration condition 
(Arb) wee promised a $2 bonus if they succeeded 
on a task, and then were unjustly criticized and 
denied the bonus despit® their success on the fask. 
Subjects in the nonarbitrary frustration condition 
(Nonarb) were also promised a $2 bonus, a 
Game terms as the Arb subjects. They, however, 
were told that they, had failed on ‘the task. They 
were thus denied the bonus (justifiably) and criti- 
cized for their poor performance. Subjects in the 
Control condition were just asked to perform the 
task. There was no mention of a bonus, or of 
success or failure. 

A number of scripts were originally prepared for 
each of fhe experimental conditions. Twenty female 
judges were asked to indicate on a 10-point scale 
how frustrated they would feel if actually subjected 
to one or another of the frustration experiences pro- 
vokeel by the experimenter’s enactment of the script. 
The Arb and Nonarb scripts chosen for the experi- 
ment were the two judged to be equally frustrating. 


Aggression 


Following the frustration manipulation, subjects 
were asked to fill out a so-called “Research Evalua- 
stion Questionnaire’? (REQ). The REQ consists of 
18 items, 7 of which refer to an evaluation of the 
experimenter (REQ-E subscale). A sample item is: 
“In your opinion, how competent was the student 
who conducted the experiment in which you par- 
' ticipated?” Below each item was a 10-point rating 
scale (from —S to +5, with O omitted). A —5 
tating was given a score of 10, a —4 rating was 
given a score of 9, and so on down to +5 which 
was given a score of 1. The individual’s score was 
her niean rating on the 7 items (with a possible 
range of 1-10). Thus, a high score indicated that a 
subject was high on aggression against the experi- 
menter, while a low score indicated that a subject 
was low on aggression. 

A procedure was devised to make the REQ-E a 
Measure of direct aggression against the - 
menter, The subjects were led to believe that nega- 

ı tive responses would be injurious to the experi- 
menter. The experimenter assumed the role of a 
first-year graduate student who was conducting the 
Tesearch for a research course requirement. A letter 
accompanying the REQ informed subjects that their 
evaluations on the REQ were to be used by the 
research supervisor” in rating and grading the 
Student’s research, The REQ, the letter, and an en- 
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velope addressed to the “research supervisor” were 
presented to the subjects in a sealed envelope. The 
experimenter assumed total ignorance of the en- 
velope’s contents. 

Some of the REQ-E items were taken directly, 
or in modified form, from a questionnaire developed 
and used by Feshbach (1955, 1961). Other investi- 
gators have used similar questionnaires for measur- 
ing aggression (Hokanson & Burgess, 1962; Kahn, 
1960; Kregarman & Worchel, 1961). Our procedure 
of presenting subjects with an “evaluation question- 
naire” was modeled after Kahn’s procedures. 

Two items referring to self-aggression were also 
included in the REQ (REQ-S subscale). These were 
presented and scored in the same manner as the 
REQ-E. The rest of the items on the REQ served 
as filler items. 


Changes in Systolic Blood Pressure 


A sphygmomanometer operated by the experi- 
menter was used to measure systolic blood pressure. 
Readings were taken following each phase of the 
experiment. The difference between the reading fol- 
lowing a specific phase and the preceding one served 
as an index of “arousal,” or an index of “tension 
reduction” resulting from the specific procedures 
during that phase. Our primary interest was in the 
changes in blood pressure resulting from the frustra- 
tion procedures; and in the changes in blood pres- 
sure resulting from the opportunity to aggress on 
the REQ. 


Procedure 


Each subject was seen individually by the experi- 
menter, To avoid experimenter bias, the experi- 
menter never knew the subject’s n Approval score; 
nor did she know the subject’s experimental condi- 
tion until immediately before the actual frustration 
procedures, The experimenter did know from the 
beginning when a subject was a control because of 
a necessary modification in the introduction to 
control subjects. 

Subjects were told that the study involved the 
physiological concomitants of intellectual activity. 
Experimental subjects were told that the task they 
would be asked to perform was a measure of flexi- 
bility, an important aspect of intelligence, that 95% 
of the college girls studied were able to accomplish 
the task, and that effort was required to do well. 
A $2 bonus would be given to them if they suc- 
ceeded. Task instructions would follow a 10-minute 
adaptation period, Control subjects were told only 
that the task instructions would follow the adapta- 
tion period. 

8 After completing the REQ, subjects were also 
given the opportunity to aggress indirectly on two 
projective tests: the TAT (4 cards) and a word- 
association task (Gellerman, 1956), No significant 
or consistent findings were obtained with these two 
measures, and therefore no further space will be 
devoted to discussing them. 
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A base-line blood pressure reading was taken im- 
mediately after the 10 minutes. The subject was then 
told to count backwards from 100 to 1 by 3s as 
quickly as possible.* Control subjects were told to 
go through the counting three times. Experimental 
subjects were told they could have three trials. After 
each of the first two trials, they were told they 
failed. After the third trial, the experimenter told 
subjects in the Arb condition: 


Well, you made it, but the more I think about it, 
the less I think you deserve that $2 bonus, 
Frankly, I don’t think you cared one bit about 
the whole thing. Remember, I said it was impor- 
tant that you do your best, and I don’t think you 
did. No, I don’t think PI give you that $2 
because you didn’t try hard enough. I can’t give 
a bonus to someone so indifferent about it as you 
were, even if you did succeed. 


Subjects in the Nonarb condition were told: 


That does it, you failed to make it in the time 
limit. So, you don’t get the $2. That was our 
agreement. Do you know that 95 per cent of the 
college girls studied have been able to do the task 
successfully? Remember, I said it was important 
that you do your best, that you make the effort. 
Maybe you didn’t care about the whole thing, 
because if you did you would have made it. In 
any case, since you did fail, for whatever reason, 
you don’t get the $2. 


Blood pressure was recorded after the first two 
trials and again after the frustration manipulation 
following Trial 3. The experimenter then explained 
that her experiment was over and handed the sub- 
jects the sealed envelope containing the REQ. Blood 
Pressure was recorded after the completion of the 
REQ. Four TAT cards and a word-association task 
were then presented, as “part of another experi- 
ment,” 

With the experiment over, subjects were asked 
to remain for a one-half hour interview. One pur- 
Pose of the interview was to determine the subjects’ 
impressions about the experiment. If a subject indi- 
cated any suspicion that the experimenter was de- 
liberately trying to anger her, she was eliminated 
from the study. The second purpose of the interview 
was to disclose fully all deceptions employed in the 
study and to provide subjects an opportunity to 
express feelings aroused by the experiment. 


RESULTS 
REQ-E Aggression 


Hypothesis 1. High n Approval subjects 
will express significantly less aggression 
against the experimenter than low n Ap- 


4The task was a modification of one original; 
employed by Hokanson and his associates (Hokan. 
son, 1961; Hokanson & Burgess, 1962; Hokanson & 
Shetler, 1961). 
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proval subjects. This difference was expected 
in the experimental conditions but not in the 
control condition. e 

Hypothesis 2. Subjects in the Arb condi- 
tion will express significantly more aggression 
against the experimenter than subjects in the 
Nonarb condition, and subjects in both Arb 
and Nonarb conditions will express signifi- 
cantly more aggression against the experi- 
menter than subjects in the Control condi- 
tion. These relationships will be investigated. 
for high and low n Approval subjects sepa- 
rately as well as combined, 

The n Approval effect was significant 
(Table 2): high n Approval subjects were, 
significantly less aggressive „on REQ-E than 
low n Approval subjects across all conditions l 
(F = 4.32, p < .05).2 A comparison of the 
means of high and low n Approval subjects 
for each of the conditions reveals that the 
difference between them is greatest in the 
Arb condition (¢ = 2.16, p < .05); next in | 
the Nonarb (¢ = 1.34, p = .20); and least in 
the Control condition (¢ = 12); l 

The condition effect was highly significant 
(F = 25.53, p < .0005). Hypothesis 2 was 
supported by the highly significant differ- f 
ence obtained between the Arb and Nonarb 
conditions (¢= 3.80, p< .001), as well as 
between Arb and Control conditions (t = 7.13, 
$ < .0005) and between Nonarb and Control 
conditions (¢ = 3.32, 01 > p > .001). More- 
over, ¢ tests reveal that these differences 
between experimental conditions were sig- 
nificant for high n Approval subjects and | 
low n Approval subjects separately. Thus, | 
high n Approval subjects in the Arb condi- 
tion were significantly more aggressive than 
high n Approval subjects in the Nonarb con- 
dition (¢ = 2.31, .05 > p> .02); low n Ap- 
proval subjects in the Arb condition were 
significantly more aggressive than low n AP- 
Proval subjects in the Nonarb condition (t 
= 3.12, .01 > > .001). The difference bê- © 
tween low n Approval subjects in the Nonatb | 
and Control conditions was significant ae 
= 2.98, p < .01), and the difference betwee? 
high n Approval subjects in the Nonarb and 
Control conditions, while not significant, waS | 
in the predicted direction (¢ = 1.76, p < 10). 4 


5 All significance tests reported are two-tailed. 


NEED FOR APPROVAL AND AGGRESSION 


REQ-S Aggression 


The relationship between self-aggression 
and both n Approval and arbitrariness of 
frustration was investigated. Table 3 presents 
a summary of the cell means. Nonarb sub- 
jects were higher on REQ-S aggression than 
Arb and Control subjects (¢ = 1.85, p = .06). 
There was no difference between high and low 
n Approval subjects on REQ-S aggression. 


Changes in Systolic Blood Pressure 


An analysis of variance was performed on 
the base-line systolic blood pressure to deter- 
mine whether any sampling bias existed in 


è TABLE 2 


REQ AGGRESSION AGAINST THE EXPERIMENTER AS A 
FUNCTION OF n APPROVAL AND ARBITRARINESS 
OF FRUSTRATION 


Source df MS F 
a 
n Approval (A) 1 367.54 4.32* 
Frustration (B) 2 2171.79 25.534 
AXB 2 88.28 1.04 
Error (w) 54 85.07 
Total 59 


Means on REQ-E of high and low n approval 
within each condition 


Condition | Low n Approval | High n Approval] Total 
Arb 6.90 5.64 6.27 
Nonarb 5.07 4.29 4.69 
Control 3.33 3.26 3.29 

Total 5.10 4.40 4.75 


wn? S05, two-tailed test. 
b <.0005, two-tailed test. 


assigning subjects to conditions, and whether 
there were any initial differences in base line 
between high and low n Approval subjects. 
An analysis of variance was also performed 
on “task arousal” (change in blood pressure 
from base line to Trial 2) to determine 
‘whether task alone had differentially raised 
blood pressure among conditions, or between 
high and low n Approval subjects. All Fs 
. Were insignificant. 

Hypothesis 3. The Arb and Nonarb condi- 
tions will elicit significantly more “frustration 
arousal” (Trial 3 minus Trial 2 blood pres- 
sure) than the Control condition. No differ- 
ence was anticipated between the Arb and 
Nonarb conditions. The relationship between 
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TABLE 3 


Means on REQ-S or HicH AnD Low n APPROVAL 
SUBJECTS WITHIN EACH CONDITION 


Condition | Low n Approval | High n Approval Totaj 
Arb 5.00 5.40 * 5.20 
Nonarb 6.10 6.70 6.40 
Control 6.15 4.45 5.30 

Total 5.75 5.52 5.63 


n Approval and “frustration arousal” was 
also investigated; no difference was antici- 
patęd. 

Table 4 presents the summary of results, 
The condition effect was significant (F 
= 3.54, p < .05). Further analysis indicated 
that the Arb and Nonarb subjects were sig- 
nificantly more aroused than the Control sub- 
jects following the frustration procedures (t 
= 2.40, p = .02). While a difference between 
Arb and Nonarb conditions was not antici- 
pated, a # test was performed to investigate 
the possibility that one of the experimental 
conditions was in fact more physiologically 
arousing than the other. No reliable differ- 
ence was obtained (f= 1.13, 40 >p > .20). 

There was no significant n Approval effect 
(F = 41); high and low n Approval subjects 
were not differentially aroused by the frustra- 
tion procedures. 

The following are the results of explora- 
tory investigations of the physiological con- 
comitants of expressing aggression. Previous 


TABLE 4 


Frustration AROUSAL* AS A FUNCTION OF n APPROVAL 
ARBITRARINESS OF FRUSTRATION 


AND 
Source af MS F 
n Approval (A) 1 41.67 .42 
Frustration (B) 2 354.82 3,54* 
AXB 2 49,72 .50 
Error (w) 54 100.15 
Total 59 


Mean frustration arousal* of high and low n approval 
subjects within each condition 


Condition | Low n Approval | High n Approval Total 
Arb 5.10 0.00 2.55 
Nonarb —.60 —1.60 —1.10 
Control —6.40 —5.30 —5.85 

Total —.63 —2.30 —1.47 


a Changes in blood pressure from}Trial 2jto Trial 3. 
*p = .05, two-tailed test, 
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TABLE 5 


Post-REQ DECREASE IN AROUSAL AND n APPROVAL 
AMONG SUBJECTS WHO Express 
“HIGH AGGRESSION” 


3 Approval Decrease No decrease Total 
High 1 12 13 
Low 11 7 18 

Total 12 19 31 


Note.—x? = 6.97, 


research suggests that expressing aggression 
can be tension reducing (“cathartic”) or 
arousing (due to “aggression anxiety”) de- 
pending upon the amount of punishment or 
rejection anticipated as a result of expressing 
aggression. 

No significant relationships were obtained 
between the two variables in any of the con- 
ditions. The Spearman rho correlations were 
+ .20, .00, and — .21 in the Arb, Nonarb, and 
Control conditions, respectively, (While + .20 
is not significantly different from .00 or —.21, 
the pattern of these correlations appears 
meaningful, and we will discuss this pattern 
in the Discussion section.) However, closer 
investigation of the data resulted in the find- 
ing that of those who expressed more than 
the average amount of aggression in each 
condition (31 subjects above the mean ag- 
gression score in the three conditions) low n 
Approval subjects did manifest tension reduc- 
tion; 
(Table 
tionship 
rule out 


“low aggres- 
sors” was analyzed and the resulting xX? was 


ruled out was 


Approval sub- 
e tension reduc- 
tion merely because low arousal limits the 
amount of subsequent reduction possible., 
The comparison of high and low n Approval 
subjects on Trial 3 blood pressure, yielding a 
x" of .07, indicated that the difference be- 


low n Approval 
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subjects on tension reduction was not just a 
function of limits imposed by postfrustration 
elevation.) 5 

In sum, the catharsis hypothesis does gain 
some support, but only from individuals low 
on the need for approval; and the aggression- 
anxiety hypothesis gains some support, but 
only from individuals high on the need for 
approval. 


Discussion 


The results of this study clearly support 
the hypothesis that, following an experimen- 
tally induced frustration, high n Approval 
subjects express less aggression against the 
experimenter-frustrater than Jow n Approval 
subjects. As pointed out earlier, this hypothe- 
sis was based on the theory that individuals 
highly motivated toward gaining approval 
would inhibit such socially unacceptable be- 
havior. However, we noted that a difference 
in aggressiveness could also be attributed to 
differential perception of the source of frustra- 
tion or differential arousal. Since we found 
that there was no difference between the two 
groups on self-aggression or on frustration 


arousal, we tentatively conclude that the 


difference between the groups was due mainly 
to greater strength of inhibition among high 
n Approval subjects. It appears that aggres- 
sion is a “dangerous” type of behavior, ego. 
alien to individuals highly motivated toward 
gaining approval and avoiding rejection. This 
interpretation receives further support from 
the finding that when these individuals do 


aggress, their blood pressure remains ele- 


vated, in contrast to low n Approval subjects 
whose blood pressure declines following a8- 
gression. The reaction of the former seems 
to fit the aggression-anxiety hypothesis; the 
reaction of the latter seems to fit the catharsis 
hypothesis. 

It is important to note that high n Ap 
proval subjects did not uniformly inhibit 
aggression. (They were significantly more 
aggressive in the Arb than in either the 
Nonarb or Control conditions.) Our findings 


ê This conclusion must remain tentative since it A 
presently not possible to physiologically dine 
the various emotions, and thus we cannot denna 
assume that the finding of equal arousal indica’ 
equal instigation to aggress (cf. Footnote 2). 
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thus seem to run counter to Conn’s (1963) 
speculation that high n Approval subjects 
deaf with angry feelings by repression and 
reaction formation. 

That both high and low n Approval sub- 
jects expressed more aggression as the situa- 
tion became more appropriate for such be- 
havior was, of course, a confirmation of our 
hypothesis regarding the dimension of arbi- 
trariness (reasonableness) and its relation to 
aggression, In contrast to Kregarman and 
Worchél (1961), we obtained significantly 
more aggression iff an unreasonable (Arb) 
than a reasonable (Nonarb) situation. Our 
, Physiological data provide some indirect sup- 
port for the assumption of equal instigation 
to aggress. However, as noted earlier, physio- 
logical arousal may "reflect not only anger at 
the experimenter, but also anger at the self. 
We did find a tendency for more self- 
aggression among Nogarb than Arb and con- 
trol Subjects (and perhaps the difference 
would have been significant had our REQ-S 
measure included more than two items). 

eThe correlations between tension reduction 
and aggression against the experimenter were 
not significant in any of the conditions, but 
the pattern of relationships seems worth men- 
tioning. Where aggression against the experi- 
menter is most appropriate (Arb condition), 
aggression tends to be followed by tension 
reduction (correlation is positive); where ag- 
gression against the experimenter is least 
appropriate (Control condition), aggression 
tends to be followed by elevated blood pres- 
. sure (correlation is negative); and where 
appropriateness is ambiguous (Nonarb condi- 
tion), there is no consistent relationship be- 
tween aggression and tension reduction (cor- 
relation is zero), We have suggested that 
tension reduction following aggression sup- 
Ports the catharsis hypothesis; and that ele- 
` vation of blood pressure following aggression 
supports the aggression-anxiety hypothesis. 
We seem to have some slight evidence in 
support of both hypotheses. Depending upon 
the appropriateness of aggression in the situa- 
tion, either catharsis or aggression anxiety 
‘results from expressing aggression. 

In conclusion, the Arb-Nonarb distinction 
1 turned out to be an important one in pre- 
dicting aggression following a frustration. 
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However, while we found fewer aggressive 
responses against the experimenter in the 
Nonarb condition and equal physiological 
arousal, it is still not clear whether the 
difference is due to differential instigation’ to 
aggress against the experimenter or’ to dif- 
ferential response inhibition. The tendency 
for greater self-aggression in the Nonarb 
condition may indicate support for the for- 
mer. However, this may be so only in non- 
arbitrary frustration situations such as ours 
where failure is an important feature. Per- 
haps future studies, by comparing failure 
and nonfailure types of nonarbitrary frustra- 
tions, will be able to clarify the issue. 


We can make two general statements about 
the individuals highly motivated towards 
gaining approval, First, evidence shows that 
they (relative to those less motivated) are in 
conflict about expressing aggression. Second, 
these individuals can and do respond to 
situational cues determining the appropriate- 
ness of expressing aggression, but always 
within the limits imposed by their general 
disposition to inhibit aggression, 
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USE OF THE SEMANTIC DIFFERENTIAL TO STUDY 
ACCULTURATION IN AMERICAN INDIAN 
ADOLESCENTS * 


MALCOLM M. HELPER anp SOL L. GARFIELD ? 


Nebraska Psychiatric Institute 


232 Indian adolescents in an Indian boarding school and 123 white adolescents 
in a community high school rated concepts referring to values thought to be 
distinctive in the 2 groups. Academic achievement test scores were used as an 
independent measure of acculturation within the Indian group. Indians and 
whites, and high- and low-achieving Indians, differed in response biases. In- 
dians ani whites were found to ‘be highly sinfilar on overall rank ordering of 
the concepts, as were high- and low-achieving Indians. Nonetheless, apparently 
consistent differences were noted between the races for certain concepts. Such 
differences were also noted between high- and low-achieving Indians. Most of 
thesg latter differences put the high achievers closer to white norms than the low 
achievers. With appropriate precautions against response bias, the semantic 
differential appears potentially useful in studying acculturation and attitudes 


toward ethnic group membership. 


Past research on values and value conflicts 
in the°American Indian has relied largely on 
projective methods and open-ended question- 
naires (Macgregor, 1946). While these meth- 
ods have yielded important results, they often 
involve subjective judgments, which can be 
unreliable, and difficult to replicate. The pres- 
ent study represents an attempt to apply the 
technique of the semantic differential to the 
question of the assimilation of values by 
adolescent Indians. This technique is objec- 
tive and has the advantage of relating re- 
search in this area to a broader body of 
psychological theory and research. 

While the semantic differential has been 

. used to compare the values of one cultural 
group with those of another (Maclay & Ware, 


1 This study was supported by United States Pub- 
lic Health Service Grant MH 967, “Mental Health 
Project for Indian Boarding Schools.” The assist- 
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Director, and John Bjork, Co-director, of this 
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of the Flandreau Indian School, and to Sylvan 
Vigness, Superintendent of the Flandreau Com- 
munity Schools, for arranging for the cooperation 
` of the students and teachers in their respective high 
schools. Barney Watson of the Data Center, Vet- 
erans Administration Hospital, Omaha, rendered 
Valuable assistance in the programing of the IBM 
1620 computer, and arranged for the necessary 
Processing time. 
2Now at Teachers College, Columbia University. 
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1961; Osgood, 1960; Rosen, 1959; Triandis 
& Osgood, 1958), it apparently has not been 
applied in the situation in which the value 
systems of two cultures impinge upon the 
same individual. In the present study, the 
semantic differential technique was used to 
compare values of American Indian and white 
adolescents, and within the Indian group, to 
determine whether semantic differences can be 
detected between those showing high and low 
acculturation by another criterion (academic 


achievement). 


METHOD 


The method used consisted of three steps: (a) De- 
termining the number and nature of differences be- 
tween Indian and white groups on a semantic dif- 
ferential designed to tap the connotative meaning of 
concepts related to distinctive values of both groups. 
(b) Within the Indian group, determining the dif- 
ferences on the instrument between subgroups show- 
ing high and low acculturation by the achieving 
criterion, (c) Assessing the direction of these 
semantic differences to determine if the groups with 
high academic acculturation tend to be more like 
the whites semantically than the low academic 


acculturation groups. 


Subjects 

American Indian. The freshman and senior classes 
of the Flandreau Indian School (FIS) located in 
Flandreau, South Dakota, provided data for the 
study. FIS is a boarding school serving 18 reserva- 
tions in North and South Dakota, Nebraska, 
Wyoming, and Montana, and includes individuals 
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from various Plains Indian groups. Students are 
sent to FIS for a variety of reasons, including 
educational needs which cannot be met in the local 
reservation schools, family instability, and adjust- 
ment problems of the individual. More than one 
factor may be operative in a given case. The aim 
of the school is to provide a basic high-school educa- 
tion, plus opportunities for acquiring vocational 
skills. In this study, the entire freshman and senior 
classes were tested, resulting in a sample of 101 
boys and 131 girls. 

Whites. The public high school in the town near 
which FIS is located provided the white subjects. 
Freshman and senior classes were again involved 
in their entirety, resulting in 62 boys and 61 girls. 
Students of Indian extraction in these classes* filled 
out the questionnaires, but these were not included 
in the analyses. 


Semantic Differential 


Ten scales were used, three for each of the three 
major factors found in previous studies (Osgood, 
Suci, & Tannenbaum, 1957) plus an additional 
scale included on a trial basis, The scales, listed 
according to the factor represented, were as follows: 
Evaluation—happy-sad, fair-unfair, bad-good; Po- 
tency—weak-strong, hard-soft, manly-womanly; and 
Activity—fast-slow, cold-hot, dull-sharp. The experi- 
mental scale was like me-not like me. It will not be 
reported in this paper since it was not used in 
deriving any of the factor scores, 

The 15 concepts included are listed below, together 


with comments regarding the rationale for their 
selection. 


Race labels: INDIAN and WHITE PERSON were in- 
cluded to determine whether the technique would 
show differences in attitudes toward ethnic group 
membership, 

Time concepts; THE FUTURE, BEING ON TIME, and 
PLANNING AHEAD were included because it was felt 
that concern about scheduling and about the 
future were hallmarks of the dominant culture, 
and much less important in the Indian culture 
(Macgregor, 1946), 

Self-concepts: 


É that have been 
hypothesized to accompany acculturation (Hartley 


culture among the Indians, 
School-related conce 
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Specific behaviors: DRINKING, SPEAKING EI 
and GETTING MAD were included because 
thought, on the basis of initial contacts w 
sonnel of the Indian school, that they 
have different connotations in the two 

The concepts were presented two to a 
8-page, multilithed, booklets, each cone 
followed by the 10 scales in the following for 


THE FUTURE 
fast. 


weak. z z 3 3 : 
etc. 


The concepts were presented in the same 
for all subjects. The scales were presented 
same order for each concept, in the follo 
and left-right orientations: fast-slow, wi 
happy-sad, hard-soft, fair-unfair. cold-hot, 
not like me, bad-good, manly-womanly, di 

Testing was done in a regular classroom 
no more than 35 subjects in each group, O 
structions and blackboard demonstrations 
given to supplement printed directions. The 
menter introduced the study as part of 
Project aimed at uncovering how high-schoo 
dents viewed various ideas, with special 
tion of differences between freshmen and. 
At no time were Indian-white compatiso! 
tioned to either set of subjects. 

A sentence-completion task was administe 
ing the same testing session, prior to the s 
differential, Results from this instrument ar 
analyzed separately. f 


Independent Measure of Acculturation 


Study of acculturation within the Indian 
requires an independent measure of accepti 
white culture by the Indian subjects. It 
cluded that the academic achievement test 
already available on these subjects would 
propriate. Acquisition of formal academic s 
knowledge, which are almost totally foreign 
toric Indian culture, would seem to repre 
crucial aspect of acculturation. As noted belo 
scores of the Indian subjects on the test 
well below white norms, suggesting that fact 
than basic learning ability are important | 
Indian scores. These other factors would pri 
bly include amount of formal or informal 
to white culture, and attitudes toward act 

The scores available were on the Iowa 
Educational Development, Form X3-CP. Th 
centile value of the Composite score on th 
was used. The median percentile for the 
Students was 32, and for the senior studel 
Only five subjects in the entire Indian group 
above the seventieth percentile. This und 
ment relative to white standards is similar 
found in other culturally deprived minority 
(Kennedy, Van DeRiet, & White, 1963). 
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RESULTS 
Invid Responses 


‘For each subject, the 10 ratings on each 
concept were scanned (via a computer pro- 
gram) for omissions and for sterotyped re- 
sponse patterns. A subject’s ratings on a 
given concept were considered invalid and 
were discarded from further analysis if they 
had any of the following characteristics: 
1 or more blank scales; 8 or more iden- 
tical re$ponses; a pattern of strict alternation 
of extremes for all 10 responses. By thesé cri- 
teria, 9.2% of all concepts rated by Indian 

, Subjects were invalid, as compared to 4.1% 
of those rated by white subjects. WHITE 
PERSON had the $reatest proportion of invalid 
responses in both groups, 19% among In- 
dians, 13% among whites, DRINKING was sec- 
ond for Indians (16%), while INDIAN was 
second for whites (11%). ME AS I WANT TO BE 
had a* minimal number of invalid response 
patterns in both groups, 5% for Indians, 1% 
for whites. 


Independence of Factors 


For each race, mean scores on the three 
factors were computed for each concept for 
each sex separately, Within each group, rank- 
order correlations were computed among the 
three factors over the concept means in order 
to determine the extent of similarity between 
the factor scores, In each group, the correla- 
tion between concept rankings on the Evalu- 
ation and Activity factors proved to be higher 

. than the other two correlations. In three of 
the four groups, this correlation was over 
.90 (the value in the fourth group, white 
boys, was .76). Potency was largely inde- 
pendent of the other two factors in girls of 
both races (rho’s ranging from .12 to .30), 
but was significantly related to them in boys 
of both races (rho’s ranging from .45 to .79). 
It thus appears that the three factors tap 
very similar dimensions in this sample of 

i subjects and concepts, except for the differen- 
tiation of the Potency factor from the other 
two by girls. 


Comparison of Indians and Whites 


Examination of the grand means for the 
three factors (over the 15 concepts) indi- 
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cated differences between the racial groups 
in the use of the rating scales, Indian boys 
had higher overall means on all three fac- 
tors than white boys, the discrepancy ranging 
from .24 scale unit for Evaluation to .10 for 
Activity. Furthermore, comparison ‘of the 
means occupying each rank for Indian and 
white boys indicated that the tendency 
toward higher means for Indians occurred at 
all levels, and did not arise from higher rat- 
ings by them on just a few concepts. Con- 
stant differences of the magnitude observed 
would have a significant impact on the results 
of ¢ tests, since the standard error of most 
of the means involved falls between .10 and 
.20 scale unit. For girls, the greatest differ- 
ence on overall means (.13 scale unit) was 
found for Activity and was in the opposite 
direction from that for boys. 

Because of the apparent differences in re- 
sponse habits, Indians and whites were not 
compared on concept means directly, but 
rather with respect to the relative ranks oc- 
cupied by the concepts, Table 1 presents the 
rankings of the concepts on the three factors 
by Indian and white groups of each sex. A 
rank of 1 indicates the highest mean; a rank 
of 15, the lowest. 

The general level of agreement between 
Indians and whites, as indicated by the rho’s 
in Table 1, is high, Except for boys on the 
Activity score, all of the correlations reach 
the .01 level. The significance of between- 
group differences in the ranks of individual 
concepts was not evaluated statistically be- 
cause of the apparent lack of appropriate 
tests for this kind of data, However, the 
consistency of these discrepancies in the 
two sex groups give some assurance as to 
their reality, Eighteen instances were found 
in which an Indian-white difference of two 
ranks or more occurred on the same concept 
and factor for both sexes. In 15 of these, 
the direction of differences was the same for 
the two sexes, significantly in excess of the 
9 expected by chance (x? = 6.72, p < .01). 


Comparison of High- and Low-Achieving In- 
dians 


The median achievement score was deter- 
mined for each sex in each school class, Those 
above the median in each group were consid- 
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TABLE 1 
RANKINGS oF CONCEPTS ON THREE FACTORS BY WIITES AND INDIANS 


Boys Girls 


Concept Evaluation Potency Activity Evaluation Potency Activity 


White | Indian | White | Indian | White | Indian | White | Indian White | Indian | White | Indian 


1. The future ca A ESSI T 6 6 | 10 7 7 5| 4 7| 6 

2. My father 7 9] 2 3 8 8 6 | 10 1 1 4]7 

3. Flandreau school 12 8 |13 10 | 4 | 41 12 8 9] 3 13 | 10 

4. Going home 5 2 sat 7 | 10 3 8 Det 6 oll | 45 
5. Drinking 1313 bio 13 | 13 | 14 | 13 13 | 10, | 12 14] 14 
6. White person TONELE? J2 5 |.o12 | 10 | 12 Be (eld 8 |13 

7, Quitting school 15°15 IS | 15 | 15 | 15 | 15 |r 14° | 13 15 | 45 

8. Me 6 (a | oe Ua | 3 6 9 9 | 15 | 15 9 | Og 
9, Being on time 4 10 8 9 2 9 5 11 4 8.5 6 | it 
10, My mother g 3 |14 14 4 7 4 3 | 13 | 14 2 ae ‘ 
11, Indian 11 6 | 6 2 7 2] 5 oy 1.2 10 | (2m 
12. Speaking English 8 5 |10 8 11 4 2 4 7 | 10 3} 45 
13. Me as I want tobe| 1 1 1 1 1 1 1 1 12 8.5 1 1 
14. Getting mad 14 | 14 | 9 TE E a Di jal Pad a Y Oe ae 12 | 12 
15, Planning ahead 2 4/5 5 9 5 3 6 6 5 5 | 8 


Rho (White versus 
Indian) -80 85 -50 72 » 69 66 


ered to be high achievers (highs), and those pooled within each sex, as were freshmen and 
below were considered to be low achievers senior lows. Highs and lows showed differ- 
(lows). Freshman and senior highs were ences in response pattern which paralleled 


TABLE 2 
RANKINGS oF CONCEPTS ON THREE FACTORS BY LOW- AND HIGH-ACHIEVING ĪNDIANS 


Boysa Girls» 
Concept Evaluation Potency Activity Evaluation Potency Activity 
Tow | High | Low | High | tow | High | Low | HRA | tow | Hih | Low EE 
1. Future EESE OWO EE Pea bers 4 | 5 o SE 
2. My father EON E EA E ia a o |i 17/3 (3 
pee St ule ee E E a E | 2 | Zula 
4, Going home 1 4 8 8 3 4 6 2 5 8 4 4 
5. Drinking ap a [RR TST silsc resin one boa oa 
6. White person eR Ee ales aca a ra diza 
ge ane seneo asi 145.0 aia 115%, his vs] tel. [22 15 }13 [14 [15 | 1 
SMe A S e E A E E a | (46 (48° a 
9. Being on time 10 6 9 9 9 OIEA T: e12 Oa 9 
10. My mother Peon neta E E E NN tIs asi fits ef-42~. | thn alae 
11. Indian ; EEE e S A tvs...) a> | | 2.71 
reid atta eb ge EE AAEE E 4 |10 |7 |6 |2 
13. Me as I ule tobe} 2 1 1 1 1 1 1 1 4 9 1 ale 
. Getting ma 4 14 fio Ju ha Se 
15. Planning ahead E aaa al er 5 T y es |'3 lioo | 6 
pho ioe versus 
gl -16 
T 93 84 92 64 89 | 
High versus white t 
criterion) T| 85) 83] sol 39| 46] 64 73| a| .71| o 3 


Vs were variable from concept to concept because of the screenin; 


ae wsi 
low boys, 34-39; high boys, 39-47; low girls, Aaa iig girls, 4g-$8 invalid response patterns, Ranges of M were as follo 
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those between the Indian group as a whole 
and the white group. Low boys had overall 
meafis which exceeded those of high boys by 
.38, .10, and .21 scale unit on the three fac- 
tors. On each factor, the high boys have the 
response pattern more similar to that of white 
boys. The low girls among the Indians 
showed a positive bias of .12 scale unit (as 
compared to the highs) on Evaluation, but 
a negative bias on Potency and Activity. 
The high girls were thus the more like white 
girls on’ the Evaluation and Activity factors, 
while the lows had*the, greater similarity to 
whites on Potency. 

Concepts were ranked for the highs and 
lows just as they were for the white versus 
Indian comparisons in Table 1. Table 2 lists 
these rankings. 

As one approach to the hypothesis that 
highs are more similar to whites than lows, 
the concept rankings gf highs and lows were 
each Correlated with the rankings from the 
appropriate white criterion group. These cor- 
relations, shown in Table 2, indicate, as ex- 
pested, closer agreement in each case between 
the highs and the white norms than between 
the lows and the white norms. The differences 
are not large, but it should be noted that they 
are markedly restricted by the high similarity 
of the high and low Indian groups to each 
other, as indicated by the low versus high 
tho’s in Table 2. In other words, the relatively 
few differences which do occur between the 
high and low Indian groups tend rather con- 
sistently to put the high group closer to the 

. white norms. This can be studied more di- 
rectly by noting, for each difference between 
highs and lows, which of the two groups stands 
in the same relation to the total Indian group 
as does the white group. In the 27 instances in 
which whites differ from Indians by two or 
more ranks and the high Indians also differ 

‘from low Indians by. this amount, the high 
group is found to parallel the whites in 21 in- 
stances, while the lows do so in only 6. On the 

_ assumption that each of these differences could 
have gone either way by chance alone, the 
chi-square value for the observed distribu- 
tion is 7.26, significant at the .01 level. 

High-achieving Indians thus appear to be 
more like whites than do low-achieving In- 
dians, both with respect to response bias and 
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with respect to the ranking of value-related 
concepts on semantic factors. 


DISCUSSION 


e 

Perhaps the first lesson to be learned from 
the present study is that response sets can 
differ in different cultural groups, and can 
seriously affect the interpretation of question- 
naire data. The data also suggest that the 
response biases themselves may be susceptible 
to the influences of acculturation, 

A.second methodological point is that se- 
mantic ratings of these value-oriented con- 
cepts are highly saturated with the evaluative 
factor, Concept rankings on Evaluation and 
Activity are highly correlated in all groups, 
and these two rankings are each significantly 
correlated with Potency rankings in boys. 
Girls apparently differentiate Potency from 
the other two factors most markedly in rating 
self, ideal, and mother concepts, all of which 
are seen as weak but good. 

The present study indicates, thirdly, that 
semantic ratings can provide evidence of both 
similarities and differences between a domi- 
nant culture and different subgroups of a 
minority population. In the present study, 
the similarity is evidenced by the high cor- 
relations between rankings of concepts by 
whites and by Indians. The concepts which 
produce the most agreement between the 
groups, and also between high- and low- 
achieving Indian subgroups, are concepts 
which are seen very negatively (DRINKING, 
QUITTING SCHOOL) or very positively (ME AS 1 
WANT TO BE). To what extent acculturation is 
a factor in this similarity is not, however, 
revealed by the present methodology. 

The evidence for differences between groups 
appears in their consistent difference in rank- 
ing of certain of the concepts. A major prob- 
lem introduced by the discovery of response 
biases, is, of course, the lack of significance 
tests for differences between ratings of indi- 
vidual concepts by different groups. None- 
theless, the Indian-white differences tend to 
be replicated in the two sex groups. More 
importantly, differences between Indians high 
and low on a measure of acculturation tend 
to parallel in direction those between whites 
and Indians. This finding indicates that both 
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kinds of differences are meaningful rather 
than accidental. 

This apparent displacement of high achiev- 
ers, as compared with low achievers, toward 
wifite norms also is in line with the initial 
hypothesis of the study, and suggests that 
semantic ratings offer a useful means of as- 
sessing the relative acculturation of sub- 
groups of minority populations, 

In addition to providing a general picture 
of similarity and difference between groups, 
semantic data may help elucidate attitude 
patterns around topics of specific relevance 
to the process of acculturation. In the pres- 
ent study, the results for INDIAN and WHITE 
PERSON suggest potentially important differ- 
ences in attitudes toward group membership. 

The data in Table 1 indicate that each 
race rates its own label more positively than 
the other group. does. It can also be noted 
that Indians rank rnpran higher than whites 
rank WHITE PERSON in every instance. Com- 
parisons within each racial group shows that 
Indians differentiate the two concepts on each 
factor to a much greater extent than whites 
do. Furthermore, the race labels stand in a 
different relationship to self and ideal concepts 
in the two racial groups. For Indians, INDIAN 
is closer than ME to ME AS 1 WANT TO BE in 
five of six comparisons; for whites, the re- 
verse is true—me is closer to the ideal than 
is WHITE PERSON in five of six comparisons, 
Indians thus appear to see their racial group 
as more valuable than themselves as indi- 
viduals, while whites tend to see their racial 
group as less valuable than themselves as 
individuals. This tendency to elevate race 
over self appears slightly reduced in high 
boys, who rate mr somewhat higher and 
INDIAN Somewhat lower on the evaluative 
factors than do low boys, but appears to be 
about equal in high and low girls. These re- 
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sults suggest that the concept INDIAN is more 
highly salient and more highly valued in 
Indian youth than WHITE PERSON is in whites, 
This conclusion stands in contrast to certain 
previous research which has indicated very 
substantial devaluations of their own group 
by members of disadvantaged minorities 
(Clark, 1963; Macgregor, 1946). This dif- 
ference may indicate only that semantic rat- 
ings tap more superficial, defensive attitudes 
and fail to reveal more deep-seated negative 
feelings; or it may be a function of the spe- 
cific groups studied. In ary event, the method 
appears to generate rather clear-cut contrasts 


in attitudes toward racial group membership. , 
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The study was designed to test by multiple factor analysis for the existence of 
16 hypothesized interpersonal interaction categories. In addition, it aimed to 
examine evidence for a predicted circular sequence of the categories found. 
3 experiments were conducted with both psychiatric outpatients and normal 
@ Ss. 14 of the interpersonal factors isolated were found to fit the circular order 
sequence „anticipated in 3 distinct samples. Other interpersonal conceptual 


schemes proposed by Foa, Schaefer, 
compared with the model suggested. 


ig A recent report (Lorr & McNair, 1963) 


presented evidenee for a circular rank order- 
ing of nine categories of interpersonal inter- 
action. In addition, the authors postulated 
an expanded set of 16 modes of relating 
interpersonally which they deduced from 
available data, The concern of the present 
report is with experimental data in support 
of the extended circular order. The specific 
aims of the study were to: test by multiple 
factor analysis for the existence of 16 hy- 
pothesized interpersonal behavior categories; 


. examine the evidence for an expanded circu- 


lar ordering of the interpersonal categories 
established. 

Three conceptualizations of interpersonal 
behavior have been reported in the literature. 
One scheme proposed by LaForge and Suczek 
(1955) involves a circular arrangement of 
8 interpersonal categories derived from a 
134-item check list of adjectives and short 
phrases. The categories are labeled: Man- 
agerial-Autocratic, Competitive-Exploitative, 
Blunt-Aggressive, Skeptical-Distrustful, Mod- 
est-Self-Effacing, Docile-Dependent, Cooper- 
ative-Conyentional, and Responsible-Over- 
generous, Schutz (1958) has developed a 
system *based on three basic interpersonal 
needs regarded as sufficient to account for 


` interpersonal behavior. The three needs of 


«Inclusion, Control, and Affection are evalu- 
‘ated by means of a set of Guttman scales 
called FIRO-B, Stern (1958) also has devel- 


‘1 Partial support received from Grant MY-3262, 
from the National Institutes of Health, United States 
Public Health Service. 

2 Now at Boston University Medical School. 


LaForgeSuczek, and Cattell are briefly 


oped a taxonomy of personal interaction 
characteristics although his primary interest 
is in the measurement of needs, Forty-two 
needs proposed by Murray (1938) are re- 
flected in 300 activities to which the re- 
spondent reacts by indicating his likes and 
dislikes. Stern found that his nine major 
clusters could be arranged into a circular 
order. Only six of the clusters, however, are 
interpersonal in character. 

The investigations reported here and pre- 
viously grew, in part, out of a critical exami- 
nation of the LaForge-Suczek, the Schutz, 
and the Stern interpersonal behavior cate- 
gories, The sufficiency of a tripartite scheme 
proposed by Schutz for accounting for all 
interpersonal interactions appears doubtful, 
even if the three needs are regarded as broad 
second-order factors. Schutz’ own factorial 
reports are inconsistent with the hypothesis 
of such a small number of factors. There is 
also a lack of supporting data for the inde- 
pendence of the LaForge-Suczek categories 
and little published evidence in support of the 
postulated circular ordering, Inspection of 
the adjectives defining the various categories 
also casts doubt on category homogeneity. 
Finally adjectives such as those in the Inter- 
personal Check List (ICL), have quite di- 
verse connotations and thus lack the precision 
of meaning necessary for the definition of the 
dimensions of interpersonal behavior. It thus 
seemed useful and important to establish 
rigorously a set of overlapping interpersonal 
categories on the basis of rated manifest be- 
havior statements, and to test for the pres- 
ence of a circular rank ordering. 
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METHOD 


Two experiments were conducted. The plan of 
procedure in both studies was to: hypothesize a 
specific set of interpersonal categories; construct a 
set of statements descriptive of each of the postu- 
lated factors for incorporation into an inventory; 
collect data on a substantial sample of patients or 
normals; obtain the product-moment correlations 
among the inventory variables; factor the table of 
correlations; obtain factor scores for each individual 
in the sample; arrange the correlations among fac- 
tor scores in the expected sequence and test for the 
presence of a circular order. 

The circular order concept applied here derives 
from Guttman (1954). The basic hypothesis is that 
qualitatively different traits in a given domain may 
have a rank order among themselves without be- 
ginning or end. Since a circle is a closed sequence, 
it describes such an order, In a correlation matrix 
exhibiting a circular order, the highest correlations 
are next to the principal diagonal which runs from 
the upper left to the lower right corner, Along any 
row (or column) the correlations decrease in size 
as one moves further away from the main diagonal 
and then increase again, A more rigorous statement 
is that the correlations of any specified variable 
with its neighbors decrease monotonically in size 
and then increase monotonically as a function of 
their sequential separation, Contiguous variables cor- 
relate positively while more distant variables corre- 
late close to zero. However, if personality variables 
are represented some of the correlations are likely 
to be negative, implying the presence of common 
bipolar elements, The central notion in this con- 
ceptualization is order and the idea of dimensionality 
is extraneous as Guttman is careful to point out. 
Guttman proposed a certain algebraic structure to 
explain manifest data that exhibit a circular order 
and called it a “circumplex.” Jones (1960) has also 
suggested a latent structure to account for the 
data, Since neither structure was tested here only 
the concept of a closed sequence of ordered varia- 
bles is utilized. 

To test the hypothesis of a circular rank order, 
the variables in the correlation table are 
in a sequence suggested by knowledge of variable 
similarity or on the basis of theory. When a circu- 
lar order exists the correlations will decrease and 
increase in size as a function of sı juential separa- 
tion and within limits of error SRS, The highest 
positive correlations will 
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occurred between Passive-Dependency and Nurtur. 
ance and was manifested by a negative correlation, 
In addition to the 9 categories found in that anal. 
ysis it was hypothesized that: Autonomy and With- 
drawal would fit between Mistrust and Abasiveness; 
Approval-Seeking, Conformity, and Responsibility 
would fit between Passive-Dependency and Nur 
turance; Attention Seeking would be located be 
tween Dominance and Sociability. 

The revised inventory (IBI») consisted of the 
statements most clearly defining the nine categories 
isolated earlier as well as a newly constructed set, 
All of the 144 statements in the inventory were 
completed by the rater by circling a Yesvor a No, 
Therapists, numbering 265, át 44 Veterans Adminis- 
ration Clinics rated 523 outpatients who had been 
in individual psychotherapy at least 3 months, The 
sample included neurotics, personality disorder cases, 
psychotics, and a few psychophysiological reactions“ 
The product-moment correlations among the varia- 
bles were grouped into 15 clusters. However, only 
14 factors were extracted by the multiple-group pro- 
cedure. These were then rotated to oblique simple 
structure. 

The analysis revealed that the hypothesized Aw 
tonomy and Withdrawal factors could not be estab- 
lished. The remaining four factors postulated were 
confirmed. However, the Conformity and Responsi- 
bility factors were rejected as being insufficiently 
interpersonal in nature. In addition the Passive 
Dependent cluster split into two distinguishable fac- 
tors: Submissiveness and Succorance. Thus, 12 cate: 
gories remained. 

A new key was developed on the basis of the 
factor-analysis findings. Each interpersonal factor 
was defined by 5 to 11 items, The IBI, forms 
were then scored for each of the 12 interpersonal 
factors retained. The product-moment correlations 
among the category scores exclusive of approvi 
seeking are presented in Table 1, Approval-Seeking 
was dropped because it clearly did not fit into part 
of the circular sequence. The correlations of each 
remaining category with all others were plotted 3 
ordinates against the rank-order sequence on the, 
abscissa. The fairly smooth curves obtained resem- 
bled open parabolas and provided support for the 
required monotonic order, The largest deviations 
from monotonicity are between Succorance: all 
Detachment, between Affection and Dominance, ar 
between Exhibition and Affection. Contiguous vatia- 
bles are all correlated positively as required. F 
major discrepancy in the anticipated sequer.ce is the 
location of Abasiveness after Succorance instead 0f 
after Detachment. | 


F | 


Experiment II 


On the basis of findings from Experiment 1 E 
interaction categories were postulated: Domma 
Recognition, Hostility, Mistrust, Autonomy, Deta a 
ment, Inhibition, Abasement, Submissiveness, P i 
corance, Deference, Agreeableness, Nuturance, aa 
tion, Sociability, and Exhibition. The added © 
structs hypothesized were Recognition, Autonomy! 
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TABLE 1 
4 CORRELATIONS AMONG 11 CATEGORY Scores or IBI; 
1 2 3 4 5 6 uf 8 9 10 11 
Ie 

1. Dominance 

2. Hostility 51 s 

3. Mistrust 31 64 

4, Detachment —15 21 33 

5. Submissiveness | —25 | —02 18 43 

6. Succorance —06 | —10 06 | —13 39 

7, Abasement —10 | —20 04 09 34 36 

8, Nurturance 09 | —33 | —27 | —50 | —29 | —01 21 

9, Affegtion —12 | —50 | —53 | —55 | —21 07 10 62 

10. Sociability 26 | —04 | —12 | —54 | —23 06 03 41 35 

11. Exhibition 5 | 19 13 | 33 | —22 07 04 21 04 45 


Note.—Decimal points are omitted. 


Inhibition, Deference, and Agreeableness, The con- 
cept af Autonomy was reformulated in light of the 
previous study. New behavior statements were con- 
structed consistent with a set of definitions of the 
underlying variables. The new items were added to 
those already defining the 11 categories in IBH. 
The revised inventory (IBI) consisted of 160 state- 
ments® Each interpersonal category was defined by 
10 manifest behaviors, The rating format was also 
altered to permit the rater to indicate how often the 
person rated exhibited the behavior in question; not 
af all, occasionally, fairly often, quite often. 

A nationwide sample of psychotherapists in pri- 
vate practice, in universities, and in other public 
clinics provided the ratings. Each of the 115 thera- 
pists was asked to describe several nonpsychotic 
patients he had been seeing in treatment for at 
least 4 months. Ratings were thus obtained on 525 
men and women in treatment. The product-moment 
correlations among the 160 variables were factored 
by the multiple-group method with use of an 
a priori hypothesis matrix (Horst, 1956) and then 


8 Copies of the inventories may be obtained from 
the first author. 


rotated to simple structure. The fit to the hypothe- 
sized structure was strongly supported. Since it was 
predicted that each variable would correlate highest 
with the factor to which it was allocated an objec- 
tive check was available. In no instance were fewer 
than 7 confirmations secured on the predicted cor- 
relations, The percentage of correct predictions for 
the 160 items was 84. 

To test the hypothesis of a circular order, a key 
for scoring was prepared on the basis of the factor 
analysis. A score was simply the sum of the unit- 
weighted items defining the factor. All patients were 
rescored on 15 categories, each defined by 7-11 
items. Since the Autonomy factor was still not suf- 
ficiently well defined it was excluded. The product- 
moment correlations among the total scores are 
given in Table 2. The category labeled Succorance 
failed to fit the circular order satisfactorily and was 
excluded. To check the degree of fit to the circular 
ordering the correlations of each category with all 
other categories were plotted as ordinates against 
the anticipated circular sequence. By and large a 
smooth curve resembling an open parabola could 
be fitted to each plot. It should also be noted that: 
contiguous variables are all correlated positively; 


TABLE 2 
CORRELATIONS AMONG 14 CATEGORY SCORES OF TBI; FOR PATIENTS 


1 2 3 4 5 6 7 8 9 10 11 12 13 4 
1: Dominance 
2. Recognition 75 
3. Hostility 53| 59 
4. Mistrust 21| 38| 69 
.5. Detachment —40|—21| 07| 27 
6. Inhibition —53|—32|—06| 26) 66 
7. Abasement —38|—28|—29| 03| 17| 48 
8. Submissiveness | —52| —39| —51 | —18 20| 46| 66 
9. Deference 00| 06|—37|—22|—16 04| 41| 49 
10. Agreeableness | —01| —09| —53| —44| —31 | —24 17| 28| 54 
11. Nurturance 10| 06| —40|—34| -35| —21| 25| 26| 44 | 70 
12. Affection 19| —01|—36|—48| —57 | —42| 09| 13| 43 61| 69 
13. Sociability 55| 39| 00|—17|—75|—57|—10|—10| 31 45| 49 | 64 
14. Exhibition zal 65| 50| 25|—4|—44|—16|—33| 05 —12| 05 | 22 | 54 


Note.—Decimal points are omitted. 
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TABLE 3 
CORRELATIONS AMONG 14 CATEGORY SCORES OF IBI; ror 290 NorMats 
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1 2 3 4 5 6 7 8 9 | 10 | 11 
i boost) 
1. Dominance 
2, Récognition 74 
3. Hostility 50} 57 
4. Mistrust 31| 52| 67 
5. Detachment —24/—06] 35| 36 
6. Inhibition —33/-17| 14] 33| 56 
7. Abasement —15| 00|—05| 24| 16| 50 
8. Submissiveness |—38|—27 | —29 | —02| 11| 47| 52 
9. Deference 10| 16|—14|—02|—18| 10| 39| 32 
10. Agreeableness —=11|—27 | —54 | —42 | —40 | —21| 14| 19| 39 
11. Nurturance —10|—32 | —46 | —34 | —33 | —13 |> 22] 19] 26 69 
12. Affection —05 | —27 | —54 —45 | —53 —28| 15| 19| 33 y, 72 
13. Sociability 38| 25|—15|—14|—74|—44| 00] 00| 33 44| 34 
14, Exhibition 65| 68] 47| 41|—21|—24|—02|—20| 09 |—22|—21|—10 37 


Note.—Decimal points are omitted. 


the increases and decreases in correlations are by 
and large monotonic; the obtained order agrees 
completely with that expected. As was noted earlier 
Succorance does not fit into the sequence. A few 
small deviations from monotonicity occur between 
Abasiveness and Submissiveness, At several points 
Deference exhibits correlations not consistent with 
its position, 


REsutts 
A Cross-Validation Study 


In the initial published report, data from 
normals and neurotics were pooled, An im- 
portant question is whether ratings of a group 
of normals by relatively untrained raters 
would yield the same interpersonal categories 
and the same circular order, The use of clini- 
cally untrained raters would provide a severe 
test of the factors emerging from therapist 


judgments, Ratings were secured on I 
from a group of 254 seniors and gradu 
psychology on “normal” men and 
with whom they were well acquainted 
least a year. Excluded from rating 
mediate family members, spouses, and 
friends of the opposite sex. Included 
290 individuals, both male and female, 
senting a diversity of occupations and 
economic classes. t 
Intercorrelations were obtained amo 
first 80 and the second 80 variables of IB 
Each set included 5 variables repr 
each of the 16 hypothesized factors. 
relations in each set were factored b; 
method of principal components with 
in the diagonals, The 16 factors with 
ciated latent roots of 1.0 or highei 


TABLE 4 
CORRELATIONS AMONG 14 CATEGORY Scores or IBI; ror 60 PATIENTS 
SO) 2 3 4 5 6 |7 8 9 | 10 |1 |12 
1. Dominance 
2. Recognition 74 
3. Hostility 37| 39 
4. Mistrust 12| 25| 68 
5, Detachment —28| —20| 16] 36 
6. Inhibition —47 | —34 | —02| 22 69 
7, Abasement —21 | —12 | —37 | —15 27| 48 
8. Submissiveness | —36 =39| —46 | —18| 22] 48 75 
9. Deference 24| 33|—33|—19 —10| 00| 59| 51 
10. Agreeableness 30] 13| —38 | —33 —=28| —13| 35| 30] 61 
i Nurturance 30} 27) —37] —24 —=25|—21| 34] 23] 69 78 
12. Affection én} 22) 40] —35|-55|—32| 371 27] 66 | 76 | 74 
13. Sociability | 22|-95|—16]—65]—s1! 04| —07 |. 49 | se | ss | 771 | 
+ Exhibition 67| 56) 24| 02 —=39 | —53 | —17 | —22 | 22 20 | 22 | 40 | 60° 


Note.—Decimal points are omitted. 
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then rotated directly to the hypothesized 
simple structure (Horst, 1956). In Set 1, 
Autdénomy could not be confirmed; in Set 2, 
Agreeableness was not substantiated. The 
number of confirmations of predicted highest 
correlations of each variable ranged from 6 
to 10 except for Autonomy and Agreeableness 
which were defined, as predicted, by 3 varia- 
bles each. The major factors postulated were 
thus confirmed within the normal sample. 

The game key applied in scoring the pa- 
tients was used to score the normal sample. 
The correlations among the category scores 
are shown in Table 3. The same circular 

eorder of categories obtained with patient sub- 

jects was found ,for the normal sample. Suc- 
corance again was excluded as it failed to fit 
into the expected sequence, Small discrepan- 
cies from monotonicity occur with other 
scores, Abasiveness should correlate nega- 
tively with Recognition but does not. Defer- 
ence should correlate negatively with Domi- 
nance and Recognition instead of positively. 
However, the degree of agreement with the 
cirtular order found for patients is clearly 
high, 

A final set of data was obtained on 60 neu- 
rotics who were rated by their therapists 10 
weeks after initiating once-a-week treatment. 
The correlations among the total scores based 
on the IBIs patient key are shown in Table 4. 
The sequence of Abasiveness and Submissive- 
ness appears to be somewhat in doubt as their 
correlations deviate slightly from monoton- 
icity. Nevertheless, the hypothesized circu- 

‘lar sequence of interpersonal categories is 
supported, 


Interpersonal Behavior Categories 


Statements illustrating each of the 15 inter- 
Personal behavior categories are given in 
` Table 5. Figure 1 illustrates the circular 
order, The term interpersonal is used here 
to refer to a relation between people. An 
. interpersonal situation is one involving two 
or more persons in which the individuals 
interact for some purpose (Schutz, 1958). It 
need not involve the physical presence of 
both members since the behavior may be 
determined by expectations of the behavior 
of others: not physically present. As may be 
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seen, the statements are descriptive of overt 
actions rather than inferred feelings. 
Dominance involves the tendency to lead, 
direct, influence, and control others, It is re- 
lated to Recognition which represents the 
tendency to seek and to compete for* recog- 


TABLE 5 


STATEMENTS EXEMPLIFYING THE 14 INTERPERSONAL 
CATEGORIES OF IBI; 


Category Statements 


Bosses his friends and associates 
around, 

Takes qarag of things when he’s with 

ople, 

Seizes opportunities to rival and sur- 
pass others. 

Strives for symbols of status and 
superiority to others. 

Ridicules, belittles, or depreciates 
others. 

Uses a sarcastic or biting type of 
humor, 

Mistrusts the intentions of others 
toward him. 

Expresses suspicion when someone is 
expecially nice to him. 

Acts businesslike and impersonal with 
co-workers. 

Keeps aloof from his neighbors. 

Shows discomfort and nervousness 
when people watch him at work or 
at play. 

Shows signs of self-consciousness with 
strangers. 

Blames himself when interpersonal 
friction with others occurs, 

Apologizes for not having done better 
when he completes a task. 

Gives in rather than fight for his rights 
in a conflict. 

Lets his friends or spouse push him 
around, 

Carries out orders of his superiors with 


aa 
Dominance 
Recognition 
Hostility 
Mistrust 


Detachment 


Inhibition 


Abasement 
Submissiveness 


Deference 


zest. 

Takes the role of helper or supporter 
of authority figures. 

Contributes positively as a member of 
some group or team. 

Relates to and treats people as equals. 

Listens sympathetically to others talk 
about their troubles. 

Puts aside his own work or pleasure if 
someone asks for help. 

Shows a real liking and affection for 
people. 

Acts close and personal with people. 

Invites friends and acquaintances to 
his home. 

Drops in to visit friends just to 
socialize, 

Draws attention to himself in a group 
by telling jokes and anecdotes. 

Acts the clown or amuses others at a 


party. 


Agreeableness 


Nurturance 


Affection 


Sociability 


Exhibition 
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Dominance 


Recognition 

> 
Hostility 
Mistrust 
Detachment 


Inhibition 


Abasement 


Exhibition 


Sociability 


Affection 


! Nurturance 


Agreeableness 


Submission 


Fic. 1. The interpersonal circle. 


nition and status. The Hostility category is 
tied to Recognition and represents the tend- 
ency to criticize, ridicule, punish, or aggress 
against others. Hostility leads to mistrust, 
the tendency to doubt or Suspect the atti- 
tudes, feelings, and intentions of others, Mis- 
trust is related to Detachment which de- 
scribes aloof, withdrawn, and seclusive tend- 
encies, Associated with Detached behavior is 
Inhibition, or activities involving withdrawal 
from attention and shyness with others, The 
tendency towards Inhibition is related to 
Abasement, a proneness to accept blame, to 
belittle the self, and to apologize. Following 
Abasement is Submission which is charac- 
terized by passivity, docility, and appease- 
ment. Abasive tendencies are related to Def- 
erence, or actions involving support and sery- 
ice to a superior or leader. Agreeablness or 
the tendency to be Cooperative, helpful, con- 
siderate, and equalitarian follows Deference. 
In Nurturance the Cooperativeness found in 
Agreeableness moves towards more active 
Support, sympathy, and counsel. Related to 
Nurturance is Affection, the tendency to ex- 
press liking, warmth, and friendliness towards 
others, Sociability, next in the sequence, in- 


volves a more active tendency to join groups, 
to be with people, to be gregarious. Finally, 
locking into Dominance is Exhibition which 
is characterized by attention-seeking activit 
ties involving another type of control evi- 
denced in Dominance, 


Discussion 


Foa (1961) has reviewed a wide range of 
reports regarding the structure of interper- 
sonal behavior. He interpreted these to sug- 
gest that a similar circular ordering was 
evident in data presented by Borgatta, a 
trell, and Mann (1958), by Schaefer, Re 
and Bailey (1959), and La Forge and Sucze 
(1955). In the same review Foa proposed 4 
facet structure based on four principal on 
ponents, Subsequently Foa (1964) modifie 


i 
, 


his proposed structure and suggested a circu- | 


lar order of eight types of relationships bas 


on three facets. These were content of be — 


havior (acceptance and rejection), ob 
behavior (self and other), and mode of P 
havior (love and status). È ious 
The facet model represents an inge 
beginning. However, it has certain a 
which require discussion, The “profiles 
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types are defined in abstract inferential lan- 
guage such as “Social acceptance of other.” 
The actual rated statements are overlapping 
and not especially helpful in identifying the 
actual interpersonal behaviors involved. For 
example a husband’s social acceptance of the 
wife runs as follows: “Isaac thinks his wife 
is very successful and especially esteems her 
personality and her actions.” In addition the 
facets, such as acceptance and rejection, 
status and love, are left undefined. Appar- 
ently acceptance may mean giving or receiv- 
ing; rejection may mean denying, withhold- 
ing, or a hostile act. Status appears to mean 
prestige, control, and possibly inclusion. Also 
the circular sequence is attained by unusual 
scoring. A low score means weak acceptance 
or strong rejection. Consequently a positive 
correlation between acceptance and rejection 
indicates that the higher the acceptance the 
lowerethe rejection, With conventional scor- 
ing the correlations between accepting and 
rejecting behaviors are negative. Foa has 
thus established only a half circle and not 
a‘complete circle. For instance, acceptance of 
other and social rejection of other (bipolar 
opposites) are located next to each other in 
Foa’s model. Finally it would appear that 
the number of facets required to explain 
known interpersonal interactions established 
in the present study should be greater than 
three, The Foa facets cannot account for the 
15 «categories found. 

Schaefer’s (1959) hypothetical ordering of 
maternal behavior has some similarity to the 
* sequence presented here. The categories are 
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concerned with the role of a parent towards 
a child and thus do not include peer-peer 
behaviors such as Deference or Abasement. 
Later Schaefer (1961) offered evidence from 
a variety of studies including Kassebatm, 
Couch, and Slater (1959) for a circular order 
among a broader range of social and emo- 
tional traits, His hypothetical circular order- 
ing was defined by Extraversion, Friendliness, 
Love, Conformity, Introversion, Withdrawal, 
Hostility, and Aggressiveness, These cate- 
gories are quite similar to those described in 
the"present report. 

The LaForge-Suczek (1955) categories are 
most similar to those presented here and con- 
form to much the same sequence. However, 
these 16 interpersonal categories have never 
been demonstrated to be independent. In- 
stead these authors postulate a correlational 
structure defined by three orthogonal fac- 
tors: Dominance versus Submission, Love 
versus Hate, and Intensity, The intensity or 
vertical axis of the cone represented is based 
on the numbers of items checked. The cate- 
gories not represented appear to be Detach- 
ment, Deference, Affection, Sociability, and 
Exhibition. 

To what extent are the Cattell (1957) pri- 
mary personality factors represented among 
the interpersonal categories? A major ob- 
stacle to comparison is the bipolar nature of 
most of the original Cattell rating scales. The 
factors obtained are as a consequence also 
predominantly bipolar. The approach taken 
here has been to let the data determine the 
unipolar or bipolar nature of the factor, Evi- 


TABLE 6 
° INTERPERSONAL CATEGORIES ISOLATED OR HYPOTHESIZED 
Lorr and McNair LaForge and Suczek Stern Schaefer Cattell 
Dominance Managerial Dominance Aggressiveness Dominance E+ 
R iti iti si :, : 
Hostility EE Aggression Hostility Soret aT 
Mistrust Skeptical-Distrust į a rotensi 
Detachment Rejectance Withdrawal ea ec 
Annee Mo Abasiveness Introversion (?) Sule Eron ea o+ 
Submission Docile ity @) ubmission 
Deference Deference Conformity Zeppia J— (?) 
Agreeableness Cooperative L @ p 
Nirtuance | Response Generous | Naats | Heed 
Sataa convenia SAN Extroversion (?) Cyclothymia A+ (?) 
Exhibition Exhibition 
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dence in the current study strongly supports 
the existence of a set of unipolar first-order 
factors. Logical and verbal opposites are not 
necessarily psychological or behavioral alter- 
natives, For instance, Submission is not nec- 
essarily the alternative for Dominance, An 
individual may not exhibit dominant be- 
havior, or he may be rebellious or an autono- 
mous lone wolf. Table 6 presents a rough 
matching of Cattell factors and the IBI 
interpersonal categories, 

As may be seen from Table 6, although 
the labels may differ, there is fairly good 
congruence among the various theoretical and 
empirical approaches, Both Foa and Schaefer 
have presented evidence in support of such 
convergences in the nature of interpersonal 
categories and in their circular rank ordering. 

There is much to be said in favor of the 
circular ordering of interpersonal constructs 
as opposed to a conventional factorial inter- 
pretation. The proposed circular ordering 
provides a useful conceptual organization of 
a set of otherwise unrelated constructs, The 
scheme has heuristic power insofar as it en- 
ables the experimenter to infer missing cate- 
gories, Moreover the arrangement is fruitful 
in clinical diagnosis and in understanding 
how various psychological defenses operate, 
Finally the ordering of behaviors should have 
developmental implications, It should provide 
a useful base for studying the Parental pat- 
terns which elicited and reinforced the be- 
haviors manifest in interpersonal acts long 
after the parent figures are gone, 
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THE LEVEL OF COOPERATION IN THE 
PRISONER’S DILEMMA GAME 


ANATOL RAPOPORT anp ALBERT M. CHAMMAH 
Mental Health Research Institute, University of Michigan 


70 male pairs, 70 female pairs, and 70 mixed pairs played Prisoner’s Dilemma 300 
times in succession. The performances of 4 populations were compared, namely, 
men playing against men, men playing against women, women playing against men, 
and women playing against women. Comparisons were made with respect to several 
indices, including not only the frequencies of cooperative choices but a number of 
interactfon indices based on conditional probaBilities of responses, On the whole, 
large differences were observed between male pairs and female pairs, the principal 
difference being in the considerably greater overall frequency of cooperative choices 
by men. When men and women play against each other, however, these differences 
are grased: the men become more like women and the women more like men. More- 
over no differences in the frequencies of cooperative choice were observed in the 1st 
2 plays of the sequence, which suggests that the overall differences are results of 
different interaction patterns in men and women rather than of different a priori 


SEX ‘DIFFERENCES IN FACTORS CONTRIBUTING TO 


propensities to cooperate. 


Thé two-person nonzero-sum game called 
Prisoner’s Dilemma has been studied experi- 
mentally in the past 5 or 6 years by several 
investigators (Bixenstine, Chambers, & Wil- 
son, 1964; Deutsch, 1958, 1960; Lutzker, 1960; 
McClintock, Harrison, Strand, & Gallo, 1963; 
Scodel, Minas, Ratoosh, & Lipetz, 1959). 
Data obtained from such experiments are 
tichly suggestive in the formulation of psycho- 
logical theories which have a bearing on con- 
flicting motivations for and against coopera- 
tion. Such a conflict is built into the game 
itself, 

The original anecdote called Prisoner’s 
Dilemma concerns two prisoners accused of 


‘the same crime and held incommunicado. 


Each has a choice of confessing to the crime 
or not. If both confess, both will be convicted; 
if neither confesses, they must be acquitted for 
lack of evidence. However, if only one of the 
prisohers confesses, he not only goes free but 


. gets atreward for having turned state’s witness; 


while the prisoner who refused to confess gets 
a more severe sentence than if he had con- 
fessed. The dilemma arises from the circum- 
stance that it is to each prisoner’s individual 
advantage to confess regardless of what the 
other does, but at the same time it is better for 
both of them not to confess than to confess. — 

The general form of Prisoner’s Dilemma 1s 
given in Matrix 1. 


C: Dı 
Cı Ri, Re Sı, Ts 
Dı Ti, S2 Pi, Pa 
Matrix 1 
Here Si < Pi< Ri< Ti; Sit Ti < IR 


(i = 1, 2). The game is symmetric if Sı = Ss 
Pı = Po, Ri = Ro, Ti = To. 

A play of the game consists of a choice of a 
row (Cı or Dı) by Player 1 and of a column 
(Cz or Dz) by Player 2 independently, that is, 
without communication. The entries in the 
matrix are the payoffs to Players 1 and 2, 
respectively, each pair of payoffs resulting 
from a pair of choices (of row and column) by 
the players. The “dilemma” is a reflection of 
the fact that in the absence of an enforceable 
agreement, Di (i = 1, 2) is the “rational” 
choice for each of the players, in the sense that 
the payoff associated with D; is larger than 
that associated with C;, regardless of how the 
other player chooses, whereas the paired choice 
CıCa is better for both players than DiDs. 

‘Another way of putting it is that “indi- 
vidual rationality” prescribes the choice of D, 
while “collective rationality” prescribes the 
choice of C. Hence there is no unambiguously 
optimal way of playing the Prisoner’s Dilemma 
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game, and, in fact, human subjects playing 
this game exhibit a large degree of variability 
in the frequency of their choices of the “co- 
operative” (C) strategy and of the “defecting” 
(D) strategy. 

In mest experimental studies of Prisoner’s 
Dilemma the frequencies of individual co- 
operative responses or of jointly cooperative 
responses in runs of repeated plays or in a 
sample of one-shot plays have been the 
principal, sometimes the only, dependent vari- 
ables. Instructions to the players, the payoffs, 
the numbers of games in a run, and the differ- 
ent population samples have been the usual 
independent variables. Some experimenters 
have sought correlations between the tenden- 
cies of individuals to cooperate (as manifested 
in the frequencies of C choices) with other 
independently assessed personality traits, for 
example, the F Scale (Deutsch, 1958, 1960) 
or the “Internationalism scale” (Lutzker, 
1960). 

It was found subsequently (Rapoport, 1964) 
that the correlations of C-choice frequencies 
between the members of pairs were consistently 
very high (typically 0.9-1.00) when pairs of 
individuals played long Sequences, the result 
of each play being announced, This result 
Suggests that the interactions between the 
members of pairs rather than inherent pro- 
pensities of individuals to cooperate are the 
dominant factors in determining a performance 
of a repeated Prisoner’s Dilemma game. If this 
is so, one would not expect discernible correla- 
tions between the frequencies of C choices and 
independently assessed individual personality 
traits when the game is played many times 
by each pair. Whatever effect individual pro- 
pensities may have on the performance of an 
individual is masked by the interaction effect, 

On the other hand, it is quite possible to 
assess cooperation propensities of different 
populations by comparing the C frequencies 
observed in independent (noninteracting) popu- 
lations. This was, in fact, done by Lutzker 
(1960) and by McClintock et al, (1963). 

In this paper we shall compare some aspects 
of performance in the Prisoner’s Dilemma game 
of men and of women, 


The frequency of Cooperative Choice C 
(defined as the fraction of such choices ina 
block of plays) will be one of our indices. This 
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quantity can be associated “with the per 
formance of each individual. In a given pair, 
therefore, we shall have Cı and C, associated 
with the respective players. Besides C (and 
its complement D = 1 — C) we shall examine 
quantities related to the performance of the 
pair, namely, CC (fraction of double coopera- 
tive choices), CD (fraction of unilateral cooper- 
ative choices by the first player), DC (fraction 
of unilateral cooperative choices by the second 
player, and DD (fraction of double defecting 
choices). Obviously C = CC + (CD + DC)/2. | 

We shall also be concerned with the fraction 
of a population which played the cooperative 
choice on the first and second plays, These „ 
indices will be denoted by C(1) and C(2), 
Next, the fraction of the paits which played 
almost exclusively CC on the last 25 plays 
will be denoted by Lec; similarly the fraction 
which played almost exclusively DD on the 
last 25 plays will be denoted by Lpp. The term 
“almost exclusively” shall be taken to mean at 
least 23 out of 25 responses. 

The next indices are measures of interaction 
between the members of pairs. We denote hy 
Pe,c, the product-moment correlation coeffi- — 
cient with respect to C, and Cs related to a J 
population. The nearer pe,c, is to 1, the more 
evidence there is for the statement “Coopera- | 
tion on the part of one member of a pair tends | 
to elicit cooperation on the part of the other, 
and similarly for defection.” If p is near Zeto, 
there is no evidence of interaction; while if p 
is negative this would be evidence that One 
operation on the part of one member of a pall 
tends to elicit defection on the part of the 
other and/or vice versa, On a priori grounds, 
Positive, zero, or negative values of p all seem 
plausible: positive values as evidence of imita- 
tion, near-zero values as evidence of the 
inability of the subjects to influence onè 
another in the absence of communication, | 
negative values as evidence that the attempts $ 
of one subject to effect cooperation are “eX 
ploited” by the other for greater gains accruing 
to the single defector. $ 

Another correlation coefficient can be intro- 
duced to measure the extent of correlation | 
among the subjects’ choices with a single 
session, This variable, which we denote by M 
(sometimes called the phi coefficient) is 4° 
fined by . 
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: (CC)(DD) — (CD)(DC) 


Po 


where CC denotes the frequency of the CC 
response, etc. 

The quantity po is actually the correlation 
coefficient obtained by setting each C choice 
equal to 1 and each D choice equal to 0. (Any 
other pair of constants will do just as well 
since po is invariant with respect to linear 
transfagmations of the variables.) 

The correlation Goefficient po can be taken 
to be a measure of the éxtent to which the two 
players of Prisoner’s Dilemma imitate each 
other in the course of a session. However, to 
get the full mgasure of this imitation one 
should correlate the responses of one player 
with the immediately preceding responses of the 
other. If the responses of Player 1 are matched 
with the immediately preceding responses of 
Player 2, we obtain æ measure of the extent 
to which Player 1 imitates Player 2, and vice 
versa. This coefficient with the built-in “shift” 
will be designated by pı. Similarly, if we wish 
wt can define p; (i = 1,2 --+) whose succes- 
sive values will reveal the “decay” of the 
imitation effect. In this study we shall examine 
only values of p1. 

There are also other psychologically interest- 
ing indices of interaction. For example, con- 
sider the probability that a given subject 
responds cooperatively following a cooperative 
response on the part of the other. We denote 
this conditional probability by 


tı = p(Ci|C’s), for the first player, and 
t2 = p(C2|C':), for the second player, 
where C’; refers to the choice on the immedi- 
ately preceding play. i 
Apparently pı is a measure of “responsive- 
ness to the other’s cooperation.” 
Next we define 


or = p(Ci| D’); 
ws = p(C:| D). 


Apparently, a is a measure of a tendency to 
cooperate in spite of the other’s failure to 


»cooperate. 


The two quantities just defined relate to 
responses to the other’s choices. We now define 
the corresponding indices relating to one’s 


VCC + CD) (CC + DOOD + CD)(DD + DC)’ 


own choices immediately preceding the chqice 
in question. Thus i 


m = p(Ci|C'); 
m = p(C2|C’.). 


The 7’s measure the persistence of the coopera- 
tive response. Finally, we define 


t = p(l D); 
h= p(C2|D"2). 


The ¢’s measure the nonpersistence of the 
defecting response. 

It will be convenient to speak of £, n, 1 — «, 
and 1—¢. The first two are tendencies to 
respond cooperatively to the other’s (self’s) 
cooperative choice; the last two are tendencies 
to respond noncooperatively to the other’s 
(self’s) noncooperative choice. Thus ¢ and 
1 —« are measures of the tendency to reply 
in kind, while n and 1 — ¢ are measures of the 
tendency to persist with the same choice. 

As our next set of indices, we define the 
conditional probabilities of cooperative re- 
sponses related to the previous paired choices, 
namely: 

æı: the probability that Player 1 chooses 
cooperatively, given that the previous paired 
response was CC, that is, 

xı = p(Ci]C%sC’2) 
and analogously, 
az = p(C2|CsC’2) 
n= p(Ci|C'D’s) 
ya = p(C2|D'sC’2) 
zı = p(Ci|D':C’2) 
z = p(C2|C’D'2) 
wi = p(Ci|D/D') 
w = p(C2|D'.D’2). 

We note that these conditional propensities 
are also psychologically suggestive. For ex- 
ample, x means essentially the probability of 
choosing cooperatively after one has cooper- 
ated and been rewarded. From the way the 
Prisoner’s Dilemma game is constructed, this 


suggests a resistance to the (ever present) 
temptation to defect, hence “trustworthiness.” 
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Next, y means essentially the probability of 
choosing cooperatively after one has cooperated 
and been punished. This suggests either 
“martyrdom” or persistent attempts to induce 
codperation by example or, perhaps, “forgive- 
ness.” Next, z means the probability of choos- 
ing cooperatively after one has defected and 
gotten away with it. This propensity suggests 
“repentence.” Finally, w represents the prob- 
ability of shifting to the cooperative choice 
after one has been punished for defecting. 
Since there is no point in shifting to the 
cooperative choice from DD unless one hopes 
that the other will do likewise, the propensity 
w suggests “trust.” 
As our final index we define 


(1 — y) (1 — z) 
M182 


Mı = 


and analogously M: by interchanging Sub- 
scripts 1 and 2. Note that the factors of the 
denominator represent the respective prob- 
abilities of repeated cooperation on the part 
of a unilateral cooperator and shift from defec- 
tion to cooperation on the part of a unilateral 
defector. If we make the reasonable assumption 
that on a particular play of the game (in the 
absence of communication) these events are 
independent, then the denominator represents 
the probability that a “martyr run” (a se- 
quence of unilateral cooperative choices) ends 
by the “conversion” of the defector to coopera- 
tion. By similar reasoning, the numerator 
represents the probability that the martyr run 
is ended by the martyr (the cooperator) who 
reverts to defection, while the defector con- 
tinues to defect. Thus M represents the ratio 
of the expected frequencies of “failures” of 
such martyr runs to “successes” (if we assume 
that the objective of the unilateral cooperator 
is to induce cooperation in the other). Clearly 
My refers to martyr runs when Player 1 is the 
martyr, and Mz to martyr runs when Player 2 
is the martyr. 


PROCEDURE 


Our subjects were recruited as volunteers for experi- 
ments whose nature was not known to the subjects until 
the instructions were given. Each subject was paid a 
subject fee of $1.35 per hour (a session lasted about 
2 hours) „with the understanding that any winni 
incurred in the game would be added (or losses sub- 
tracted) to (from) the subject fee. The payoff Points 
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were converted to money at the rate of 1 mill per point, 
so that the magnitudes of the total winnings or losses 
of a subject were comparable to that of his fee, Each 
pair played a Prisoner’s-Dilemma-type game 300 times 
in succession. The outcome of each play was announced 
by the experimenter. 


Each pair sat in adjoining booths, separated by a 


partition, so that the subjects were not visible to each 
other, but both could see and were seen by the experi- 
menter, who sat facing the subject. 

The conditions of the game (meaning the dependence 


of the payoffs on the paired choices) were explained to 
the subjects. A chart, corresponding to the payofl © 


matrix was constantly in front of each subject. After 
the conditions were apparently understood, the session 
started. At the experimenter’s signal (move 1, move 2, 
etc.) each subject pointed silently to one of two cards 
marked L (left) or R (right) corresponding, respec- 


tively, to Choices C and D. When both subjects had © 
pointed the experimenter announzed the resulting 


payoffs. No signaling or communication of any kind 
among the subjects was allowed, and there was no 
evidence that this rule was ever violated. As a rule the 
members of a pair did not know each other nor were 
they permitted to speak to each other before the 
experiment. 7 


REsvtts 


We shall be comparing three samples pf 
subjects, all from the University of Michigan, 
namely, 70 pairs of male students, 70 pairs of 
female students, 70 pairs of students in which 
the two players were of opposite sex. In the 
all-male and in the all-female pairs, the two 
players of a pair will not be differentiated. In 
the mixed pairs, however, the two players 


being of opposite sex, will be distinguished. ¥ 
We thus have four populations to compare, 


which we designate MM (N = 140)—all the 


subjects in the 70 male pairs, MW (NV = 70)—_ } 


the male subjects in the 70 mixed paits, 
WM (N = 70)—the female subjects in the 70 
mixed pairs, WW 
in the 70 female pairs. 

No subject participated in more than one 
session, nor, to our knowledge, had played 
Prisoner’s Dilemma before. 


For our games we used Prisoner’s Dilemma | 


games with seven different payoff matrices 0 


the type described above. Differentiation of the | 
results by games, made elsewhere (Rapoport ” 


1963), is not made in this paper. Results até 
reported as means, averaged over all the Te: 
sponses of all the individuals or pairs in eal 
of the four populations. 


The comparison of performances related t0 


(N = 140)—all the subjects ‘a 
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TABLE 1 
COMPARISON OF PERFORMANCES 
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Group | CC | CD | DC| DD| C |C(1)| C(2)| Lec| Lop] pcics| pa | M | è | n |1-t|1-o| x | y |1-s|d—w 
MM «51 | .08 | .09 | .32 | .59 | 53 | 48 | 57 | 13 | 97 | .51 | 2.4 | .78 | .25 5 | .75 | .85 | .40 |,.62 | .80 
MW 40 | .10 | .09 | .41 | 50 | .63 | .56 | .39 | .21 | .91 | 39 | 3.7 | .71 | .24 | .76 | .75 | .80 | .40 “67 | 79 
WM 40 | .09 | .10 | .41 | .48 | .63 | .54 | .39 | .21 | .91 | .35 | 2.6 | .69 | .22 | .78 | .73 | .78 | .44 | .71 | .78 
ww «23 | 11 | .11 | 55 | .34 | 53 | .49 | 17 | 30] .87 | .34 | 4.8 | .60 | .17 | 83 | .81 | .75 | .37 | .74 | .85 


all of the indices discussed above is shown in 
Table 1. 

In disctissing the comparison, we shall not 
be concerned with «the significance of the ob- 
served differences but only with their direction. 
In some cases, the significance of a difference or 
the absence thereof will be obvious. For ex- 
ample, the significance of the difference in the 
overall frequency of cooperative responses be- 
tween the MM and the WW population is 
hardly to be questioned in view of the sizes of 
the population samples. It can also be estab- 
lished by the weakest of the nonparametric 
tests (the sign test), in view of the fact that C 
was found to be smaller in the WW population 
in every one of the seven games when averaged 
over each game separately. 

On the other hand, the 1% difference in the 
frequencies of unilateral cooperative responses 
(CD) between MW and WM is almost cer- 
tainly not significant, since a difference of this 
magnitude is observed also between CD and 
DC in the MM population, where this differ- 
ence refers only to players labeled arbitrarily 
1 and 2 and moreover where the population 
sample is twice as large. 

However, as we have said, the significance of 


` the observed differences will not be our con- 


cern here, We shall view the observed differ- 
ences merely as suggestive of hypotheses or 
conjectures concerning psychological differ- 
ences between the sexes as reflected specifically 
in the Prisoner’s Dilemma game. If some of 
_ these hypotheses or conjectures turn out to be 
especially interesting or revealing, experiments 
can always be designed with ever tighter con- 
a trols on sufficiently large population samples 
. to enable the interested investigator either to 
establish the hypothesis firmly or else to 
reject it, - 

We tum to the sex differences suggested by 
Table 1, The most pronounced of these is, of 
course, the considerably smaller degree of 
cooperatién in the WW population compared 


with the MM population. This is apparent in 
the smaller values of C in the WW population 
and also in the comparison of Lee and Lpp in 
theetwo populations. Of the 70% of the MM 
pairs who “lock in” in the last 25 responses, 
over four times as many lock in on CC than 
on DD. Of the 50% who lock in in the WW 
population, twice as many lock in on DD than 
on CC. 

The next thing to note is that the per- 
formance of the mixed population is inter- 
mediate between the two populations of 
homogeneous pairs. It is as if the men have 
been “brought down” by the women and the 
women have been “brought up” by the men. 
This shows up in the values of C and in the 
comparison of Lec and Lpp. 

In fact, comparing all the indices, we see 
that whereas there are several pronounced 
differences when MM and WW are compared, 
there are hardly any differences of com- 
parable magnitude between the MW and WM 
populations. 

Let us therefore examine the evidence 
against the hypothesis that men and women 
differ in their inherent (i.e., unconditional) 
propensity to cooperate. We compare the 
fraction of our populations which choose C 
on the first play, that is, C(1). It turns out to be 
the same in the MM and in the WW popula- 
tion, namely, 53%. Likewise the fractions 
choosing cooperatively on the second play 
C(2) tum out to be nearly equal. Turning to 
the initial fraction of individuals choosing 
cooperatively in mixed pairs, we find that 
these fractions are higher than in the pairs of 
the same sex (63%) but again the fractions 
of men and women choosing cooperatively are 
equal. This near equality persists also on the 
second play. There is thus some evidence 
that both men and women are more prone to 
choose cooperatively when playing against 
the opposite sex, but there is no evidence that 
men are more likely to choose cooperatively 
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than are women from the very beginning. 
Whatever differences we observe in the per- 
formances of men and women must be at- 
tributed to the effects of interaction during the 
course gf the play. Apparently what is hap- 
pening is that paired players are pushed in 
the direction of more cooperation or in the 
direction of less cooperation, and whichever is 
the tendency, the paired players tend to be 
pushed in the same direction and so to become 
more like each other in the course of the session. 

This tendency for paired players of a 
Prisoner’s Dilemma game to become like each 
other is best reflected in the correlation indices. 
Observe first the high values of the correlation 
indices pe,c,. If the frequency of cooperative 
choices were completely determined by some 
inherent propensity of the individuals to co- 
operate, the values of pc,c, should be near zero, 
because the individual players were all paired 
at random. Instead the correlations of C1 versus 
Cy are near unity. This suggests that the paired 
players tend to become like each other in the 
course of the play. The correlations pı are 
numerically smaller than p¢,c, but still defi- 
nitely positive. The significance of the positive 
value of the correlations is definitely estab- 
lised by examining their distribution among 
the individual pairs.’ 

As we have seen, pı measures the extent to 
which players imitate each other in the course 
of their session of 300 plays. We see that the 
Strongest imitation effect is observed when 
men play men; the weakest effect is observed 
when women play women, When men play 
women, the effect is intermediate, but even 
here it appears that men imitate women to a 
greater extent than women imitate men. 

When we consider the special significance of 
the imitative tendency in the context of the 
Prisoner’s Dilemma game, we can make a 
conjecture about the real reason why men 
show more overall cooperation. Imagine two 
automata playing Prisoner’s Dilemma, using 
a purely imitative Strategy, that is, each 
automaton’s next response is identical to the 
other automaton’s response which has just 
occurred, while the first choice is determined 
by chance. These automata will be charac- 
terized by pı = 1.00. Under these conditions 


Of the 420 values of pı only 24 were negati 
all of these were numerically small, ec 
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only three types of protocol are possible, 
namely, 100% CC, 100% DD, and 100% alter- 
nating (CD, DC). Of these three protocols, the 
first clearly pays off the most to both automata, 
Therefore, in any situation where a bias exists 
in favor of largest payoff and where only a 
minimal “noise” is introduced into the system 
(in the sense of responses which do not corre- 
spond to the other’s last responses), the tend- 
ency will be for the 100% CC sequences to be 
favored. It follows that populations with a 
greater tendency to give “tit-for-tat” will have 
more frequent occurrences of CC sequences 
and hence will show greater overall frequencies 
of cooperation, But these are precisely the MM + 
pairs, as is evidenced by the men’s greater pro- 
pensity to imitate their partners. We therefore 
conjecture that the men’s higher cooperative 
response frequencies stem from their greater 
tendency to give tit-for-tat in situations of 
this sort. 

This tendency is revealed also in some other 
indices. Observe that ¢ and 1— w are both 
direct indices of the tendency to reply in kind. 
When men play against women men rate higher = 
on both of these indices than women: they 
tend to cooperate more in response to the | 
partner’s cooperation (¢) and also to defect 
more in response to the partner’s defection 
(1 — w). Note, however, that whereas men, | 
playing men, are more inclined to respond 
cooperatively to the other’s cooperative choice 
than to retaliate against the other’s defecting 
move (> 1— w), the reverse is true of 
women playing women (1 — w > &). In mixed 
pairs retaliation is somewhat stronger than 
positive response to cooperation in both men l 
and women. l 

We turn to the indices «, y, 2, and Wp% — 
rather x, y, 1 — z, and 1— w. Expressed in 
this way the indices are measures of the extent 
to which a player tends to persist in the same 
state following each of the four outcomes, 
namely, rewarded cooperation, associated wl 
æ; unreciprocated, hence punished cooperation | 
associated with y; unilateral, hence rewarde 
defection, associated with 1 — z; and punishe 
defection, associated with 1 — w. í 

Now there are certain stochastic models "E 
the learning process according to which the 
steady state frequencies of the several possible 
Tesponses in a rote learning situation are de- 
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rived to be monotone functions of the asso- 
ciated rewards. If such a model were applicable 
to subjects “learning” to play Prisoner’s 
Dilemma in repeat plays, one would expect 
that 1 — z would have the largest value, being 
associated with T, the largest payoff. The re- 
maining frequencies would be, in decreasing 
order, x, 1 — w, and y, being associated with 
R, P, and S, respectively. Turning to the 
values of these frequencies in our populations 
we see thesinequalities listed in Table 2. 

The rank order of these four variables de- 
parts from that prescribed by a simple sto- 
chastic learning model (assuming the payoffs 
as the reward) in that the two lock-in responses, 
namely, æ (C following CC) and 1— w (D 
folloWing DD) are both greater than both of 
the tendencies to persist in the unilateral 
state, namely, 1 — z (D following DC) and y 
(C following CD). Here women differ from 
men jn that with women the tendency to lock 
in on DD (1 — w) is stronger than the tendency 
to lock in on CC («), while with men the oppo- 
site is the case. This may be another important 
réason why women show much less overall 
cooperation than men when playing against 
partners of the same sex. When women play 
men (cf. the WM population) the two tenden- 
cies are about equal (see Table 2). 

Finally we turn to the index M, which, as 
we recall, is a measure of the extent to which 
failures of martyr runs outnumber successes. 
The values of M in the four populations are 
given in Table 3. 

That is to say, when the man is the defector 


` + in the martyr run, the run ends in a failure 


about 2.5 times more frequently than in a 
Success, regardless of whether the martyr isa 
man or a woman (cf. MM and WM popula- 
tions). When a woman is the defector, failures 


outnimber successes 3.7 to 1 if the martyr is a 
É 


TABLE 2 
INEQUALITIES IN THE FREQUENCIES OF 4, Y, 1—w, 
AND 1—3 
Population 
MM x>l-w>1-z>7 
MW x>i-w>1-z>y 
WM x= (—w)>1-s>y 
ww 1—-w>a4>1-2>9 
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TABLE 3 
VALUES oF M 
Population M i 
MM 24 6 
MW SA. 
WM 2.6 
WW 4.8 


man and 4.8 to 1 if the martyr is a woman 
(cf. MW and WW populations). 

One might take this to be rather dramatic 
evidence of the woman’s lesser propensity to 
cooperate (i.e., to be converted to cooperation 
by example). However, the conclusion is not 
entirely warranted. For a failure of a martyr 
run can be attributed as readily to the fact 
that the martyr has “given up” as to the fact 
that the defector failed to be converted. And 
so we do not at this stage know whether the 
high values of M when the woman is the 
defector and the man the martyr is due to the 
woman’s failure to be converted or to the 
man’s early giving up. To be sure, the differ- 
ence in the values of M between the MM and 
the WW populations tends to put the shadow 
of blame on the woman, but this is contradicted 
by the somewhat higher value of y in the 
women in mixed pairs (cf. y column in Table 1), 
indicating that women are actually somewhat 
more steadfast in the martyrdom when they 
are in that role opposite men. 

In summary, the following results can be 
stated with some confidence. When women or 
men play Prisoner’s Dilemma many times in 
succession with partners of their own sex, 
large differences are observed in the two 
populations. These differences are reflected in 
several indices, and the gross effect is that 
women tend to cooperate less than men. The 
differences do not emerge at the beginning of 
the process. On the first and second plays no 
difference is observed between the two popula- 
tions. Therefore, whatever differences emerge 
must be attributed to the characteristic inter- 
action effects produced by the motivations 
inherent in Prisoner’s Dilemma. 

When men and women play with partners of 
the opposite sex, there is little or no discernible 
difference between them except on one or two 
indices. Prisoner’s Dilemma thus appears as a 
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Fic. 1. Nonoverlapping successive 15-play intervals. (For male pairs and female pairs: 
Subject 1 is indicated by solid line; Subject 2, dashed line. For mixed pairs: man is indicated 
by solid line; woman, by dash-dot line. The differences between Subjects 1 and 2 in the male 
and female pairs are evidently due to statistical fluctuations. In the case of mixed pairs, the 
higher values of C in men seem to be fairly consistent, These are, however, numerically small.) 


2iI-225 241-255 271-285 


great “leveler,” making partners alike. The 
time course of the fractions of cooperative 
responses are shown in Figure 1. 
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NEGRO CHILDREN’S PERFORMANCE ON A DISCRIMI- 
Ge NATION TASK AS A FUNCTION OF EXAMINER 
RACE AND VERBAL INCENTIVE c 


WALLACE A. KENNEDY anp MANUEL VEGA 
Human Development Clinic, Florida State University 


A study to determine the differential effect of Negro and white examiners on 
the performance of 324 rural Negro school children on an oddity discrimina- 
H tion task under the variables of grade level (2, 6, 10), intelligence level (high, 
medium, low), and experimental incentive condition (praise, blame, control), 
Results ¿ndiçated significant effe¢ts from exantiner race, experimental incentive, 
and the interaction between the 2, as well as their combined interaction with 
grade level. It was the Negro Ss’ reaction to blame under Negro examiners 
which differentiated their performance from that of white Ss under white 


examiners, 
e » 

As the pattern of integration unfolds across 
the country, it is apparent that the number of 
Negro students having white teachers, coun- 
selors, and examiners, will greatly increase. 
Katz (1964) has suggested that these changes, 
inherent in the desegregation process, are 
likely to cause detrimental changes in the 
performance of Negro students if they are 
rejected by white classmates or teachers, 
whereas acceptance by white peers and adults 
should “endow scholastic success with high 
incentive value [p, 396].” It would seem 
appropriate, therefore, to gather information 
about the nature of interactions between Ne- 
gro students and white teachers, or examiners. 

Perhaps one of the most relevant variables 
in the educational process is the use of verbal 
praise and blame as incentives for perform- 


‘ „ance and as means of informing students of 


the quality of their work. The Human Devel- 
opment Clinic of Florida State University, 
studying these incentives under various ex- 
perimental conditions, has been interested in 
the performance of Negro children, as well 
as that of white children. Tiber and Kennedy 
(1964), studying the effects of these verbal 
incentives on intelligence test performance of 
different social groups (480 second- and 


_ ` third-graders divided into middle-class white, 


lower-class white, and lower-class Negro), 
found no significant difference between in- 
centive groups, and no significant interaction 
between type of incentive and social group. 

_ Kennedy and Willcutt (1963), investigat- 
ing the effects of praise and blame on a dis- 


crimination task under the variables of grade, 
intelligence, sex, race, social class, school, 
and differences between white examiners (720 
second-, fourth-, seventh-, and tenth-graders 
equally divided between white and Negro 
races), found praise to be more effective for 
both Negro and white children, than was 
blame, which was actually detrimental to 
performance. 

Although Shuey (1958) implies that the 
race of the examiner has no significant influ- 
ence on the performance of Negro subjects, 
other suggestions in the literature (Dreger & 
Miller, 1960; Katz, 1964; Trent, 1954) that 
the race of the examiner might effect Negro 
performance led to the current study, for it 
would seem important for the psychologist 
working in the field to know the nature of the 
effect of his race upon the performance of 
Negro children whom he meets in his daily 
routine. The logical first step to more accu- 
rate interpretation and prediciton of the be- 
havior of Negro children under white exam- 
iners would be to determine whether a sig- 
nificant difference exists in the performance 
of Negro children tested under white and 
Negro examiners. 


METHOD 


Task 

Due to the well-known discrepancy between the 
performance of Negro and white children (Dreger & 
Miller, 1960; Garth, 1931; Kennedy, Van De Riet, & 
White, 1963; Klineberg, 1935; Pintner, 1931; Shuey, 
1958), a discrimination task which all subjects 
could solve in time was chosen as appropriate for 
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the purposes of this study. The task (conceived by 
Turner, 1962, after Chalmers?) consisted of 24 od- 
dity-problem stimulus cards presenting four patterns, 
one of which was different from the other three, at 
eight levels of difficulty involving the distribution of 
32 black squares on a 64-square white checkerboard. 
Each square was divided into four quadrants of 
16 squares each, 8 black. Each quadrant was sub- 
divided into subquadrants of 4 squares each, 2 
black, the placement of which was done randomly. 
The most difficult discrimination was formed by 
moving 1 of the 64 squares from one quadrant to 
another, according to a table of random numbers. 
The second most difficult involved moving 2 squares, 
and so on down to the least difficult problem where 
8 squares were moved. On the easier levels a restric- 
tion of random placement was upheld in order to 
keep a balance between the remaining three quad- 
rants, Thus, when 3 squares were being moved 
from one quadrant, 1 each was assigned to the other 
three quadrants. 

The discrimination task was administered through 
an apparatus designed along with the task. The 
stimulus cards appeared on a viewing screen, below 
which were four momentary switches corresponding 
to the four patterns on the card. The apparatus 
timed, in one hundredths of a second, the period 
between the appearance of the stimulus card on the 
screen and the subject’s pressing of the switch. A 
trial consisted of the same-order presentation of 
all 24 of the stimulus cards to each subject. 

Normative data, using 1,042 subjects, has been 
gathered on this task by four studies under the 
auspices of the Human Development Clinic (Ken- 
nedy, Turner, & Lindner, 1962; Kennedy & Will- 
cutt, 1963; Turner, 1962; Willcutt & Kennedy, 1963), 


Subjects 


Selected from the second-, sixth-, and tenth-grades, 
the 324 Negro subjects were public school pupils 
at the Pine Park and Northside schools in Havana, 
Florida (Gadsden County), Half of them were 
tested by Negro examiners, half by white examiners. 


Materials 


The Short Form A of the 1957 California Test of 
Mental Maturity was used to determine the intel- 
lectual level: high, range = 85-140, M = 89.61, SD = 
9.84; medium, range = 73-84, M = 79.19, SD = 5.77; 
low, range = 47-72, M = 70.60, SD = 6.37. This repre- 
sented a random sample of the Negro children in these 
grades in Gadsden County. 
$ The definition of intelligence poses some problems 
in that obviously the definition at the second-grade 
level is somewhat different from that at the tenth- 
grade level. However, the mean and standard devia- 
tion were virtually identica] at the three grade 
levels: Grade 2, M =81.98, SD = 1.20; Grade 6, 
Hees; SD=1.13; Grade 10, M=78.61, SD 


1E. L. Chalmers, 


EL unpublished data, Florida State 
University. 
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Examiners 


Six examiners, male graduate students 
chology, three white from Florida State U 
and three Negro from Florida A and M U 
trained in the use of the apparatus and 
delivery of the verbal incentives, were 
checked in the field to keep procedure as | 
possible.? 


Procedure 


The subjects, tested individually at their 
were given the first trial on the discrimina 
the experimental incentive cendition, and _ 
trial “on the discrimination“ task. Each { 
worked with the same number of subjects 
of the experimental incentive treatment 
praise, blame, control, 

All subjects were read the following 


This game is to see if you can tell 
ence between designs. When I turn on 
you will see four designs there in front 
[Examiner shows subject where to loo 
are to find the one that is not the same 
others. Three of the designs will be he 
but one will be different. As soon as yi 
the one that is different, press the swil 
underneath it. For example, if the first d 
the left is different from the rest, you p 
first switch on the left. If the second d 
different, press the second switch. And si 
Have you any questions? 


The examiner then demonstrated a sam] 
to make sure that the subject understood the 
switch combination. Although the difficulty 
structions was perhaps too high for second- 
the combination of instructions and dem 
was effective: the error variable was virtu; 
existent. Following the demonstration, 
were continued: 


Now, you try the switches yourself. Pret 
ber one, please, two, three, and four. 
first card is just for practice so you can 
the designs look like. You press the swil 
the design that is different from all 
Ready? [If the subject pressed the co 
the examiner said, “good.”] 3 

All the cards will not be the same, 
problem will be the same. That is, to 
one design that is different from all the } 
the card. Any questions? 

Put both of your hands palm down | 
table in front of the switches. Keep them 
table until you are ready to press the 
not jump ahead. Press the switch as 
you can, but be sure to press the one Un 
design that is different. Ready? 


? Statistical tests of significant differences i 
within-race variations of the six examiners 
nonsignificant differences between same 
aminers, 


Necro’s PERFORMANCE AND EXAMINER’S RACE 


If the subject had further questions, the ap- 
propriate portion of the directions was reread. If 
necessary, a second sample card was used. The 
examiner was as active as necessary to assure that 
the subject fully understood what was required of 
him. 

A trial consisted of 24 stimulus cards randomly 
ordered at each difficulty level, but in sequence as 
far as difficulty was concerned. That is, the easier 
discrimination problems were presented first, build- 
ing up to the more difficult. Or more specifically, 
the subjects started at Difficulty Level 1, worked up 
to Level 8, and then back to Level 1, to repeat the 
patternm@through the 24 cards. 

At the end of Trial 1 reproof was admiristered 
to subjects assigned to thé reproof group as follows: 


Well, that’s it. [Pause] Your scores are not 
nearly as good as I thought they would be. I’m 
really disappoiated. I think these are the worse 
scores I’ve gotten yet. [Pause] But perhaps if we 
ran through the cards again. I'll tell you what. 
Just forget these scores. Let’s go through the 
cards again. Do you understand the directions? 
Let me repeat them once more. 


x, « 

The directions were then reread verbatim and 

the same 24 cards readministered in the same order. 
Subjects in the praised group were told: 


% Well, that’s it. [Pause] Your scores are very 
good. I knew you would do well, but you did 
even better than I expected, I am really pleased. 
These look like the best scores I have gotten yet. 
[Pause] Now, Id like you to go through the 
cards again, To get the time the same, I'll repeat 
the directions, 


Subjects in the control group were asked questions 
of a biographical nature during the interval between 
trials. The time to do this was equated as closely as 
possible with that used to present the verbal in- 
centives, 

At the end of the second trial all subjects were 
asked not to discuss the experiment with anyone. 
Those who served in the reproof group were detained 
and given high praise for their two performances to 
counteract any adverse effect resulting from the 
reproof, 


Criterion Measure 


a 
` The reaction time between the stimulus card 
appearing on the screen and the depression of the 
subject switch was the criterion measure for the 
experiment. The major variance studied was the 
mean reaction time of Trials 1 and 2 and the mean 
reaction time difference between Trials 1 and 2. 

The discrimination task used for this study was 

. such that all subjects, given time, could solve the 
problem. A check was kept on the error responses, 
which were so few as to be only a random effect. 
No correction was made for the few errors which 
appeareds 
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TABLE 1 


SIGNIFICANT F Ratios FOR MEAN SCORE DIFFERENCES 
BETWEEN TRIALS 1 AND 2 


Variable df P o 
Examiner race (A) 1/270 15.90*1* 
Grade level (B) 2/270 2.19 
Intelligence (C) 2/270 S3 
Reward (D) 2/270 9.30** 
AXB 2/270 ‘61 
AXC 2/270 1.61 
AXD 2/270 6.00* 
AXBXC 4/270 64 
AXBXD 4/270 3.06* 
AXCXD 4/270 1.05 
AXBXCXD 8/270 1 
BXC 4/270 2.00 
BXD 4/270 1.50 
BXCXD 8/270 14 
CxD 4/270 1.40 

*p= 
Ap = 01 
wey = 001. 


Statistical Design 


A general linear hypothesis analysis was per- 
formed using as a criterion the difference scores 
(mean reaction time score difference between Trial 
1 and Trial 2). 

This was a 2X3X3X3 analysis of variance 
with 2 examiner races, 3 grade levels, 3 intelligence 
levels, and 3 reward conditions. A difference score was 
used because, although there was no significant 
difference between the means of the 54 cells on 
Trial 1, there was a considerable amount of variabil- 
ity within the cells. 

The reaction time of the subject was the dependent 
measure. That is, it was the time in seconds inter- 
vening between the stimulus onset and the subject’s 
pressing the response switch, The individual re- 
action time for each card was converted into trial 
means for each subject. 

Because the BI-MED 5 program used for the 
statistical analysis would not handle signs, a constant 
(7) was added to the difference score means, How- 
ever, in the table of means the constant value has 
been removed and the signs restored for clarity of 
presentation. 


RESULTS 


Table 1 illustrates the significant F ratios 
for the statistical analysis of the mean dif- 
ference scores between Trials 1 and 2. Sig- 
nificant were examiner race, experimental in- 
centive, and the interaction between the two, 
as well as their combined interaction with 
grade level. 

Illustrated in Table 2 are the significant 
mean differences and their standard devia- 
tions. For examiner race, subjects working 
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TABLE 2 
MEAN AND STANDARD DEVIATION FOR’ MEAN SCORE DIFFERENCES BETWEEN TRIALS 1 AND 2 
By Examiner Race (A), GRADE LEVEL (B), AND EXPERIMENTAL INCENTIVE (D) $ 
— 
a Praise Blame Control Total 
AXBXD 
: M SD M SD M SD M sD 
Whit 
2 ; 1.63 1.58 —1.21 2.13 1.07 2.06 
6 76 1,57 .09 1.75 87 1.05 
10 1,73 93 — 47 1,75 1.54 78 
Total 1.37 1.46 = 53 1.96 1.16 1,44 .67 1.85 
Negro $ 4 
2 1.59 1.62 99 5 3.70 1.35 1.40 £ 
6 82 1.74 77 1,24 1.86 1.76 - 
10 1.87 96 1.43 1,58 1,33 1.10 
Total 1.43 £55 1.06 2.45 1.51 1.46 1.33 1.88 
Grand total 1.40 6.66 27 2.35 1.33 1.46 1.00 


under white examiners showed an improve- 
ment of .67 from Trial 1 to Trial 2, while 
subjects under Negro examiners improved 
their overall performance on Trial 2 by 1.33, 

Praise and control resulted in a greater de- 
crease in speed of performance for all sub- 
jects, than did blame (praise, 1.40; control, 
1.33; blame, .27). 

Working under white examiners, subjects 
reacted to praise and control with improve- 
ments of 1.37 and 1.16, respectively, while 
they suffered a decrement of —.53 under 


TAB 
Newman-Kevts SEQUENTIAL RANGE TES 


BETWEEN Groups as DIVIDED BY EXAMINER RACE, EXPERIMENTAL 
INCENTIVE, AND GRADE LEVEL 


white-examiner blame, On the contrary, sub- 
jects working under Negro examiners reacted 
to blame with a 1.06 improvement in score 
(praise, 1.43; control, 1:51). In other wards, 
all Negro subjects reacted to praise and con- < 
trol with decreased reaction time on Trial 2 
with no difference in response according to 
examiner race. However, reaction to blanté 
was different under white and Negro exam- 
iners, with a resulting decrement in perform- 
ance under white examiners and in improve- 
ment under Negro examiners. 


LE 3 
T SIGNIFICANT MEAN SCORE DIFFERENCES 


$ $, Thit lame/10 

Ha by tae White/blame/2 White/blame/6 White/blame/! 

M SD M SD M sD M pe 
Negro/praise/2 1:59: | 162 | Laat | 213 =47 1.75 
Negro/praise/6 821 174 E E | O43 oi ve) le | a 
Negro/praise/10 187 | 96 | <4 | 23 09 175 | —47 | 175 
Negro/blame/2 SOES S |. Sar | Ns j -47 | 175 
Negro/blame/6 GEA | TRAE 213 —47 | 18 
Negro/blame/10 143: | 158 | -12 | 213 09 1.75 —47 1.75 
Negro/control/2 135} 140 | -121 | ZB ‘09 175 —47 1/5 
Negro/control/6 186) 176 | -12 | Z3 ‘09 1.75 — 47 1.75 
Mee moto ass ato | Laan | 38 ‘09 175 | —47 | 178. 
White/praise/2 163 | 9158 | 121 | 3415 09 175°!" 47> | ate 
White/praise/6 PORES To | Saar «| Saaie] bah =ar" | 48 
White praise/10 173} "93 | “a3 | 3 09 LISA | 147 ont 
White/blame/2 121 | 213 =1.21 | 213 09 1.75 
White/blame/6 OOM TEAS tor |. o'ts 09 1:75 
White/blame/10 | —'47 | 173 : J —47 | 145 
White/control/2 107 | 206°) 1.91 | (943 —47 | 175 
White/control/6 87°) 9105 | <1 | 23 -47 | 175 
Walieeateel/io |, 154 Ma | Dia | 2i 09 es A7 pe 


Necro’s PERFORMANCE AND EXAMINER’s RACE 


A Newman-Keuls sequential range test run 
on the data for the interaction between exam- 
inet race and verbal incentive indicated that 
subjects working under white examiners in 
the blame group reacted significantly differ- 
ently from every other experimental group. 
No other groups differed significantly from 
each other. 

The significant results of the Newman- 
Keuls sequential range test performed on the 
signifigant < interaction data between examiner 
race, verbal incentive, and grade level are 
illustrated i in Tablé3, It is only at the sixth- 
grade level that blame administered by white 
examiners does not consistently produce mean 
score differences, between Trials 1 and 2 which 
are sfgnificantly different from other experi- 
mental group mean score differences. 


Discussion 


It is apparent that the only significant dif- 
ference between Negro subjects working under 
white and Negro examiners is the differential 
reaction to blame, Under white examiners 
Negro students react to blame in the same 
manner as white students under white exam- 
iners: with a decrement in performance (as 
demonstrated in Kennedy & Willcutt, 1963, 
pp. 43-44), Under Negro examiners Negro 
students react to blame with an increment in 
performance similar to that under praise and 
no incentive, Thus the fact that Negro stu- 
dents showed a greater improvement in per- 
formance under Negro examiners is directly 
related to the detrimental effect of blame from 


- white examiners, 


It appears then, that the verbal incentive, 
blame, will have an increasingly detrimental 
effect on simple learning tasks as Negro chil- 
dren encounter white examiners, teachers, and 
administrators,  * 
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PARENTS AS DISPENSERS OF AVERSIVE STIMULI? 


G. R. PATTERSON 
Child Study Center, University of Oregon 


60 families participated in a laboratory procedure designed to test the effect 
of the parent in dispensing social disapproval. A child and 1 parent from 
each family participated in an operant conditioning situation in which the 
parent dispensed social disapproval contingent upon a simple motor response of 
the child. The data show that sex of child, age of child, and sex of parents 
are not significantly related to the magnitude of the child’s responsiveness to 
parental disapproval. Data from an interview with the mothers of these 
families show that the punitive, restrictive homes were associated with re- 
sponsiveness to disapproval in béys. Nonpunitive homes were associated with 
responsiveness in girls. Fathers who received high scores on the Manifest 
Anxiety scale and low scores on the MMPI K scale were associated with boys 


and girls who were most responsive to paternal disapproval, Boys responsive 
to parental disapproval were described by parents and by teachers as being 
inefficient in the classroom and as “immature.” Girls most responsive to 
paternal disapproval were described by parents and teachers as being in- 
efficient in the classroom, tense, and angry, 


There is extensive experimental literature 
which shows that aversive social stimuli are 
more effective than positive social reinforcers 
in controlling the behavior of both adults 
(Ferguson & Buss, 1959) and children 
(Brackbill & O’Hara, 1958: Meyer & Seid- 
man, 1961; Starr & Patterson ?). These find- 
ings taken in conjunction with the fact that 
aversive stimuli are rather frequent occur- 
rences in the life of the child (Fawl, 1963; 
Simmons & Schoggin, 1963) would imply that 
the child’s responsiveness to aversive stimuli 
is a variable which is crucial to the socializa- 
tion process, 

Tt is assumed here that the child who has 
been repeatedly exposed to aversive stimuli 
will be more responsive to social disapproval 
than will the child who has been exposed to 
a lesser degree. More specifically, children 
from punitive-restrictiye homes will be more 
responsive to social disapproval than will chil- 
dren from nonpunitive homes, This hypothe- 
sis will be tested by relating current parental 
practices in the home to a laboratory task 


1This study was supported by Grant M-5429 
from the United States Public Health Service. The 
writer gratefully acknowledges the assistance of 
D. Anderson and H. Hawkins in collecting the 
data, and to Beverly Fagot for her assistance in the 
data analysis. 

Use of Criticism and Praise in Operant Con- 
ditioning of a Simple Motor Response,” under- 
graduate National Science Foundation Paper, 1961. 
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assessing the responsiveness of the child to 
social disapproval dispensed by the parent. 
A pilot study by Williams * showed that 


(neutral) adults with high anxiety scores on | 


the MMPI tended to be more effective with 
children in dispensing both positive and avet- 
sive social stimuli. This leads to the second 
hypothesis that parents describing themselves 
as anxious on the MMPI would be more ef 
fective than nonanxious parents in dispensing 
aversive stimuli. 

The research by Kagan (1956) and Em- 
merich (1961) would suggest that the father 
is seen by the child as being more punitive 
and fear arousing than is the mother. These 


findings lead to the third hypothesis that ` 


fathers will be more effective in dispensing 
social disapproval than will mothers. 

The review by Church (1963) of the effect 
of punishment upon animal behavior strongly 
Suggests a dichotomy of effects. The research 
literature shows that in addition to thé gen- 


eral effect of punishment in suppressiag 1 


sponding, there may also be an emotiona 
consequence attached to the use of punish- 
ment. These findings lead to the very tenta- 
tive hypothesis that the same parental puni- 
tiveness which results in a child’s being overly 
responsive to disapproval will also be relate 


3 Unpublished manuscript entitled “Personality 


Characteristics of the Adult in Operant Conditioning 
of Children,” 1962, 


. tracting 


PARENTAL DISAPPROVAL 


to the child being described as “overly emo- 
tional.” More specifically, it was predicted 
that the child found to be most responsive to 
parental disapproval in the laboratory would 
also be described by the teachers as being 
poorly adjusted in the classroom. 

The apparatus used in this study was an 
adaptation of the operant conditioning device 
described by Gewirtz and Baer (1958). Par- 
ental disapproval is made contingent upon 
one of the responses available in a simple 
two-choice game played by the child.: The 
magnitude of the child's change in preference 
for these two choices constitutes a measure of 
his responsiveness to parental disapproval. 
Following the conditioning trial, the mother 
was interviewed to assess current child-rearing 
practices. The teachers were also asked to 
provide ratings of the child’s current class- 
room adjustment behaviors. 


© 


PROCEDURE 


c 


Sample 


&ixty families volunteered to participate in the 
study; each family contributed one parent and one 
child for a conditioning trial. The 30 boys and 30 
girls were taken in approximately equal numbers from 
the second, third, and fourth grades of the local 
public schools. 

Data obtained from the interview with the mother 
showed that the sample was drawn largely from 
the middle class. Only those families were used in 
the study in which both the mother and father could 
be present for the conditioning trial. A table of 
random numbers was used to determine which of 
the parents would serve as social agent in the 
conditioning trial. 


Apparatus 


To facilitate the cooperation of the families, the 
equipment was mounted in a 15-foot house trailer 
which was taken to the home. A soundproof parti- 
tion separated the electronic equipment and the 
experimenters from the subjects. To reduce dis- 
sounds, the recording equipment was 
mounted in insulated cabinets. 

The apparatus was an adaptation of the marble 
box described by Gewirtz and Baer (1958). The box 
was approximately 8 inches high and 18 X 10 inches 


° wide. The two holes at the top were .75 inch in 


diameter and placed 1 inch apart. In a tray at the 
base of the box were 250 blue glass marbles. The 
‘box was located on a level platform adjusted so that 
the base was approximately at waist level for the 
subject. The subject faced the reinforcing agent 
who sat on a bench 3 feet to the front and right 
of the subject. The apparatus was automated to the 
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extent that frequency of response to either hole 
was recorded along a time line on a multiple channel 
event recorder. A programer was used to signal the 
reinforcing agent through an earphone. Upon re- 
ceiving a signal, the reinforcing agent read a state- 
ment from a list given to him by the experimenter. 
The statements were: not too good, huh-uh (with 
head movement), not yet, no that’s not right, nope, 
guess not, huh-uh, wrong. 


Procedure 


Parents who consented to participate in the study 
were contacted about a week in advance and ar- 
rangements were made for the laboratory to be 
brought to their home. The majority of the ap- 
pointments were on Saturday mornings and after- 
noons, the remainder being scheduled between 7:00 
and 8:30 weekday evenings. At the appointed time, 
the trailer was brought to the home and approxi- 
mately 10 minutes were taken to introduce the 
research team (at least two experimenters) and to 
set up the apparatus. Invariably, the staff were well 
received by the family, 

One of the parents (randomly assigned) was taken 
into the trailer first and given the following in- 
structions: 


Your child will be playing a simple game of 
dropping marbles into this box. We would like 
you to be in this room with him [or her] part 
of the time, and listen for signals that we will 
send you over this earphone. [The experimenter 
demonstrates how earphone is worn, then sends a 
few signals to test apparatus and make sure parent 
can hear the signal clearly.] Whenever you hear 
a signal, we would like you to make one of the 
following remarks from this sheet. [Hands parent 
sheet.] Use only these words, and do not say them 
when you do not hear a signal over the earphone. 
OK, if we could practice that now a couple of 
times. 

The experimenter then dropped some marbles and 
checked to make sure that the parent’s discrimina- 
tion of signals and his behavior were in accord with 
the instructions. The parent was then cautioned not 
to give his child any other information during the 
experimental session other than to speak the ap- 
propriate comments, The parent was ushered out and 
the child was brought into the laboratory and 
given the following instructions: 


Would you stand here behind this blue box. 
OK, now here you see a whole bunch of marbles 
and up here are two holes. What I want you to 
do is to pick up the marbles, one at a time, and 
drop them into the holes, You can drop as many 
as you like, in any order, into either hole [experi- 
menter demonstrates]. Just use one hand at a 
time, and remember not to pick up more than one 
at a time. Would you like to try a few? OK. Now 
you can wait until I give you the signal to start 
by tapping on this window from outside. Then 
just keep on dropping them one at a time until I 


tell you to stop. 
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The first 100 responses constituted the base oper- 
ant level. The frequency of responses was tabulated 
on both an event recorder and cumulatively on 
two electric counters. The latter could quickly be 
scanned by the experimenter. When the counters 
showed a total of 100 responses, the experimenter 
identified” the hole which was most preferred; he 
then activated the programer so that each time a 
response to that hole occurred the parent would 
receive a click in his earphone. A 1:1 schedule of 
aversive stimuli was maintained for 10 responses 
to the preferred hole. Following this, the child re- 
ceived 20 aversive stimuli on a 4:11 variable ratio 
schedule. 

At the end of the trial, the parent was given a 
note instructing him to dispense a positive reinforcer 
each time he heard a click in the earphone. Although 
the data from this phase of the trial were not used 
in the analysis, the procedure was introduced to 
counteract the effect of a long period of disapproval 
by the parent. 


Dependent Variables 


The data were tabulated for the frequency of 
occurrence of responses A and B for each 15-second 
interval through the experiment. The measure of 
preference change was obtained by computing X/ 
A+B for each 15-second interval where X was the 
frequency of the least preferred response. The median 
of this relative frequency score was calculated for 
both the base operant phase and the conditioning 
phase of the experiment. The difference between 
these two medians provided a measure of the magni- 
tude of shift in preference. From previous studies, 
the test-retest reliability of this score was 75, 
Patterson and Hinsey (1964). 

Previous studies have also shown that the magni- 
tude of the difference score is significantly determined 
by the magnitude of the position preference shown 
during the base operant period, Patterson and Hinsey 
(1964). In the present study the correlation between 
position preference during the base operant period 
and the magnitude of preference change was .50 
(P<.001). This would indicate that it is necessary 
to partial out the effect of base operant position 
preference before subjecting the data to intensive 
analysis. To meet this problem, data have been 
analyzed for several hundred children who have par- 
ticipated in conditioning trials with a variety of 
social agents using both Positive and aversive social 
stimuli, Frequency distributions of preference change 
Scores were calculated for three base operant levels 
of position preferences, Deviation scores were com- 
puted for each of these distributions; the deviation 
scores were then converted to T scores by using 
McCall’s T score transformation, The correlation 
between base operant position preference and the 


z Ane for preference change was not significant 
hl). 


Interview with Mother 


Tape recordings were made of a 30-minute struc- 
tured interview with the mother, The mimeographed 
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form used by the interviewer contained both the 
questions and the rating scales; the latter were filled 
out by the interviewer as soon as he felt he dad 
obtained enough information to make a judgment, 
The tape recordings of one half the families were 
listened to by a second experimenter, who made 
independent judgments on each of the 33 scales, The 
reliabilities for the individual items will be presented — 
in a later section. The data specific to each item d 
ranged from .18 to .91 with a median correlation 
of .74. 
The questions and ratings scales were based par- | 
tially upon a procedure developed by Patterson, f 
Littman, and Hinsey (1964). Many of the® original 
questions and scales were baséd upon material pub- 
lished by Bandura and Huston, 1961 4; Becker, 1960} 
Roff, 1949; Sears, Maccoby, and Levin, 1957. Al 
though these latter studies do not show factor iden: 
tity, they all identify the following two bipolar ~ 
factors: warm versus hostile and permissive versus; 
restrictiveness. Approximately one half of the items 
used in the present interview were taken from these 
studies, Ì 
Each parent who served as the social agent in the | 
conditioning trial filled out the items of the Z and 
K scales of the MMPI in addition to the» items 
forming the Taylor Manifest Anxiety (MA) scale. | 


Interview with the Child 


Following the conditioning trial, the child was 
interviewed to determine his awareness of the various © 
contingencies. The interview first identified whether | 
the child was attending to the behavior of the 
parent, or to changes in his own behavior. Finally, = 
he was told that dropping a marble into one of the 
holes elicited the comments from the parent; the 
child was asked to guess which hole it was. The 
interview and rating scale for awareness were dé 
scribed in detail elsewhere and will not be reproduced — 
here, Patterson et al. (1964). © ' 

In previous studies with the present operant pro- — 
cedure and its very complex ratio schedules, there 
have been no significant correlations between ratings a 
of awareness and changes in preferences. In the pres: — 
ent study the correlations for the subsamples range 
from .03 to .40, none of which were significant. 

Each child was also interviewed with a series © 
questions designed by Kagan (1956) to determine 
the child’s perception of parental roles in the famili 
These questions were: If you were in an argument 
at home with your mother and father, who, wous 
be on your side, your mother or your father? Lets | 
make believe you were bad and your mother A 
father were both home. Who would punish a 
your mother or your father? Who is the boss a 
your house, your mother or your father? Who k j 
you scared of more, your mother or your ite 
For the correlation analysis reported in a la 3 
section of the paper, the answer to any given que — 


*“Tdentification as a Process of Incidental lan 
ing,” progress report to the National Tostitute f 
Mental Health, United States Public Health Servi 
1961, 


a 
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tion was scored as 1 if the child replied “father” 
and scored 2 if the child replied “mother.” 


c 


Teachers’ Ratings 


After an interval of from 3 weeks to 4 months 
following the conditioning trial, each child was rated 
by his classroom teacher. The 47 bipolar items used 
in the rating schedule were taken from a schedule 
developed by Becker (1960). Becker’s pool of items 
were intended to sample the factor domain outlined 
by Cattell. The items were designed to be appro- 
priate to the behavior of nursery school children. 
Becker Gerived six factors by a centroid factor analy- 
sis of a large samplecof teachers’ ratings using this 
scale. The items in the present study sample the 
variables with the highest loadings on each of these 
six factors. 

6 ” RESULTS 

Parents dispensing disapproval were highly 
effective in changing the child’s choice be- 
havior. The total number of responses occur- 
ring in a 15-second interval ranged from 4 to 
8. The mean increase in the least preferred 
response was 15.5% of the response occurring 
in a 15-second interval. The ¢ test of these 
ditference scores was significant, p < .001. Al- 
though the studies are not directly compara- 
ble, it is of interest to note in passing that the 
parents dispensing disapproval were more ef- 
fective in changing preference behavior than 
were parents dispensing social reinforcers 
(M = .070) or peers dispensing social re- 
inforcers (M = .098) (Patterson & Anderson, 
1964; Patterson et al., 1964). 


Preference Change Controlling for Sex and 


` Age of Child and Sex of Parent 


The mean T score for preference change 
and the sample size of the various subgroups 
are presented in Table 1. 

The analysis of variance followed a 2 X 2 
3 design controlling for sex of the child, sex 


` ofthe parent, and age of the child. The analy- 


sis of variance and correlation for unequal cell 
frequencies was based upon the procedures 


_ Outlined by Winer (1962). Neither the analy- 


sis of main effects nor interactions were sig- 
nificant. Although, for most of the subgroups, 
fathers tended to be more effective than moth- 
ers, the effects are not significant. The lack of 
significance for main effects for sex of child 
and sex of parent were also found in the study 
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TABLE 1 
Mean T Scores AND SAMPLE SIZE 


ENN A Sos Age of child 
jects |forced by: 
N/7/|N]} 8 |N |910| m 
Boys | Fathers| 4 |55.1) 5 |58.0) 7 |49.8|53.6 
Mothers| 4 |48.7| 4 |46.5) 6 | 47.5/47.6 
Girls | Fathers | 4 |63.2| 4 |44.8] 4 |51.1/53.0 
Mothers} 5 |51.7| 7 |56.2| 6 | 51.0}53.2 
M 54.5 52.4 49.7 
Total | 17 20 23 


by “Patterson et al. (1964). In this latter 
study, parents dispensed positive social 
reinforcers. 


Parental Practices Correlated with Respon- 
siveness 


Earlier in the discussion, it was hypothe- 
sized that the responsiveness of the child to 
social disapproval would be correlated with 
use of punitive-restrictive parental practices 
in the home. To test this assumption, the 33 
rating scales from the interview were corre- 
lated with the preference T' scores. To mini- 
mize confounding effects, the data are pre- 
sented separately for boys-fathers, boys-moth- 
ers, girls-fathers, and girls-mothers. Only 
variables with a Pearson product-moment 
correlation significant at < .10 are presented 
in Table 2. The reliability correlations repre- 
sent the agreement between judges in filling 
out the rating scales, 

The data for boys supported the hypothesis 
that responsiveness to parental disapproval 
will be associated with homes described as 
restrictive and punitive. 

Boys were most responsive to fathers who 
described themselves as being rather tense 
individuals. These fathers tended to support 
the boy during periods of family discord. In 
these homes, although deprivation of privi- 
leges was not used with any great frequency, 
the mother tended to explode frequently as 
a technique for controlling the child and both 
parents made intensive use of reasoning. The 
boys themselves were described as seeking 
attention and rewards and affection; the cor- 
relations for these last two variables were not 
significant, however. 

The significant correlations relating to boys’ 
responsiveness to their mothers are so limited 
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TABLE 2 
PARENTAL PRACTICES CORRELATED WITH PREFERENCE T Scores f 
å d 
Samples 
> 
Reliability Interview variable Boys Girls 
Fathers | Mothers Fathers. | Mothers 
Parental practices 
80 Frequency mother explodes 64" 16 — 46 - 43 
74 Use deprivation of privileges —.60** | —.34 —.18 01 
38 Restrictions outside of the home 09 52 36 A aM 
.91 Frequency parent spanks, slaps 13 58% | —.40 a AL 
+63 Use of reasonin, „573 18 38 -12 
90 Frequency scolding, nagging f ! 03 39 —.49* = 05, 
4 Mother tends to give socially desirable responses in 
interview —.61** 04 — 37 23 
In argument, mother is on child’s side" —.52* — Al 75%" | —OL 
Child behaviors A 
-88 Attention seeking 259 26 =.10 = K 
„81 Seeks tangible rewards 35 12 584 ET. 
85 Seeks affection 44 — 36 .08 
Parents’ self-description i 
MMPI, K scale —.36 AS | —.57 | 02 
MMPI, Manifest Anxiety scale 49* | —.02 67 20 


pee data from (Kagan, 1956) interview with the child. 
b<. 


** p <.05. 


in number that this number of correlations 
could have occurred by chance. In general, 
the more frequent the punishments and re- 


TABLE 3 


Boys: PERSONALITY RATINGS CORRELATED WITH 
PREFERENCE T SCORES 


Correla- 
Correlations with |tions with 
teachers’ ratings | parents’ 
ratings 
Personality trait behaviors ST ia 
Punishing agent 
Mothers | Fathers 
(N =14) | (N =12) (N =15) 
Schoolroom intelligence : 
Dull minded-intelligent 32 —.10 —.43* 
Poor memory-good memory | —.40 —.48* —.14 
Meee reeset: i =.23 01 674 
‘eaningless-meaningful —.24 <2 —.52* 
Hostile withdrawn 4 Z 
Interesting-boring -30 —.24 45* 
Relaxed disposition 
Fluctuating-stable —.24 —.61** | —15 
Ce of aggression 
impatient-patient =.32 =; —AS* 
Submission 4 ty oo 
Strong willed-weak willed -20 Al A8* 
Dominant-not dominant —.02 64 
Conduct problems 
Obedient-disobedient 52 | 05 —.04 
Responsible-irresponsible 46* 42 454 
Cooperative-obstructive „5204 -21 -07 
Fasily disciplined-difficult s 
o discipline 64" A —.0: 
Adult like-infantile 42 —.30 Soe 
*p<.10, 
** p <.05. 


[i 
strictions, the more responsive the boys wefé 
to maternal disapproval. i 

The significant correlations relating child- 
rearing practices to girls’ responsiveness to 
fathers’ and mothers’ disapproval were too 
limited in number to permit interpretation. It 
seems quite probable that the variables relat- 
ing to the child’s responsiveness to maternal 
disapproval were not being sampled in the 
present procedure. The fathers of girls most | 
responsive to paternal disapproval described | 
themselves as being rather tense, retiring, 10- |} 
hibited individuals. 


Adjustment Behaviors Associated with ue 


Sponsiveness to Disapproval 


Tt was hypothesized that the child’s resp 
siveness to parental disapproval would be ‘a 
lated to poor adjustment behaviors in 
child. The relevant data for boys are pre- 
sented in Table 3. The correlations are be : 
son product-moment correlations between n 
preference T scores and both the teachers’ an 
the parents’ ratings of personality trait be- 
haviors. Because the ratings from the para 
were collected only during the last half of t36 4 
experiment, the resulting sample size Was 


small to permit analyzing the data separately 


| , 


tense, 
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or mothers and fathers as social agents. 
Combining the data from the two parents 
wills introduce some confounding and lower 
he correlations, Only those items significant 
at p < .10 are presented here. The significant 
variables are grouped according to the factor 
dimensions derived by Becker (1960). 

The number of significant correlations with 
he teachers’ ratings did not exceed chance 
expectations. However, there are a significant 
number of high correlations with the parents’ 
ratings of personality trait behaviors. In addi- 
ion, there is some general agreement between 
teachers and parent ratings in terms of the 
actors within which the significant correla- 
tions are found and the signs of the correla- 
tions. The boys most responsive to parental 
disapproval were described by parents and by 
teachers as manifesting behaviors which 
would be inefficient in the classroom setting, 
for example, dull minded, ineffective, and 
meaningless. The responsive boy was also de- 
scribed by his parents and teachers as being 
irresponsible and infantile. The correlations 
from both the parents’ and the teachers’ rat- 
ings would suggest that the most appropriate 
label for the boy responsive to parental dis- 
approval would be “immature.” The data 
offer rather weak support for the hypothesis 
that boys most responsible to parental dis- 
approval would be characterized as poorly 
adjusted. 

The corresponding data for girls are pre- 
sented in Table 4. 

Only the data for teachers’ ratings corre- 
lated with girls’ responsiveness to fathers pro- 
duced more significant relations than would 


` be expected by chance. 


Tn general, these data support the hypothe- 
sized relation between responsiveness to dis- 
approval and poor adjustment. As in the case 
of boys, both teachers and parents seemed to 
agree in describing girls responsive to parental 


- disapproval as showing inefficient classroom 


«behaviors, for example, dull minded and sub- 
‘Ject to distraction. The girls most responsive 


_ to disapproval by the father are particularly 


described by the teachers as being poorly ad- 
justed. These girls are described as being 
anxious, fearful, demanding, and 


Jealous. «e 
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TABLE 4 


GIRLS: PERSONALITY RATINGS CORRELATED. WITH 
PREFERENCE T SCORES 


; „| Corréia- 
Correlations with [tions with 
teachers’ ratings P parents’ 
ratings 
Personality trait behaviors Punishing agent 
Mothers | Fathers 
(N =14) | (N =12) | (N =15) 
Schoolroom intelligence 
Du’ minded-intelligent —34 —.39 —.49* 
Subject to distraction- 
able to concentrate —.29 —. 74% 12 
Slow-quick 02 21 —.68" 
Bored-interested —.24 48% —.28 
Hostile withdrawn 
Responsive-aloof —,43* 09 20 
Extraverted-introverted —.51%* | 25 204 
Trusting-distrusting 17 52* —16 
Relaxed disposition 
Tense-relaxed —.04 —.48* 11 
Anxious-nonchalant —32 = 57" 00 
Fearful-not fearful —.02 =,65** 00 
Lack of aggression 
Demanding-not demanding | —.10 =.S8"* | —,29 
Jealous-not jealous 35 —.48* —.40 
Submissive 
Noisy-quiet — 56m 08 16 
* p <.10, 
** p <5. 
ED <01 
Discussion 


The data for boys support the hypothesis 
that the punitive-restrictive home is most 
likely to be associated with the child who is 
most responsive to parental disapproval. Pre- 
vious research by Patterson et al. (1964) 
shows that this type of home is also associated 
with boys who are most responsive to social 
reinforcers dispensed by the mother, This 
would suggest that parental practices which 
are punitive and restrictive may be related to 
rather high potential for the control of the 
boy. Taken together the two studies would 
suggest that in such a home the boy would 
be highly responsive to disapproval dispensed 
by either parent and also highly responsive 
to social approval dispensed by the mother. 
If this is true, it would suggest some signifi- 
cant implications for the socialization in the 
child. At the present time a study is being 
initiated in which the boy is reinforced by the 
mother and punished in a second task by the 
father. This study will provide a necessary 
check upon the assumption that the restrictive- 
punitive home has implications for the boys’ 


850 


responsiveness to both social approval and 
disapproval. 

The data for girls do not provide support 
for the hypothesis that parental practices re- 
late to responsiveness to disapproval. The 
correlations between responsiveness to either 
of the parents and parental practices are too 
limited in number to permit any interpreta- 
tion. This would suggest that, in the case of 
girls, the hypothesis should be changed. 

Bronfenbrenner (1960) and Hoffman 
(1960) suggest an alternate hypothesis which 
might relate to responsiveness to disapproval 
by girls. These writers indicate that some 
combination of parental affection and parental 
authority would lead to be the best prediction 
about the child’s responsiveness to social cues. 
In the present case, the sample is too small to 
permit such an analysis. Data from a study 
now underway will, however, provide a test 
of this hypothesis, the prediction being that 
parental restrictiveness will again relate to 
responsiveness in boys but that a combination 
of affection and restrictiveness will be re- 
quired to account for responsiveness in girls. 

The second hypothesis predicted that the 
parent describing himself as anxious and tense 
(MA scale) would be associated with the 
child most responsive to parental disapproval. 
Although the hypothesis was in part based 
upon earlier data by Williams,* it also seemed 
reasonable to suppose that the anxious parent 
would tend to be rather unpredictable in his 
punitive reactions to the child. This should 
lead in turn to his being seen as a more threat- 
ening figure. The data show that the hypothe- 
sis is supported only for fathers. The more 
anxious the father described himself to be, 
the more responsive are both the boys and 
the girls to paternal disapproval. These data 
suggest that it is not the general role played 
by the parent, for example, mother or father, 
which determines the responsiveness of the 
child to disapproval but rather Specific char- 
acteristics of the parent. In the present study, 
anxiety proved to be a significant variable. 
This would suggest that further studies of this 
type include a broader coverage of personality 
characteristics in patients. 

The preliminary impression from the pres- 
ent data is that girls most responsive to pa- 
ternal disapproval tend to be seen as emotion- 
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ally disturbed and inefficient in the classroom — 
setting. Boys who are responsive to parental 
disapproval are described by the parents as 
immature. If these findings are replicated, 
they could be of major significance in expli- 
cating the nature of some cases referred to 
child guidance outpatient settings. For ex- 
ample, research (Levine & Simmons, 1962; | 
Patterson et al., 1964) showed that nonrespon- 
siveness to social approval was associated with 
maladjusted behaviors in children, The pres- 
ent data suggest that many of these children, 
while relatively nonrespdhsive to social ap- 
proval, might be more responsive to social 
disapproval than would normal children. 
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CONFORMITY PROCESS AND PRIOR GROUP SUPPORT * 
E. P, HOLLANDER, JAMES W. JULIAN, anp GORDON A. HAALAND i 
a State University of New York at Buffalo | 


A 2-phase experiment on conformity process was conducted with 76 women 


and 36 men. In same-sex groups, Ss were presented with 1 of 3 conditions of 


prior group support before confronting erroneous judgments apparently given 
by 4 other group members. In the Ist phase, Ss responded to an unambiguous 
stimulus for 20 trials as the 1st person to report in a standard Crutchfield 
situation. The 3 support conditions involved uniform agreement from the 
group on 20 out of 20 trials, 14 out of 20 trials, or 10 out of 20 trials; a © 
control condition had no feedback from others. Ss then shifted to the usual 
last response position for 20 more trials on which it now appeared that the 
other group members were responding directly opposite to accurate perception 
of the same unambiguous stimulus. Initial and sequential conformity were 
found to differ significantly as a function of the 4 treatments, As predicted, 
total conformity and a postinteraction measure of dependence were significantly 
correlated. Sex differences in total conformity were also noted, with the 
women being significantly higher overall. 


The study of conforming behavior involves 
the identification of behavior change where 
more than a single response alternative exists. 
Typically, a “conforming response” matches 
the relevant normative information provided 
by the group while, in the absence of this 
norm, the choice of other alternatives would 
be expected. 

There have been at least three noteworthy 
historical developments in experimentation on 
this phenomenon, Variations on these consti- 
tute a good share of the work done in this 
area today. Sherif (1935) demonstrated nor- 
mative effects on behavior in an ambiguous 
stimulus situation. Asch (1951) initiated the 
use of the unambiguous stimulus with social 
influence pressures produced by a contrived 
group norm at variance with subjects’ per- 
ceptions. Crutchfield (1955) modified and 
mechanized aspects of both approaches with 
the introduction of simultaneously controlled 
feedback to all subjects by means of signal 
lights; he also sought personality correlates 
of yielding under a variety of stimulus con- 
ditions, 

Though these three methods differ in stimu- 
lus type and the extent of the effect produced, 
they all have in common the consideration of 


1 Presented to the 1965 Eastern Psychological As- 
sociation Convention at Atlantic City, New Jersey. 
This study is part of a program of research supported 
under ONR Contract 4679 from the Group Psy- 
chology Branch, Office of Naval Research. 
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the group as an influence source which pre 
scribes norms on which persons might of 
might not depend. They also share the fail- i 
ing, however, of not accounting for sources of 
dependence which grow out of processes of 
interaction, including the significance of py0t 
conditions of exposure to the group. A mort | 
process-oriented approach to conformity em 
phasizes neither structural properties of the | 
situation nor personality dispositions of the 
participants (cf. Hollander, 1958, 1960) 
Rather, it studies conformity as an ongoing 
commerce between parties which these factors | 
can only partially determine. This is in Jine | 
with a number of contemporary movements, | 
including conceptions of social exchangt | 
(Homans, 1958, 1961; Thibaut & Kelley, 
1959) and of reciprocity (Gouldner, 1960). | 
Clearly, a central feature of confor 
resides in the significance persons come | 
attach to others as a relevant group on whit! | 
to depend in solving problems. In terms i 
process, therefore, dependence can be Er 
to factors in interaction producing i i 
effects on behavior (see Hollander & wil ” 
1964; Willis, 1963, 1964; Willis & HO) 
lander, 1964a, 1964b). What is Jearned a 
given influence situation is a weighting f 
feedback from others such as to produce 
greater or lesser dependence upon their ju 
ments as against one’s own impressions. 
Such a process approach is illustrate n 
the experiment of Pepinsky, Hemphill, 


. first of the five positions where, 
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Shevitz (1958), in which they studied social 
behavior as a function of prior conditions 
prevailing between the individual and the 
group. Where an individual was low in the ini- 
tiation of activity within the group they 
found that such responses could be encouraged 
by the group’s positive support of suggestions 
offered by that individual; alternatively, those 
who had been high in initiating activity were 
found to give a far lower level of such re- 
sponse by <a pattern of discouragements from 
the others. In a related way, Bachrach, Cand- 
land, and Gibson (1964) have similarly dem- 
onstrated the effects of “support” by a group 


_ on the verbal conditioning of individuals. The 


significance of york of this kind lies in the 
linkage it affords between social behavior and 
the prior reinforcement experience provided 
by the group. 

A different approach, exemplified in the 
reseaych of Kelman (1950), Mausner (1954), 
and Luchins and Luchins (1961), provides 
feedback to subjects by having the experi- 
menter report their accuracy or correctness in 
advance of being confronted with a conflict- 
ing position from one or more other subjects. 
Generally, it is found that conformity to the 
alternative position is reduced by such prior 
reinforcement or support by authority. While 
this research appears to have a similarity to 
the work just noted, it has quite the opposite 
cast in that it makes for greater independence 
vis-A-vis others in the group and hence less 
conformity. 

Tt seems to us that one meaningful ap- 


. proach which builds upon this earlier work is 


to study the dependence on the group arising 
from immediately preceding encounters, In 
pagticular, our focus is upon prior group 
support as a source of “reinforcement of ex- 
pectancy” leading to later elicitations of con- 
formity behavior in that group. Consequently, 


` in’ the present study we employ the standard 


Crutchfield apparatus with the essential dif- 
ference that subjects initially respond in the 
to varying 
degrees, they appear to receive uniform sup- 
port or agreement from the others. Only after 
this initial background experience of receiv- 
ing some level of group agreement are the 
subjects then shifted to the usual last posi- 
tion. There other members of the group ap- 
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pear to disagree with the accurate response to 
the unambiguous stimulus event. Conformity 
is inferred from those instances where the 
erroneous group report is given by a subject 
as his own response. A separate measure of 
perceived dependence is obtained from the 
postinteraction questionnaire. 


PROBLEM 


Within a learning framework, conforming 
responses can be considered to reflect a de- 
pendence upon the group, varying as a func- 
tion of the previous levels of support the 
group provides. We therefore hypothesize that 
conforming behavior after a shift to the last 
response position will be significantly related 
to the prior agreement or support from the 
group. We look upon group support as a re- 
inforcer of the implicit expectancy that one’s 
judgment of an unambiguous stimulus is 
shared by others. We use the term “expec- 
tancy” in the sense conveyed by Rotter 
(1954) in his model of social learning. 

Four patterns of group support were chosen 
for the present study. An initial condition of 
100% support or agreement was predicted to 
have the most pronounced effect in producing 
dependence upon the group, and that therefore 
it would yield the highest initial conformity 
response; furthermore, learning theory would 
lead us to predict the greatest dropoff over 
trials in the absence of any continuing con- 
firmation of a high expectancy. Two variable 
levels of group support (a 70% and a 50% 
condition) should produce the effects associ- 
ated with intermittent reinforcement under 
conditions of “extinction,” that is, nonconfir- 
mation of a lower expectancy. Such effects in- 
clude less dependence on the group and lower 
initial conformity, yet a greater persistence 
of the conformity response. Finally, the con- 
trol condition, which provides no feedback of 
the group’s response in the first trials, should 
produce little or no dependence and there- 
fore less conformity in the last set of trials. 


METHOD 
Equipment 
The equipment consisted of the five panels of 
signaling lights and switches. Each panel presented 
15 lights, arrayed in three columns of five lights 
each. In addition, a mercury switch was mounted 
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beneath each column of signaling lights. These 
switches corresponded with one of three stimulus 
lights mounted on the wall. These stimulus lights 
were controlled by two Hunter timers; one light 
went off first and the other two simultaneously .05 
second later, The sequence of extinguishing was 
randomly determined. Three blue candelabra screw- 
mount lights were used. Previous pilot work indi- 
cated this to be an unambiguous stimulus insofar as 
the correct order of extinguishing was identifiable 
95% of the time. 


Subjects and Procedure 


The subjects were 76 females and 36 males drawn 
from lower level psychology courses at the State 
University of New York at Buffalo. Many weeks 
prior to the experiment itself, these students had 
been administered the social desirability scale, as 
modified by Ford (1964). With no reference to 
that administration, subjects were later recruited 
from courses for the experiment. They reported to 
the laboratory in groups of four or five, and each 
was escorted to one of the signaling panels which 
were arranged in cubicles to shield the subjects from 
one another’s view. There they found instructions 
before them, which were also read aloud, as follows: 


This is an experiment on perceptual ability. The 
task is relatively simple, requiring only that you 
judge which of the three blue stimulus lights on 
the wall goes off first. This experiment is supported 
in part by a Navy grant and is thus interested in 
the ability to coordinate various colors of lights 
with relatively complex readings of panel lights. 
This is the reason for the extensive equipment, 

In front of you there is a panel of red lights. 
These show which blue stimulus light people judge 
to go off first. This permits you to see how your 
judgment matches the judgment made by others, 
as the experiment proceeds. 


_ Each subject was told that the bottom row of 
lights would show his response and the other four 
rows would show the responses of the other sub- 
jects. The instructions also went on to indicate that 
they were to close the mercury switch which cor- 


TABLE 1 


RESPONSES TO THE Quzstion “In Waca Pos; 
Was Ir Easier ror You TO MAKE Yooei 
Jupcments?” For PRIOR SUPPORT 


CONDITIONS 
na er a e 
Position 
Prior support 
condition ~ | No response 
First Second 
100 T pote pe 
i 22 5 5 
50% 16 5 7 
0-control 10 9 9 
Total 71 21 20 
a re eet 
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responded to their judgment of the first light to go 
off. For the initial 20 trials, all subjects responded in 
what they believed to be the first position after 
which they all shifted positions and this time each 
believed that he or she was responding in the fifth | 
position, that is, last. They were also instructed to ,| 
maintain the proper sequence of responding, that is, 
first position answers first, second next, and so forth, 


Design 


During the first 20 trials the independent variable 
took the form of the number of times the subject 
was led to believe the group uniformly: agreed with 
his own response. There weresfour conditions: 100% 
support (all 20 trials), 79% Support (14 randomized 
trials), 50% support (10 randomized trials), and a 
control condition involving no feedback of the 
group’s responses. Nine males and 19 females were* 
run for each condition, always in same-sex groups. 
For all of the last 20 trials, when they were in the 
last position, subjects received information appearing 
to indicate a uniform group response opposite to the 
accurate perception of the stimulus. A willingness to 
accept this manifestly incorrect group response as 
one’s own was taken as the behavioral measure of 
conformity on each trial. A postexperimental ques- 
tionnaire was employed to supplement this measure 
of conformity for the last 20 trials. Responses to this 
questionnaire were checked for any indications of 
suspiciousness by the subjects regarding the flue 
intent of the experiment; no evidence of awareness 
in this regard was found. This was corroborated by 
interviews. 


RESULTS 
Induction of Effects 


As expected, the postinteraction question- 
naire reflected important differences in the 
effects of the prior support conditions. Re ' 
vealing this, Table 1 demonstrates that the | 
treatments produced varying responses te 
garding the position in which subjects re- 
ported they found it easier to make their 
judgments. A chi-square comparing the first 
two columns in this table yields a value of 
10.42 (p < .01). Clearly, the subjects evi- 
denced a marked tendency to report the first ” 
position as the easier of the two for the three 
experimental conditions of support; for the 
control condition, however, a roughly equa 
split is found between the positions. This pat 
tern was unaltered when the sexes were com- 
pared. Looking at the total frequencies with 
which the positions were chosen as easier, 
distribution of choices for the 100% and fot 
the 70% support conditions was significan 
different (p < .02) from the control:conditio. 


s 


CONFORMITY PROCESS 


TABLE 2 


ANALYSIS OF VARIANCE OF MEAN CONFORMITY 
a FOR FEMALES 


Source df SS MS E 

Support conditions (A) a 1.251 | .417 | 18.13* 
Trial blocks (B) 3 365 | .122 

Linear component 1 .262 | .262 | 11.39* 

Quadratic component il 050 | .050 | 2.17 

Residual 1 .053 | .053 
AXB 9 167 | .018 

Linear component 3 O71 | 024 | 1.04 

Quadratic gomponent 3 .027 | .009 39 

Residual (error) Oe: 070 | .023 

2 . 


* p <.05. 


In general, the magnitude of the conformity 
response was in, keeping with past research 
utilizing subjects in anonymous circumstances 
(cf. Deutsch & Gerard, 1955) and supported 
the inference that little suspicion had been 
aroused. Sex differences in the amount of total 
confogmity also were found when a median 
test was applied to the total conformity scores 
for the two sexes. The chi-square value of 
6.81 was significant ( < .01) for df= 1. 
Aécordingly, analyses of variance of the con- 
formity response measure were performed 
separately for the two sexes. Because of the 
nonnormal nature of its distribution, the cell 
means were used as synthetic cases in these 
analyses. Such a procedure, though drastically 
reducing the degrees of freedom, permitted 
application of a parametric model to the con- 
formity data.2 Tables 2 and 3 present the 
summaries of these analyses for females and 
males, respectively, As predicted, the prior 
support conditions did yield significant differ- 
ences in the total number of conforming re- 
sponses elicited. This was true for both sexes. 


Sequential Conformity 


Of central interest is the sequence of con- 
formity responses following the conditions of 
prior support or “reinforcement.” In Figures 
1 and 2 these differences by conditions are 
shown for female and male subjects, respec- 


2 This trend analysis (see Winer, 1962, pp. 353- 
369) entailed the necessary assumption that the 
cubic component of the residual sum of squares was 
zero. Although such an assumption could be un- 
founded in the analysis of the data for females, if it 
were to be so the residual component would act to 
provide an even more conservative test of the null 
hypothesis” 
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tively. In keeping with our prediction, the 
100% support condition yielded the highest 
initial level of conforming behavior for both 
sexes. Intermittent group support produced 
more complex effects but these were generally 
in harmony with the anticipated persistence 
of conforming behavior revealed in the inclina- 
tion found in both the 50% and 70% condi- 
tions for each sex. The trend analyses, indi- 
cated in Tables 2 and 3, showed the statistical 
significance of some of these features. For 
each of the sexes there was a clear negative 
linear trend across blocks of conformity trials. 
In addition, the significant interaction compo- 
nent for males indicated that the simple trends 
across trials differed for the different condi- 
tions of group support. The pattern of trends 
for females, however, can apparently be well 
described by the overall linear component. 
Thus, for the men and women, analyzed 
separately, a clear shift is evident in con- 
forming from the first to the second trial 
blocks, Similarly, the “extinction” process is 
evident for each of the sexes in the linear de- 
cline in response for the 100% support 
condition. 


Relation between Conformity Response and 
Perceived Dependence 


A measure of dependence was developed 
from two questions on the postinteraction 
questionnaire. These called for answers indi- 
cating to what degree the subjects found the 
judgments of other members useful or hinder- 
ing in making their own responses, Summing 
these led to a simple index of the degree of 
perceived dependence-independence from the 


TABLE 3 
ANALYSIS OF VARIANCE OF MEAN CONFORMITY 
FOR MALES 
Source df | SS MS F 
Support conditions (A) 3 | .6722 | .2240 | 149.33 
Taal blocks (B) 3 | .7361| .2454 
Linear component 1 | .4320| .4320 | 288.00 
Quadratic component 1 | 2782 | .2782 | 185.47 
Residual 1 | 0257 | .0257 
AXB 9 | 2786} .0310 
Linear component 3 | 2170) .0723 | 48.20** 
Quadratic component 3 | .0570| .0190 | 12.66* 
Residual (error) 3 | .0045| .0015 


* p <.05. 
apco. 
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Fic. 1. Sequential conformity of female subjects by prior support conditions (W = 19 
each), 


group. A  tetrachoric correlation analysis, 
based on roughly equalizing splits for this 
measure and for the total number of conform- 
ity responses in the last 20 trials, yielded a 
value of .46 (p< .01) each for females and 
males, independently calculated. Leaving out 
the control condition, on the expectation that 


1.40; 


MEAN CONFORMITY PER BLOCK OF FIVE TRIALS EACH 
ip a a œ 
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TRIALS 


TI 
(21-25) RAS 


(26-30) 


. q 
it should not yield dependence in association j 
with conformity, the recalculated values were 
raised to .53 for females and .67 for males. 
Thus, the omission of the control condition 
has the effect of raising the correlations for 
the experimental treatments where support 1s 
provided. In every instance, these correlations 
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Fic. 2. Sequential conformity of male subjects by prior support conditions (N = 9 each). 
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reveal a significant relationship between con- 
formity behavior and the postinteraction 
measure of dependence. 


Conformity Response and Social Desirability 


Previous work has shown that conformity 
in an Asch situation is significantly correlated 
with scores on social desirability, measured by 
the Marlowe-Crowne Social Desirability scale 
(Crowne & Liverant, 1963). In our study, 
however, we do not find this strong relation- 
ship holditig with a similar social desirability 
measure (Ford, 1964). The correlation be- 
tween total conformity and social desirability 
for women was .17 and for men, .15. It ap- 
pears then that the conditions of prior group 
suppert were stifficiently salient to override 
any substantial effects from social desirability. 


CONCLUSIONS 


The findings are consistent with the the- 
oreti¢al framework of this experiment and 
indicate that prior conditions of group support 
do yield differential consequences for subse- 
quent conforming behavior. 

The curves presented in Figures 1 and 2 
reveal sequential patterns of conformity which 
are in line with expectation. There are, of 
course, sex differences in level of conformity 
which require further study. Crutchfield 
(1955) has long since noted the tendency for 
women to be more accepting of the group’s 
normative response than are a comparable 
group of men. We also have the suggestion of 
variable effects of intermittent support or 


_ “partial reinforcement” upon responses of 


men and women under the same conditions. 
Additional research is needed to amplify and 
test for this differential effect. 

ft may be noted, further, that for both 
sexes, considered separately, the control con- 
dition appears to yield a pattern which is 


. quite “parallel in the main to that produced 


by 100% reinforcement. Bearing in mind that 
the control condition is closest in character to 
that found in research following the usual 
Asch-Crutchfield procedure, this finding can 
be interpreted to indicate the possibility of 


an implicit expectancy that others in the 


group should see an unambiguous stimulus as 
the subject does, even in the absence of con- 
firmations 


The question inevitably to be considered is 
whether prior support gives persons a greater 
credence in the group judgment, or a sense 
that they themselves are better judges for 
having had group support. On analysis of our 
postinteraction measures, both of these proc- 
esses appear to be at work. Indeed, in either 
instance they resolve down to an implicit de- 
pendence relationship. In one case the group’s 
view is adopted as a standard of judgment re- 
garding the stimulus task, and in the other 
the group’s view is adopted as a standard for 
assessing one’s own judgment. The fact that 
in the present data, and for each sex, such a 
high relationship obtains between dependence 
and frequency of conforming gives evidence of 
both these processes. 

In sum, it seems plain that the concept of 
dependence growing out of interaction leads 
to another fruitful line of study regarding 
conformity process. In this respect, one evi- 
dent outcome of the present study is to dem- 
onstrate the different sequential effects pro- 
duced by varying levels of prior support, 
apart from establishing the broader linkage 
between prior experience and subsequent 
levels of conformity. Viewing conformity, 
then, as a process involving learning experi- 
ences in groups lends greater richness to the 
study of this phenomenon. 


REFERENCES 


Ascu, S. E, Effects of group pressure upon the 
modification and distortion of judgments. In H. 
Guetzkow (Ed.), Groups, leadership, and men, 
Pittsburgh: Carnegie Press, 1951. Pp. 177-190. 

BACHRACH, A. J., CANDLAND, D. K., & Gisson, JANICE 
T. Group reinforcement of individual response 
experiments in verbal behavior. In I. A. Berg & B. 
M. Bass (Eds.), Conformity and deviation. New 
York: Harper, 1961. Pp. 258-285. 

Crowne, D. P., & LIVERANT, S. Conformity under 
varying conditions of personal commitment. Jour- 
nal of Abnormal and Social Psychology, 1963, 66, 
547-555. 

CruTcHFIELD, R. S. Conformity and character. 
American Psychologist, 1955, 10, 191-198. 

DeurtscH, M., & GERARD, H. B. A study of norma- 
tive and informational social influence upon indi- 
vidual judgment. Journal of Abnormal and Social 
Psychology, 1955, 51, 629-636. 

Foro, L. H., Jr. A forced-choice, acquiescence-free, 
social desirability (defensiveness) scale. Journal of 
Consulting Psychology, 1964, 38, 475. 

Goupner, A. W. The norm of reciprocity: A pre- 
liminary statement. American Sociological Review, 
1960, 25, 161-179. 


858 


Hottanper, E. P. Conformity, status, and idiosyn- 
crasy credit. Psychological Review, 1958, 65, 117- 
127. (Reprinted: E. P. Hollander, Leaders, groups, 
and influence. New York: Oxford Univer. Press, 
1964. Pp. 161-178) 

Hopranper, E. P. Reconsidering the issue of con- 
formity, in personality. In H. P. David & J. C. 
Brengelmann (Eds.), Perspectives in personality 
research. New York: Springer, 1960. Pp. 210-225. 
(Reprinted: E. P. Hollander, Leaders, groups, and 
influence. New York: Oxford Univer. Press, 1964. 
Pp. 179-193) 

Hottanper, E. P, & Wuus, R. H, Conformity, 
independence, and anticonformity as determiners 
of perceived influence and attraction. In E. P, 
Hollander, Leaders, groups, and infiuence. New 
York: Oxford Univer. Press, 1964. Pp. 213-224. 

Homans, G. C. Social behavior as exchange. Ameri- 
can Journal of Sociology, 1958, 63, 597-606. 

Homans, G. C. Social behavior: Its elementary forms. 
New York: Harcourt Brace, 1961. 

Kerman, H. C. Effects of success and failure on 
“suggestibility” in the autokinetic phenomenon. 
Journal of Abnormal and Social Psychology, 1950, 
45, 267-285, 

Lucums, A. S., & Lucus, Epira H. On con- 
formity with judgments of a majority or an 
authority. Journal of Social Psychology, 1961, 53, 
303-316, 


E. P. Horranper, J. W. JULIAN, AnD G. A. HAALAND 


Mausner, B. The effect of prior reinfor 
the interaction of observer pairs. Jou 
normal and Social Psychology, 1954, 49, 

Pepinsky, PAULINE, HEMPHILL, J. K, & § 
R. N. Attempts to lead, group produci 
morale under conditions of acceptance ; 
tion. Journal of Abnormal and Social P: 
1958, 57, 47-54. s 

Rorrer, J. B. Social learning and clinical 
ogy. New York: Prentice-Hall, 1954, 

Smer, M. A study of some social facto! 
ception. Archives of Psychology, 1935, 27, 

Tuwaur, J. W., & Kerrey, H. H. The sı 
chology of groups. New York: WEey; 1 

Wars, R. H. Two dimensions of confo 
conformity. Sociometry, 1963, 26, 499-51 

Warts, R. H. Conformity, independence, 
conformity. Human Relations, 1964, in 

Wis, R. H., & Horranper, E. P, An ex 
study of three response modes „in social 
situations. Journal of Abnormal and So 
chology, 1964, 69, 150-156. (a) 

Wius, R. H., & Horranper, E. P. Supp! 
note: Modes of responding in social influent 
ations. Journal of Abnormal and Social 
ogy, 1964, 69, 157. (b) , 

Wer, B. J. Statistical principles in exp 
design. New York: McGraw-Hill, 1962, 


(Early publication received May 17, 196 


Journal of Personality and Social Psycholo, 
1965, Wel 2, No. 6, 859-864 a a 


BRIEF ARTICLES 


pat POWER, COMMUNICATION, AND POSITION SATISFACTION IN 
TASK-ORIENTED GROUPS r 


DAVID WATSON anp BARBARA BROMBERG 


University of Toronto 


In a 2X 2 design, position in a social power structure and the frequency of 
communication reception were varied independently. It was observed that: 
o ° (a) the frequency of communication input directly affected communication 
output, “regardless of position in the power structure; (b) there was both a 
direct effecteof power position and an interaction effect upon the individual's 


satisfaction with his position; 


(c) both power position and communication 


reception affected the amount of positive social-emotional content in Ss’ mes- 


Several studies have indicated that the occu- 
pancy by group members of different positions in 
. a group power structure affects the group mem- 

ber’s morale, Those holding a position of rela- 

tively high power enjoy their position within the 
group more than do low power group members 

(cf. Hurwitz, Zander, & Hymovitch, 1960; Tor- 

raace, 1955; Watson, 1965). Position within the 

group power structure also affects communica- 
tion variables, as individuals with high power 
communicate tore frequently than do low power 
individuals (cf. Cohen, 1958; Hurwitz et al., 

1960), From such results it might be concluded 

that holding a position of high power leads to 

higher position satisfaction and higher frequency 
of communication. 

This last conclusion is not warranted, however, 
as it appears that in all reported research on $0- 
cial power variables the individual’s position in 
the group power structure and the frequency with 
which he receives communication are confounded. 
That is, since position in the group power struc- 
ture generally determines the frequency of com- 
munication that a group member receives, high 
power members receiving more communication 
than low power members (cf. Torrance, 1955; 
Watson, 1965), then holding a position of high 
power guarantees the individual both the oppor- 
tunity to influence or determine the behavior of 
othèrs and the opportunity to receive a relatively 
large amount of communication from other group 

` members. The purpose of the present study was 
to separate the effects of position in power struc- 
ture and the frequency of communication recep- 
tion to determine which, or possibly both, of 
these two variables influences position satisfac- 
tion and communication frequency. 


sages; and (d) both affected Ss’ appraised chances of being helpful in the 


group, but only power affected Ss’ appraised chances of being influential. 


The design of the experiment was a 2 X 2 fac- 
torial. The independent variables were position 
within a power structure, either high or low, and 
the frequency of communication reception. Power 
was defined as the authority of one group mem- 
ber to issue task-relevant commands to other 
group members. These other group members 
held no formal power, as they were obliged to 


comply with whatever commands they received 


and could not themselves issue commands. Ma- 
nipulation of the frequency of communication 
reception was accomplished by means of pre- 
pared messages which the experimenter substi- 
tuted for the messages which the group members 
sent to each other. 


METHOD 


Subjects and Task 


The subjects, strangers to one another, were male 
undergraduates, randomly assigned to one of the 
four experimental conditions. The subjects 
that they were ting in a functioning four- 
man group which met for 1 hour, but actually were 
not, as all communication between them was inter- 
cepted and replaced by prepared messages.* Subjects 
were seated in separate booths so that during the 
experimental hour they could not see or communi- 
cate verbally with each other. All subjects were run 
by the same ter. The N was 64 subjects, 
16 in each of the four conditions.? 


1 Six subjects were replaced because they indicated 
on a posttask questionnaire their suspicion that they 
were not members of a functioning group. 

2 For all analyses, N is defined as the number of 
subjects rather than the number of four-man groups. 
Since at no time during the experiment did the sub- 
jects in fact interact, and since, via the prewritten 
notes, subjects at comparable power positions in all 
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TABLE 1 
MEAN NUMBER oF MESSAGES SENT 


Power 
Message reception 
o High Low 
High 32.3 32.6 
Low 20.9 23.8 
Analysis of variance 
Source df MS F 
Message reception (A) 1 1,620.1 23.31* 
Power (B) i 42.3 0.61 
AXB 1 27.5 0.40 
Within 60 69.5 
*p <.001. 


All subjects performed the same task, a group test 
of general geographical information. The test con- 
sisted of eight items. Each item required that the 
group rank three geographical areas according to 
some specific characteristic, For example, one item 
was: “Rank these islands according to their size: 
Newfoundland, Iceland, Cuba.” The eight items were 
presented one at a time to all four group members, 
At the end of a 3-minute period, an answer was 
collected from each subject and scored. In order for 
an individual group member to receive credit for a 
correct answer, all the members of the group had to 
rank the particular areas correctly. The group’s score 
was kept on a card in front of each member. 


Procedure 


Subjects received a first set of instructions, identi- 
cal for all subjects, explaining the task and inform- 
ing them that all communication was to be by writ- 
ten messages which the experimenter would pick up 
and deliver. Subjects were allowed to send as many 
Messages as they wished to any group member at 
any time during the 3-minute trials. Each group 
member was identified by a code name. A single 
restriction, with appropriate rationale, was placed 
upon the content of the subjects’ Messages, that di- 
rect questions could not be asked, or if received, re- 
sponded to. This enabled the experimenter to substi- 
tute prewritten messages without detection. 

A second set of instructions assigned each subject 
to a position within the group power structure. The 
rationale offered to subjects was that the group 
needed leadership or coordination to solve the task. 
High power subjects were informed that they had 
been named “coordinator” for their group and that 
they must issue a command to all other group mem- 
bers, instructing them how to rank the areas under 


groups received identical stimulus materials at the 
same times, the independent experimental unit to 
which the treatments were applied was the indi- 
vidual rather than the group. 
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consideration, at the end of each 3-minute trial, 
Low power subjects were informed that one group 
member had been named “coordinator,” and that he 
had the authority to issue commands with which 
they must comply. All subjects were informed that 
no group member had any more information about 
the task than any other, and that in order to get 


the correct answer it was necessary to communicate 


freely with one another. 

Frequency of message reception was manipulated 
by intercepting all messages sent by subjects and 
substituting for them prepared messages. Those in 
the high message reception condition received twice 
as many messages (40) as those in*ths low condi- 
tion (20). There was a range of 4-7 messages per 
question for the high reception condition, and of 
2-3 for the low reception condition. Both the high 
and low reception groups received the same basic 
20 messages, but the high group received an addi- 
tional 20. Information that would help the subjects 
reach a solution was held constant by using the 
same task-relevant content in both conditions. The 
additional messages were relevant to the task, but 
not helpful in reaching a solution. For example, on 
the question asking subjects to rank Newfoundland, 
Iceland, and Cuba according to size, both conditions 
received a message which read, “I think the Correct 
order is 1-2-3.” Two additional messages were, 
“Cuba is 90 miles from Florida,” and “You really 
have to know a lot about geography to get this 
one!” Thus, the number of specific, solution-rele- 
vant suggestions any subject received was constant 
for the two conditions, Further, the number of 
group members supporting any particular sugges- 
tion was held constant, 

In order to simulate functioning groups aS its 
alistically as possible, in the low power condition 
half the messages ostensibly came from the group 
leader (cf. Bales, 1953; Hurwitz et al, 1960) and 
the remaining half from the low power group mem- 
bers. In the high power condition the ostensible 
source of messages was evenly distributed among 
the three low power members. Commands were also 
substituted by the experimenter, all low power sub- 
jects receiving identical commands. High power a 
jects did not receive commands, A special formal 
was used to distinguish commands from other mes- 
Sages. 

It was predetermined that all subjects would? re- 
ceive a score of 5 points out of a possible 8. i 
control was possible since the experimenter sihi 
tuted all commands and since the three areas to 


j 
l 


tanked in each question were selected so that they 


were nearly identical on the attribute in quea 
The same serial order of correct or incorrect ani 
applied to all conditions. After completing the iik 
items all subjects were given a questionnaire pi d to 
pate of the experiment was then explaine 
them. 


RESULTS 
i n 
Three items on the questionnaire „checked d 
the subjects’ understanding of the isted 
All subjects correctly indicated to whom 
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TABLE 2 
MEAN ENJOYMENT OF POSITION WITHIN THE Group 
P 
” Power 
Message reception 
High Low 
High 4.6 4.0 
Low 5.0 3.0 
. 
Analysis of variance 
rr 
Source af MS F 
p 
Message reception (A) t 1.0 0.83 
Power (B) 1 27.6 23.00** 
AXB 1 7.5 6.25* 
Within 60 1.2 
bs . 
*p < 025. 
wD <.001. 


could send messages, who their group “coordi- 
nator” was, and to whom he could issue com- 
mands. To check on the possibility that the sub- 
jects in the high message reception condition 
received more helpful information than low re- 
ception subjects, subjects were asked how help- 
ful they felt the messages they received were. 
There were no significant differences in response 
to this question. Subjects were also asked how 
satisfied they were with their score, how much 
they enjoyed the game, how much of a necessity 
they felt to cooperate with other members, and 
how important it was personally for them to 
solve the task, None of these questions elicited 
significantly different responses between condi- 
tions. Thus, any observed differences between 
conditions cannot easily be attributed to artifacts 


` of differential motivational effects. 


The mean number of messages sent by sub- 
jects is presented in Table 1. In computing the 
number of messages sent, commands issued by 
the high power subjects were not counted, There 
was a significant effect of communication recep- 
tion and no other significant effects, indicating 
that.the number of messages sent by the sub- 
jects „was affected solely by the number of mes- 
Sages they received. Mi 

The subjects were asked on the questionnaire, 
“Considering it as a whole, how much did you 
enjoy your position in the group?” (Trow, 1957). 
Table 2 presents the mean scaled responses to 
this item. There was a significant effect of power 

_ position * indicating that those who held high 
power enjoyed their position more than did those 

3 Subjects responded on a 7-point scale to all ques- 
tionnaire items. The scale points were labeled 0 
through 6. 
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with low power. There was also a significant in- 
teraction effect of message reception and power 
position. 

In an earlier study (Watson, 1965) two mo- 
tives were postulated to be operating in tha so- 
cial power situation, desire to help the group 
solve the task and desire to exercise influence. 
Two questionnaire items were included to deter- 
mine if these motives were affected by the ex- 
perimental manipulations. Subjects were asked 
what they thought their chances were of being 
influential in the group, and what they thought 
their chances were of helping the group solve the 
problem. Table 3 contains the mean scaled re- 
sponses for these two items. There was a signifi- 
cant power effect upon perceived chances of exer- 
cising influence, high power subjects perceiving 


TABLE 3 


MEAN PERCEIVED OPPORTUNITY OF BEING 
INFLUENTIAL OR HELPFUL 


Opportunity of being influential 


Power 
Message reception 
High Low 
High 4.8 3.6 
Low 4.8 SA 
Analysis of variance 
Source df MS F 
Message reception (A)| 1 0.70 0.62 
Power (B) 1 34,50 30,53** 
AXB 1 1.30 1.15 
Within 60 1.13 
Opportunity of being helpful 
Power 
Message reception —— 
High Low 
High 4.6 3.8 
Low 4.1 3.0 
Analysis of variance 
Source df MS Pr 
ea E 
Message reception (A) | 1 7.50 8.43" 
Power (B) 1 14.00 15.73** 
AXB 1 0.70 0.79 
Within 60 0.89 
01 
oh iooi 
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TABLE 4 


NUMBER OF SCORES FoR SoctAL-EMOTIONAL CONTENT 
ABOVE AND BELOW THE GRAND MEDIAN 


> Above median | Below median 
> 
Message reception Power Power 
High | Low | High | Low 
High 11 14 5 2 
Low 1 6 15 10 


Analysis of variance 


Source df x> 
Overall 4 24.50** 
Message reception (MR) 1 0 


MR X power (P)* 
MR X median split 


(MS) 1 20.25** 
P XMS 1 4,00* 
MR X P XMS 1 0.25 


AMR X P is necessarily 0, as each cell entry summed over 
aoe n = 16, that is, all scores are either above or below the 
median, 


“p Sioi. 

greater chance of exercising influence. Concern- 
ing the individual’s chances of helping his group, 
both power position and Message reception had 
significant effects, 

In the low message reception condition all mes- 
sages were solution oriented while in the high 
condition, half the messages were solution irrele- 
vant, It is possible that these latter messages 
created a more personal or social atmosphere, 
and that the manipulation may have consisted 
of two conditions: one which was strictly solu- 
tion oriented and one that placed some emphasis 
on social-emotional considerations, In order to 
check for any differences which may have re- 
sulted from this situation, a content analysis of 
the messages actually sent by the subjects was 
performed. 

Each message was scored according to whether 
it contained any content which fell in Bales’ 
(1951) positive social-emotional categories—one, 
two, and three—and the percentage of messages 
with such content was computed for each sub- 
ject. The distribution of scores was very skewed, 
a Pearson Type IIT distribution; 22 subjects 
Sent no messages with social-emotional content. 
To analyze the data, the grand median for scores 
in all four conditions was obtained, and Table 4 
presents the number of scores falling above or 
below this median for each cell, a 2x%2%2 
table. Analysis was performed by partition of 
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chi-square (Winer, 1962). There were two sig- 
nificant effects. Subjects in the high message re- 
ception condition sent a greater percentage‘ of 
social-emotional content. While it is possible that 
the high message condition elicited more social- 
emotional content from the subjects because of 
the nature of the messages they received, it is 
clear that this is not the only variable eliciting 
such content, as subjects in the low power con- 
dition also sent a significantly higher percentage 
of social-emotional content. 


Discussion er 


id 3 

It was observed thatthe frequency of com- | 
munication output was affected only by the fre- 
quency of communication input, and not by i 
Power position. Thus the conclusion that high 
power persons send more communication =than 
do low power persons because of their power po- 
sition per se is not warranted, and is probably 
the result of confounding two variables, power 
position and frequency of communication recep- 
tion. When these two variables are separated ex- 
perimentally it becomes clear that the frequency 
of communication: reception is the effective vari- 
able. It is hypothesized that high power persons 
in functioning social power situations send out 
more communication because they receive more 
communication than do low power persons in 
such groups. 

In the present study it was decided to hold 
the information in the prepared messages con- 
stant while varying the number of messages re- 
ceived. As a check on the informational aspects 
of the content, the subjects were asked about 
the helpfulness of the messages they received, 
and no differences between groups were observed. 
This suggests that the solution-relevant informa- 
tion was held relatively constant. 

While the observed differences between groups 
do not seem to be affected by informational con- 
tent, it is possible that they may have been af- 
fected by the solution-irrelevant nature of the 
additional messages delivered to the subjects in 
the high communication reception condition: and 
not by the actual number of messages received. 
The results in Table 4 indicate that the addi- 
tional messages may have created two condi- 
tions, one that was strictly solution oriented and 
one that placed more emphasis on a personal or 
social atmosphere in the groups, It seems equally 
plausible that subjects in the high message recep- 
tion condition sent more social-emotional con- 
tent because of the atmosphere created by te- 
ceiving solution-irrelevant messages or because 
of the pleasant value of receiving a large num- 
ber of messages from other group members: 
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Given the design of this study it is impossible 
to differentiate the two possible effects. 

The fact that low power subjects sent more 
positive social-emotional content than high power 
subjects cannot, of course, be explained by dif- 
ferences created by the message reception con- 
dition. The effect of power position is similar to 
that of Kelley (1951), and it seems likely that 
such content is the result of attempts by low 
péwer subjects to improve or defend their po- 
sition by either gaining power or mollifying those 
who hold power (Hurwitz et al., 1960; Kelley, 
1951).° ° : 

The observation that high power individuals 
enjoyed their position ‘within the group more 
than did low power individuals replicates the re- 
sults of other studies (Hurwitz et al., 1960; 
Watson, 1965). Jn response to a questionnaire 
item,°84% of the subjects indicated that they 
would rather be coordinators than low power 
group members. This result, although not a direct 
measure of an individual’s satisfaction with his 
position in the group, suggests that occupying a 
positfon of high power is appraised as being more 
satisfying than occupying a low power position. 

There was also a significant interaction be- 
tween power position and message reception. If 
the additional messages in the high message re- 
ception condition created a more personal or so- 
cial atmosphere, as the evidence in Table 4 in- 
dicates, then the interaction effect is interpret- 
able. It was noted above that a hypothesized goal 
of low power group members js to improve or 
defend their position vis-a-vis high power group 
members. From the point of view of the subjects 
in this study, low power subjects in the high 
message condition succeeded in attaining that 
goal, their evidence for this being either the 
relatively large number or the kind (solution 
irrelevant) of messages they received, while sub- 
jects in the low-low condition did not achieve 
that goal. Thus the low power, high message sub- 
jects enjoyed their position more than the low- 
low subjects. Further, the high power subjects 
may have felt that they lost power or authority 
because of the amount or kind of messages they 
received from other group members. From this 
point of view the interaction effect is under- 
stood as: high power gives most satisfaction in 
a strictly solution-oriented situation; low power 
gives least satisfaction in a strictly solution-ori- 
ented situation; the difference between high and 
low power is reduced in a situation where other 

» interpersonal relationships are allowed to enter 
into the communication. 

The results presented in Table 3 suggest that 
increased satisfaction with position may be the 
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result of the high power individual’s greater per- 
ceived chances of exercising influence and of 
helping the group solve the task (cf. Mulder, 
1960; Trow, 1957). Watson (1965) hypothesized 
that position in a group power structure varied 
the group member's perceived opportunities of 
exercising influence and of being helpful, and the 
present study offers some confirmation for this 
hypothesis, in that those who either held high 
power or received a relatively large number of 
messages perceived their chances of being help- 
ful to be highest. Further, those who held high 
power felt their chances of being influential to 
be highest. Obviously the group member with 
relatively high power will perceive his chances 
of being influential to be higher than the group 
member with low power. He may also perceive 
his chances of being helpful to the group to be 
higher because he has, by virtue of his power, 
a greater opportunity to influence the group. The 
group member who receives a relatively large 
amount of communication from the other group 
members may perceive his chances of being help- 
ful to be high because he is in frequent com- 
munication with the others, 

In studies investigating the effects of location 
in a power structure on the behavior of the in- 
dividual group member, the authors often refer 
to “morale” as a dependent measure. On reading 
these studies it is usually found that the indi- 
vidual’s “morale” has been equated with a meas- 
ure of his job or position satisfaction (cf. 
Mulder, 1960; Trow, 1957). If “morale” is gen- 
erally affected by experimental manipulations 
then it might be expected that enjoyment of the 
task, satisfaction with performance, ego involve- 
ment in solving the task, etc., as well as position 
satisfaction would be affected. By means of the 
questionnaire items in the present study it was 
possible to determine whether or not social power 
position had only a position satisfaction effect, 
or the more general effects noted above. The ob- 
servation that power position affected only the 
{ndividual’s enjoyment of his position in the 
group suggests that the use of the general con- 
cept of morale in power structure studies is un- 
necessarily imprecise. 
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COMMITMENT TO ACTION TAKING AS A CONSEQUENCE 
OF COGNITIVE DISSONANCE + 
DAVID MARLOWE, ROBERT FRAGER 
Harvard University 
anb RONALD L. NUTTALL 
Boston University 
2 studies were undertaken to test predictions concerning the relationship of 


personal importance of a 
after indicating their attitude toward 


Several current theories in social cholo; 
share the view that people will EG Foi 
imbalance or incongruity between their attitudes 
or between an attitude and related behavior. 
The theory of cognitive dissonance (Festinger 
1957) has provided the impetus for a variety 
of experiments bearing on this notion. Brehm 
and Cohen (1962) provide a comprehensive re- 
view of these studies. In brief summary, the 
theory of cognitive dissonance maintains that a 
person who holds two incongruous or dissonant 
cognitions will experience discomfort, and will 
seek to reduce this discomfort by altering his 
ideas or his overt behavior so as to restore 
balance. Most studies carried out within this 


1This study was supported by funds fr 
National Science Foundation, Grant 25113 E Das 
Marlowe. A shortened version of this paper was 
presented at the American Psychological Associati 
meetings, August 1963. i ‘Sra 


theoretical framework have focused on attitude- 
behavior changes which were manifested in shifts 
in scale ratings obtained in situations with ob- 
vious experimental connotations to the subjects. 
The main purpose of this study was to investi- 
gate the effects of dissonance on the willingness 
to take overt action in a realistic social situation. 

Two questions were formulated for experi- 
mental study. Are persons who experience, some 
negative consequence after expressing a given 
belief more willing to take action congruent with 
those beliefs than persons who experience neu- 
tral consequences? Would this discomforting 
outcome cause the person to attach greater per- 
Sonal importance to his beliefs? Dissonance 
theory would lead us to expect that a person 
who suffers for his beliefs would subsequently 
tend to value the beliefs more strongly, thereby 
justifying the suffering undergone. In line with 
this formulation, Festinger (1961) in a discussion 


= 
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of the psychological effects of insufficient re- 
wards has suggested that “rats and people come 
to “love things for which they have suffered.” 
The results of two studies using human subjects 
Jend credibility to this assumption. Aronson and 
Mills (1959) found that college girls who under- 
went a very embarrassing initiation as a pre- 
requisite to joining a group rated the group as 
more attractive than girls who became mem- 
bers following a mild initiation or no jnitiation. 
In a study of a “doomsday group” whose mem- 
bers hag made many sacrifices for their beliefs, 
Festinger, Riecken, and Schachter (1956) found 
that the disconfirmation of the group’s liefs 
led to increased proselyting behavior. There is, 
then, some empirical evidence to support the 
view that to suffer or make sacrifices for one’s 
beliefs has constquences for behavior predicta- 
ble from dissonance theory. 

The present experiment formally investigated 
this proposition in the following way- i 
a situation in which someone expresses socially 
desirable beliefs and isethen told that his beliefs 
are “not the right ones.” Further, he 
that because his beliefs are “not the right ones” 
he has lost an opportunity to earn an easy $20. 
Fhe person should be in a state of dissonance 
or discomfort since he has been made to suffer 
for his beliefs, that is, uy pelieve something 
which is true and good and it has cost me $20.” 
Since nothing can be done about the $20, the 
person might reduce dissonance by convincing 
himself that the beliefs are extremely important, 
and that he is readily willing to give UP $20 
for what he values so highly. Tt is not too likely 
that dissonance could be reduced by minimizing 
the personal value of the $20. The subjects used 
came to the experiment for $1.50 in pay; for 
them, $20 is a very considerable sum. However, 
a person who goes through the same experience 
but is told that he has lost a chance to make 
another $1.50 should experience little or no 
dissonance, and hence should be less inclined to 
revalue his beliefs. 

Finally, we would also expect that following 
the “Joss” of $20, the subjects would be more 
Willing than the “$1.50 subjects” to take overt 
social action congruent with the relevant ex- 
pféssed beliefs. This prediction follows from the 
assumption that by taking action the person can 
overtly demonstrate to himself how important 
the beliefs are, thereby minimizing further the 


_ discomfort of having suffered for them. 


S METHOD 
Subjects 


The subjects were 43 freshman students enrolled 
at Harvard and Radcliffe (31 males and 12 females). 
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They were recruited at the student. employment of- 
fice and paid the prevailing rate of $1.50 per hour. 
The “sign-up” sheet described the study as “a sur- 
vey of student attitudes and opinions.” 
Procedure f 
Prior to their appearance, the subjects “were ran- 
domly assigned to either the $20 (experimental) 
condition or $1.50 (control) condition by experi- 
menter: (RF). The experiment consisted of two 
parts, each conducted by a different experimenter. 
The experimenter for Part II (DM) had no knowl- 
edge of the condition to which any subjects had 
been assigned. Upon appearing for the experiment, 


Hello, Im Mr. Frager. I’m doing a survey of 
student attitudes. Would you please fill out this 
questionnaire? 


The subject then completed a 6-item attitudes 
toward Negroes scale which was labeled “Student 
Attitude Questionnaire.” ‘These items, adopted from 
The Authoritarian Personality Ethnocentrism Scale 
were answered on @ 6-point scale ranging from 
Strong Agreement to Strong Disagreement. An illus- 
trative item is «Jt would be a mistake ever to have 
Negroes for foremen or Jeaders over whites.” Scores 
on this scale could range from +18 to —18. The 
subject was then told by experimenters: 


Tm going to score your answers on this ques- 

have the right attitudes, you 

can fill out some more questionnaires now for 
an hour or so, oF on another day if that’s more 
convenient for you. Tf you have the right atti- 
tudes, I'll be able to pay you $20.00 [$1.50] for 
the other work. 

Experimenter: then “scored” the questionnaire, 
and said to all subjects regardless of their particular 
answers: 

Im sorry but you don’t have the right atti- 
tudes. You won't be able to fill out the other ques- 
tionnaires for $20.00 [$1.50]. 
people a little less open-minded. That’s all. Oh! 
I almost forgot. I want you to rate how im- 
portant to you are the attitudes you just expressed. 


jmenter: then pretended to be unable to find 
the rating scale and left the room for a minute to 
get one. ‘This deliberate delay was included in order 
that the subject have some time to mull over the 
“Joss” of $20 ($1.50) - When experimenters returned, 
he handed the subject a sheet labeled “Personal 
Importance of Ethnic Attitudes.” The subject then 


exp! 
Following this, experimenter: said the survey was 


over and asked the subject if he would 

please g0 upstairs for 4 few minutes and see Dr. 
talk to all students who 
building. I don’t know what it’s about 
put it will only take a minute or two. 
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The subject then saw experimenters for Part II of 
the experiment. After a moment of feigned puzzle- 
ment as to what the subject wanted, experimenters 
said: 


Ïm Dr, Marlowe and this year I'm director of 
Harvard’s Visiting Student and Exchange Pro- 
gram, I’ve asked everyone in the building who is 
working with students to send them up to see 
me. You may have read in the Harvard Crimson 
about 10 days ago that a group of Negro students 
will be visiting Harvard for a week sometime 
later this semester. Right now I’m in the process 
of organizing a volunteer guide service to show 
the students around. The guides will work ahout 
three hours a day for at least one day showing 
these students around the Harvard area. It will 
be strictly volunteer work—we are unable to pay 
the guides anything, Would you like to volunteer? 
In order to work out the schedule, I'll need a 
definite “yes” or “no” answer now, 


Almost all subjects were able to give a definite 
answer at this point. On a few occasions, a subject 
asked “Where are the students from?” (“ 2) 
or “Exactly when are they coming?” (“I don’t 
know yet, but it will be after exams.”) When any 
other question was asked, experimenter, Tepeated 
that portion of his initial statement appropriate to 
the question or else said “I don’t know yet.” Im- 
mediately after the subject gave a definite answer, 
experimenters conducted an interview to determine 
antes the subject had any suspicions as to the 
Teal purpose of the meeting. In eve; instan 
subjects said they believed what ae me 
told and did not suspect that they were still in an 
experiment. In 


their free hours, Two points should be Teemphasized, 
no knowledge of the conditions 
the subjects were assigned to by experimenter, 
and the subjects fully understood that they were 
being asked to work at least 3 hours. Before the 


subject left, experimenters 
had just been in an experimen that the subject 


explanation for the necessity 
had the subject promise not 
ment with fellow students, 


RESULTS AND Discussion 


variables in thi 
were the ratings the subjects Made of go 


the request for volunteers, 

ported without regard to the sex ot ates 
since males and females did Not diff 
cantly on any variable or measure 
Males and females were approximately equall: 
distributed between the $20 and $1.50 conditions 


The prediction that subjects in the $20 condi- 
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TABLE 1 


FREQUENCY OF VOLUNTEERING AS A FUNCTION , 4 
OF TREATMENT CONDITION 


Volunteer $20 $1.50 7 
Yes 17 9 
No 5 12 


* 


> 


tion would rate their attitudes as significantly 
more important to them than subjects in the 3 
$1.50 condition was not borne out, Th: mean 
importance rating obtained. by subjects in the- 
$20 condition (6.05 on ’a 7-point scale) is vir- 
tually identical to the mean of 6.09 obtained 


in the $1.50 condition. Thus, the subjects in 5” +. 


general, regardless of treatment condition con- 
sidered their attitudes towards Negroes to be 
of at least moderate personal importance, Only 

4 out of 43 subjects checked a scale position 
lower than 5, whereas 16 subjects checked 7, 
the maximum position on the scale. That sub- e 
jects in the $20 condition did not rate their 
attitudes as personally more important than 
$1.50 subjects may have been due to a ceiling 
effect resulting from our failure to provide ex- ` 
treme enough wording at the upper end of tht 
rating scale. 


__ The ratings of importance were related though 


not very strongly, to the willingness to volun- 

teer. Subjects who volunteered (N= 26) ob- 
tained a mean importance rating of 6.27 (SD= 

857) while those who refused (N= 17) ob- 
tained a mean of 5,82 (SD = .922). The differ- 

ence between these means is significant at the 

12 level (¢= 1.57, two-tailed). One might con- 
clude, though somewhat weakly in view of the » 
marginal significance level, that the more per- 
sonally important one’s beliefs, the greater the . ,. 
likelihood of behaving in a manner congruent 
with those beliefs, 

The results for the “volunteering” variable 
Strongly support the prediction that subjects in 
the dissonance ($20) condition would volunteer 
cantly more often than subjects in the ra 
nondissonance condition. In the $20 condition, 

77% of the subjects volunteered as compared to 
42% in the $1.50 condition. The difference 
these percentages is significant (f= 
2.46, b = 02). The data are presented in Table 1. i 
We interpret the greater frequency of volun- + 
in the $20 condition as a consequence 
of the subjects having experienced cognitive dis- 
Sonance, Our Teasoning is that the “loss” of 
$20 induced dissonance because the loss was 
linked to the expression of beliefs that subjects 
undoubtedly view as true and good. By volun- 


s 
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teering, the subject can overtly demonstrate to 
himself how personally important the beliefs are, 
thereby minimizing the discomfort of having 
“suffered” for them. The $1.50 subjects, having 
experienced little, if any, dissonance, were less 
inclined to volunteer. It should be noted that 
$1.50 is a sum readily available to any Harvard 
student who wants to serve in an experiment for 
an hour. The results, moreover, cannot be at- 
tfibuted to, any initial difference between con- 
ditions in the favorability of subjects’ attitudes 
towards Neggoes. The means for the $20 and 
$1.50 fonditions og the 6-item Negro attitude 
scale were —15.60 and,—15.00; (negativé signs 
indicate “pro-Negro” attitudes). There was, 
moreover, only a very slight tendency for sub- 
jects who volunteered to have more favorable 
attitydes towards Negroes than subjects who 
refused (t= .917, p = .18, one-tailed). 

After completing this experiment, a certain 
ambiguity about the conclusion we have drawn 
from our main finding became apparent. Do 
the data unequivocally support the conclusion 
that the observed increase in the frequency of 
volunteering for subjects who “suffered” is in 
fact due to their having suffered specifically for 
their beliefs? Is it not possible that suffering 
per se, quite independent of what one has just 
said or done induces a tendency to behave in 
a socially desirable way? Perhaps the act of vol- 
unteering made the subjects feel better and 
thereby alleviated the discomfort due solely to 
“losing? a relatively large sum of money. Con- 
sequently, a second experiment was carried out 
to determine’ whether “suffering,” unrelated to 
the expression of a given belief would induce 
overt social action consistent with that belief. 

Experiment II was an exact duplication of our 
first study except for one change in the proce- 
dure which served to divorce the monetary loss 
from the subjects just previously expressed 
ethnic beliefs. After completing the 6-item Negro 
attitude scale, experimenter, announced that the 
survey was over. The scale was not scored, nor 
was any mention made of additional work if the 
subject had the right attitudes. Instead, the sub- 
gects* were asked if they would care to serve as 
a subject in another unrelated study for an hour 
a 


TABLE 2 


FREQUENCY OF VOLUNTEERING AS A FUNCTION 
oF TREATMENT CONDITION 


Volunteer $20 $1.50 
Yes fe 10 
No 8 5 


—— ee 
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or so for which they would be paid $20 (or 
$1.50), All subjects said “yes,” whereupon ex- 
perimenter, then said he would now determine 
whether the person would be placed in the ex- 
perimental or control group. The experimenter 
then drew a number out of a box and announced 
to all subjects, 


I’m sorry, but you won't be able to participate in 
this other study, I’m assigning subjects to the 
Experimental or Control group by drawing numbers 
at random. You are in the Control group so you 
won't be able to participate and make $20 (or 
$1.50) for another hour’s work, 


The procedure then continued exactly as in 
Experiment I, The subjects rated their beliefs 
for “personal importance” and then saw experi- 
menters. A total of 30 subjects were used, 15 
assigned at random to the $20 and $1.50 condi- 
tions. 

The results for volunteering (Experiment II) 
are contained in Table 2. 

The frequency with which the subjects volun- 
teered was unrelated to the manipulations of 
Experiment II (x? = 1.22). Thus, the combined 
results of Experiments I and II indicate that 
to “suffer” for reasons unrelated to one’s specific 
beliefs is not likely to induce action taking con- 
gruent with those beliefs. The increased incentive 
to act in accord with one’s beliefs as a conse- 
quence of suffering is contingent upon experi- 
encing a loss specifically tied to those beliefs. 
This conclusion is highlighted when we make 
comparisons in the frequency of volunteering be- 
tween experiments. The difference in the per- 
centage of subjects who volunteered in the two 
$20 conditions (.77 versus .46) falls just short 
of significance at the .05 level (¢ = 1.93), In the 
two $1.50 conditions, the difference between 42 
and 66% is not significant (¢ = 1.47). 

Results similar to those of Experiment I were 
obtained in Experiment II with the “impor- 
tance” and “Negro attitude” variables. The 
treatment of Experiment II had no effect on the 


subjects’ ratings of the personal importance of ` 


their beliefs. Subjects in the $1.50 and $20 con- 
ditions obtained virtually identical mean “im- 
portance” ratings of 5.93 and 6.06. The impor- 
tance ratings again were only weakly related to 
the willingness to volunteer. Subjects who volun- 
teered (N=17) obtained a mean importance 
rating of 6.11, while those who refused obtained 
a mean rating of 5.84 (f= 1.36, p > .10). Simi- 
larly, the subjects’ attitudes toward Negroes were 
not very strongly related to volunteering. Sub- 
jects who volunteered obtained a mean of —15.41, 
while those who refused obtained a mean of 
—13.46 (t=1.64, p=.06, one-tailed), There 
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was, thus, a slight tendency for subjects who 
volunteered to express more favorable attitudes 
toward Negroes. 

A major feature of these results is our demon- 
stration that dissonance theory can successfully 
predict the occurrence of personally meaningful 
behavior elicited in a realistic social situation. 
The theory has been criticized on the grounds 
that it has produced experiments with overly 
elaborate manipulations carried out in artificial 
settings (Chapanis & Chapanis, 1964), Hence 
our effort to examine a dependent variable 
(volunteering) in a situation with minimal ex- 
perimental connotations for our subjects, sWe 
were apparently successful in doing so and have 
thereby, we believe, added to the generality and 
predictive utility of dissonance theory. 
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o) 
A motor performance and a kinesthetic figural aftereffect ini ‘ ; 
task dminist 
to 134 college students, and scores reflecting the buildup Aa ERRAI | 
reactive inhibition were derived, In order to evaluate Eysenck’s theory of 
on -curavetsion, these scores were correlated among themselves and 
hitler tad of anxiety, hysteria, Psychasthenia, extraversion, and neuro, 
‘ 7 addition, groups defined as inhibitory (fast buildup and slow dis- 4 
ipation of inhibition) or excitatory (slow buildup and fast dissipation) were 
oa oeted with respect to personality scale scores, The findings suggested a z t 
il t relationship between performances on the two tasks, but relations found s 7 
‘ween inhibition and Personality lent no support to Eysenck’s theory. 4 
ot sae erat that extra- susceptibility to reactive inhibition (Iz) (Hull, 
able, represents one of the aiden, variz 1943, 1951). Individuals who generate inhibition í 
etavior, He has eki ER $ ensions of quickly and dissipate it slowly are expected „to 
1955) that the HAID Pa (Eysenck, show extraverted patterns of behavior, while . 
underlying this behavioral A A ‘that ee ake ee soe a eel 
es j of it quickly are expected to show introverted “<. 
to the D ait a ‘5 ote on a dissertation submitted behavior patterns. fut 
cg et ce hs ree, si the State Uni- Two of the many empirical phenomena which 
E T a a eN penake: Supposedly reflect Zp are the work decrement in 
see nas appreciation to T E. Farber for his guide effect A Tance tasks and the figural after- + 
ie gpaehont the investigation. Further thanks 1956- in kinesthetic perceptual tasks (Duncan, > +, | 
a ae 1 arold P. Bechtoldt and the University 6; Eysenck, 1957; Köhler & Fishback, 1950). ` 
Pri aN ae for their aid in analysis of the Thus, if both of these tasks reflect the same oh 
anes oe Sate wn reading and criticizing basic inhibitory process, subjects’ performances ° 4 
D E N Eo a ue hee pP. on the two tasks should be correlated due to sas E 
of the kinesthetic afterefest ones e loan individual differences in susceptibility to Ze  ™ 


Moreover, to support Eysenck’s formulation, 
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extraverts should differ from introverts in work 
decrement and kinesthetic aftereffect (KAE) in 
the manner suggested above for Ip- 

The evidence for these theoretical implications 
is, at best, ambiguous. A number of studies have 
shown some kinds of performance differences be- 
tween introverts and extraverts on motor tasks 
(Eysenck, 1956; Lynn, 1960; Meier, 1961; Ray, 
1959; Star, 1957; Treadwell, 1956), while others 
{Bendig & Vaughn, 1959; Rechtschaffen, 1958) 
have found no differences whatsoever. Eysenck 
(1957) citeg, three investigations in support of 
the Hypothesis that introverts and extraverts 
differ in regards te KAE, but Harvarth (1963), 
Meier (1961), Norcross, Lipman, and Spitz 
(1960), and Rechtschaffen and Bookbinder 
(1960) found no such differences. Moreover, two 
studies have directly explored the relationship 
between KAE and work decrement (Lipman & 
Spitz, 1961; Meier, 1961). Both studies lent 
slight support to Eysenck’s position in finding a 
negative relationship between magnitude of KAE 
and rate of recovery from work decrement. 

These authors have derived scores reflecting 
either the buildup or dissipation of Zp and re- 
lated these scores separately to the extraversion 
dactor. However, Eysenck’s theory does not de- 
‘mand that these aspects of the inhibitory process 
be consistently related to one another in the 
general population, so that, for example, some 
individuals could acquire Zg rapidly but also 
dissipate it quickly. The theory does demand 
that an individual who falls at one extreme or 
the other with respect to both phases of the 
inhibitory process should show the theoretically 

- appropriate behavior pattern. The major purpose 
of this investigation was to relate introversion- 
extraversion to Jp considering both phases of 
the inhibitory process, buildup and dissipation, 
simultaneously. : 


METHOD 


The subjects were 134 college men selected from a 
cSurse in introductory psychology. Scores on a num- 
ber of personality scales were available from a bio- 
graphical inventory administered as part of the 
regular classroom procedure approximately 2 months 
‘prior to this investigation. The subjects were tested 
individually on a motor task after they had com- 
preted a perceptual task in another experimental 
room;, the motor task was always given last due 
to the large amount of fatigue and kinesthetic 
stimulation involved. 


_ Motor Task 
A ball-dropping task, devised by Bilodeau (1952), 
was chosen in order to minimize the effect of habit 
changes on the performance curve and because the 
. effects of varying amounts of practice and rest had 
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been empirically determined. Instructions to subjects 
and apparatus specifications were identical to those 
set forth in Bilodeau’s original work. Generally, the 
apparatus consisted of a wooden chute and three 
wooden balls, and the subjects were instructed to 
lift the balls from a cup at the base of the“ chute 
and drop them into the top of the chute as quickly 
as possible. As a ball passed through the chute it 
tripped a microswitch; this recorded a score of 1 
on one of the two electromagnetic counters which 
operated alternately for 30 seconds each. 

All subjects were given an initial practice period 
and then short rests and test trials as follows: 
8 minutes continuous practice, 30 seconds rest, 30 
seconds test trial, 60 seconds rest, 30 seconds test 
trial. 

Three performance measures were derived from 
this sequence. The difference between the number 
of balls dropped during the fourth minute and the 
number dropped between the thirtieth and ninetieth 
second of prerest practice served as an index of the 
jnitial buildup of work decrement. The difference 
between the number dropped in the last minute and 
the number dropped between the thirtieth and 
ninetieth second gave an index of work decrement 
built up after maximum practice. Speed of recovery 
from the decrement was estimated by the difference 
between number of balls dropped during the last 
minute of prerest practice and the number dropped 
during the first test trial. Wherever 1-minute 
intervals were used, the scores were divided by two 
to make them comparable to test trial scores. 


Perceptual Task 


The general procedure and the exact apparatus 
specifications for this task are set forth by Charles 
and Duncan (1959). Generally, the apparatus con- 
sisted of two wooden blocks called the test object 
(TO) and the stimulus object (SO), and a wooden 
wedge called the comparison scale (CS). The sub- 
jects were first required to grip TO with their right 
hand and CS with their left and find, on the CS, 
the point of subjective equality (PSE). Later, the 
SO was introduced to the right hand and subjects 
were required to rub this for varying lengths of 
time, after which they returned to make more PSE 


-judgments on TO and CS. 


The subjects were given two practice judgments 
on TO and then were given four trials which yielded 
a measure of PSE for each subject. These initial 
judgments were followed by a series of test trials 
which followed either stimulation or rest. Six test 
trials were given, each consisting of the average of 
four judgments on TO. The entire sequence was: 
practice, PSE measure, 15 seconds stimulation (on 
SO), first test trial, 180 seconds rest, second test 
trial, 60 seconds stimulation, third test trial, 120 
seconds stimulation, fourth test trial, 60 seconds rest, 
fifth test trial, 120 seconds rest, sixth test trial, 

This procedure yielded three performance mea- 
sures, The difference between the PSE and the 
score on the first test trial gave an index of the 
initial buildup of KAE. The difference between the 
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PSE and the score on the fourth test trial estimated 
the KAE built up after maximum stimulation. The 
difference between scores on the fourth and fifth 
test trial gave an index of the rate of recovery 
from KAE. 


Personality Measures 


Eysenck (1957) has postulated that the Extra- 
version (E) and Neuroticism (N) scales of the 
Maudsley Personality Inventory (Eysenck, 1957) 
measure orthogonal dimensions of behavior and that 
only the E scale should be related to indices of Iz. 
He has also suggested that hysteria, as a psychiatric 
diagnosis, is a syndrome found in extraverted neu- 
rotics, while psychasthenia is a syndrome found, in 
introverted neurotics, Therefore, the Hysteria (Hy) 
and Psychasthenia (Pt) scales of the Mihnesota 
Multiphasic Personality Inventory (Hathaway & 
McKinley, 1951) should be highly related to the 
introversion-extraversion dimension. The Manifest 
Anxiety (MA) scale (Taylor, 1953) is supposed to 
reflect both extraversion and neuroticism (Eysenck, 
1957); it should therefore be related to Ip only 
insofar as it is a measure of extraversion, All of these 
scales were included in the biographical inventory 
noted above. 


RESULTS AND Discussion 


The performances on the work decrement and 
KAE tasks were generally in line with expecta- 
tions, and the data did not appear to contra- 
indicate the use of the previously discussed 
measures of inhibitory tendencies, 

Performance on the ball-dropping task closely 
resembled data presented by Bilodeau (1952). 
Scores increased slightly for the first 30 seconds, 
then dropped continuously until the rest interval, 
rose above the initial performance level following 
the first rest, and rose even higher following 
the second rest. Thus, the estimates of work 
decrement were confounded with the growth of 
habit, but there is unfortunately no way to 
evaluate the relative effects of these opposing 


Processes at different points on the performance 
curve, 


TABLE 1 


Propuct-MomENT CORRELATIONS 
PERFORMANCE INDICES ON Work 
DECREMENT AND KAE 
Tasks (N = 134) 


KAE rt 
Initial 
buildup 
Initial buildy —.020 
Maximum buildup 041 
Recovery 138 


a 
*p < 05, 


The KAE began to build up very rapidly 
(within 15 seconds) and progressively increased 
during the second and third periods of stimula- 
tion. In contrast to the rapidity of buildup, rela- 
tively little KAE dissipated during any of the 
rest intervals, 


Comparison of Work Decrement and KAE 


In order to evaluate the relations betwee 
measures of inhibition on the two tasks and to 
test Eysenck’s hypothesis that homologous mea- 
sures of work decrement and KAE>woyld be 
positively related, the three qneasures of inhibi- 
tory tendency for each task were intercorrelated 
by means of Pearson r. These correlations are 
presented in Table 1. Of the nine correlations 
between estimates of inhibition, only one reached 
an acceptable level of significance. This cosrela- 
tion indicated that the faster the initial buildup 
of KAE the slower the recovery from work 
decrement (r= —.191, p = .05). 

Even though the size of the correlations in 
Table 1 suggests that no strong relationship «held 
between performances on these two tasks, the 
directions of the obtained relationships may hold 
some interest, First, the three homologous mea- 
sures of inhibitory tendency were all negatively 
related, in contrast to Eysenck’s prediction of 
positive relations. Second, Lipman and Spitz 
(1961) and Meier (1961) found High negative 
correlations between the rate of recovery from 
work decrement and the total amount of KAE 
built up; this correlation was negative in this 
investigation also (r = —.114) but did not reach 
an acceptable level of significance. Mortover, 
seven of the nine coefficients in this investigation 
were negative in sign. Thus, while these findings 
lend little support to Eysenck’s theoretical po- 
sition, they are not opposed to the empirical 
findings of the above authors. 

Both of the above investigations have reported 
correlations which were considerably higher than 
those reported in the present study. One reason 
for this difference may be related to their use 
of rotary pursuit (Lipman & Spitz, 1961)» and 
inverted alphabet printing (Meier, 1961), , both 
of which involve considerably more learning 
man the ball-dropping task. While inhibitory ef- 
ects are apparent in the performance of” All 
three of these tasks, it may well be that sub- 
suming these effects under one general construct, 
Tp, is misleading without taking into account the 
nature of associative factors, There is, in fact, 


Some evidence to suggest that inhibition in a` 


relatively pure motor task does not follow the 


Same course as in other motor tasks more af- 
fected by learning. For instance, recovèry from: 
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TABLE 2 
CQgRRELATIONS BETWEEN PERSONALITY SCALE SCORES 


AND PERFORMANCE INDICES IN BALL DROPPING 
AND KAE Tasxs (N = 134) 


Hy Pt E N 


Work decrement 


Initial buildu; —.073 | —.120 | —.077 047 | —.097 
Maximum bulidup| 1008 | —.148 | -001 | —.077 | 021 
pecavei —.040 | —.032 | —.045 11S 072 
A 

Initial buildu; —.007 | —.067 | —.041 .062 | —.050 
Maximum buildup —.040 2067 | —.118 024 | —.100 
Recovery —.067 1135 | —.091 | .003 | —.138 

2? 


Note—The faster thd rate of recovery from inhibition, the 
greater the recovery scot. Thus, a negative correlatidn means 
that the higher the personality scale score the slower the rate 
of recovery. 


work decrement in motor performance tasks such 
as ball droppittg or cranking (Bilodeau, 1952, 
1954) appears to be much more rapid than in 
motor learning tasks such as rotary pursuit 
(Ammons, 1947). 


Inhibition and Personality 


To evaluate the possibility that there are per- 
sonality variables associated with the various 
aspects of the inhibitory process, product- 
Tnoment correlations were obtained between the 
MA, Hy, Pt, E, and N scales and the estimates 
of rate of, buildup and dissipation of work 
decrement and KAE. None of the correlations 
reached an acceptable level of significance. These 
correlations are presented in Table 2. 

Eysenck’s theory clearly demands that any 
individual who tends to be inhibitory in both 
phases of the inhibitory process should be extra- 
verted in personality. To select inhibitory and 
excitatory subjects, a median for the total group 
was computed for scores on each aspect of work 
decrement and KAE. Inhibitory subjects were 
those whose rate of buildup scores fell above 
ate of recovery scores 


the median and whose r: ‘ 
the median. Conversely, excitatory 
E E fell below the median 


fbjects were those who n 
U above the median on 


of rate of buildup and k 
rateʻof recovery scores. To compare personality 
gcalee scores of these groups, ¢ tests were run 
between the personality scale score means of the 
inhibitory and excitatory groups. No significant 
differences were found between subjects clas- 
sified as inhibitory and subjects classed as excita- 
tory on the motor task. The only notable finding 
on the perceptual task was that excitatory sub- 
jects obtained higher scores on the Hy scale 
(t= 1.93, df = 26, p<.10). Since Eysenck re- 
gards hysterics as extraverted, this finding is di- 
rectly opposed to his prediction. This conclusion 


‘is of course based on the assumption that what 
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is measured by the Hy scale is what Eysenck 
refers to as hysteria. 

The expectation that extraverts would be clas- 
sified as inhibitory more often than introverts 
was evaluated by comparing the extreme groups 
on each personality scale (upper and Tower 15%, 
n=20, 20) with respect to the number which 
fell into the inhibitory classification as defined 
above. Fisher’s exact test failed to show any 
differences between these extreme groups in the 
proportion classified as inhibitory on the per- 
ceptual task. The only difference in proportions 
which approached significance on the motor task 
was that introverts (low E) were more fre- 
quently classified as inhibitory than extraverts 
(high E) (p=.1010). This finding is also op- 
posed to Eysenck’s prediction. 

These findings lend slight support to the pos- 
sibility of a relationship between motor and 
perceptual tasks, but they lend no support 
whatsoever to Eysenck’s theory of introversion- 
extraversion. 
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Data were obtained on the birth order = 
1962, and 1963, at the University of gy = 


Ist born to be overrepresented among college students. 4 


studies had shown the 


In The Psychology of Affiliation, 
(1959) emphasised the gromp orlenttlice ar are 
first born under anxiety-producing situations, One 
fact which he did not comment on in this earlier 
publication is implicit in Tables 22a and 22b of 
that book: It is the marked preponderance of 
the first born among his sample of 298 students 
from the undergraduate psychology classes at 
the University of Minnesota. If the only child 
is considered a first born (as he usually is in 
studies of ordinal position), some 164 (55.03%) 
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were first born while 134 (44.97%) iiss 
l -97%) were later 
. Even with the only child excluded from 


consideration, some 48.46%, or nearly qne-half , ` 


the total, were first born. Such a statistic would . , 


not be surprising if all studeni j 
i irp ts came from two- 3 
a where equality of number ‘might be 
sighs ited; but since representatives of families’ - 
Pe iverse Sizes are to be expected, such a sta- 
me EBA | 
t (1928) reported a stud: involvi a 
Sophomore Dartmouth students, ane which it 


is possible to calculate that 42.31% were first 
born, when the only child is excluded from con- 
sideration, and 53.13% when the only child is 

included among the first born. Hayes (1938) 
published a study of 76 females at Mt. Holyoke 
College. Of these, 50.84% were first born when 
the only child is excluded, and 61.84% are first 
born when the only child is included. Abernethy 
(1940) reported on 300 women from Queens 
College, North Carolina; among this group 
39.85% or 46.66% were first born, depending 
upon how. the only child is treated in the calcu- 
lation# It'is clear that all these studies point to 
an overrepresentation pf the first born ‘at the 
college level, if one may assume some modicum 
of representativeness among the various samples 

© of the total college populations involved. 

It,should also be noted that Terman et al. 
(1925) found a bias favoring the first born in 
their study of the gifted, particularly among 
those coming from families of two, three, and 
four children, He noted the similarity of the 
ratigs he found to those of Cattell (1947), in his 
survey of the eminent American scientist. To 
illustrate with the two-child family, Terman re- 
ported that 56.1% of his gifted sample were 
gmong the first born; in Cattell’s study of emi- 
nence among American men of science, the cor- 
responding percentage is 57.4. The figures for all 
birth-order positions among representatives of 
the three- and four-child family were likewise 
quite similar in both studies. Ellis (1926) in his 
study of eminence among the British found that 
56% were first borns among those who came 
from ¢wo-chifd families, The congruence of these 
data is rather unexpected since the criterion for 
Térman was simply psychometric brightness 
among children, while for Cattell and Ellis the 
criterion was achieved eminence in science for 
one and eminence in general in the other. 

The tabular data from Schachter (1959) 
show that 62.22% of his representatives from 
the two-child family were among the first born, 
The comparable data from Hayes (1938) yields 
a pecentage of 57.69 favoring the first born in 
tle two-child family; for Abernethy (1940), it is 
63.16. Bender (1928) did not report his data so 
sthat the statistic for the two-child family could 
be ‘computed. 

« _ Theeuniformity and consistency of these data 
led the writer to embark on a program of birth- 
order studies, one facet of which concerned 

a the ratios of the birth order of all students enter- 
ing the University of California, Santa Barbara, 
for the years of 1960, 1961, 1962, and 1963. It 
was predicted that the first born would be over- 
‘represented, regardless of the sex of the student. 
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RESULTS 


The record for about 1 student in 20 had to 
be discarded since half siblings, step siblings, and 
foster siblings were judged to render a birth order 
invalid for research purposes. The data on all 
the valid records for students coming from fami- 
lies of one, two, three, or four children are given 
in Table 1. Only 6.96% of the male and 6.25% 
of the female students come from families of 
five or more children. Such small numbers were 
entailed among the larger families that data 
beyond the four-child family are not shown. 

The consistency of percentages for the various 
birth orders stands out clearly in the tabular 
data. The only child shows a percentage varia- 
tion of less than five points, 12.06 to 16.99. The 
first born from the two-child family shows rela- 
tively even less variation than does the only 
child since the older of two ranges from a low of 
24.02% to a high of 29.96%. The later born in 
the two-child family shows the same relative 
incidence as does the only child, ranging as he 
does from 12.03% to a high of 17.94%. The 
first born in the three-child family shows a 
comparable constancy, ranging from 12.79% to 
a high of 18.04%. The variation is somewhat 
greater in the later birth orders, where smaller 
numbers reduce the reliability and increase the 
discrepancy somewhat. Still, the predictability 
from sex to sex and from year to year is good: 
The youngest in the four-child family hovers 
about a mode of 1%, dipping from a low of 
.64% to a high of 1.87%. 

1 Men and women show strikingly similar birth-? 
order ratios. What variability there is could } 
easily be explained by chance fluctuation. Sex 
factors enter negligibly, if at all, in the ratios 


noted in Table 1. The size of the family is shown ` 


to be of marked significance, however, in terms\) 
of the student population. The representatives ) 
of two-child families comprise 42.68% of all the 
students who come from families of one through 
four children; and the families of one through 
four children, it will be remembered, contribute 
93% of all the entering students to whom a 
birth order could validly be assigned. The three- 
child family contributes 30.51%; the four-child 
family, 12.63%; the one-child family, 14.19%. 
The percentages for the total population of 
students, without restriction as to family size, 
are not much different: one-child family, 11.43%; 
two-child, 39.17%; three-child, 29.51%; four- 
child, 13.02%; five-child, 3.95% ; all other orders, 
2.92%. This linkage with size of family is in 
part explicable by the observed linkage of in- 
telligence and number of children (Maxwell & 
Pilliner, 1960). It seems likely that differential 
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TABLE 1 
SIBLING ORDER FOR Peon ase ay ee iad aN 1963, FROM FAMILIES OF : 
> Birth ordera 
ak 1/1 1/2 2/2 1/3 2/3 3/3 1/4 2/4 3/4 4/4 Total 
"Mea | 1| ws | so] mar io aes) sa| zor) 12) ogs 
Men | 12| zess] os| sens] wgs) so sw sae | 222| 1z 
1962 iat 
Men | gl aal sal goj saj sol eos] ssf zs 179 
"Mea | 2e 202] mos ise] nol sao] gss) gas | aos | nsr 
Total y i 
a 2 35 25 1,659 
aa abe Nae GEVA EEA Wre HA A (AON 211 1.51 100,00 
Women 
se ies Lae ieee EEA e e R K EAA Hin #35550 
° 


Note,—In percentages. 
a Read thus: 1/1 means an only child; 1/2 is the first born 
In a three-child family; and so on. 


of 
academic motivations as social class is varied 
must account for part of the variance, Economic 
factors also probably enter since family income 
tends to drop among families of larger size, 
The most significant aspect of Table 1 is, for 
the writer, the observed effect of birth order on 
the four entering college classes. The total num- 
ber of first-born males from families of two 
children is seen to be 433 or 62.84%, while 256 
or 37.16% are the second born or younger. For 
| women from two-child families, the first born 


| accounts for 63.03%, while the second born 


i yields 36.97%. It is thus clear that for each 
| eight entering students who come from two-child 
families, five will be the first born or older 
while only three will be the second born or 


| younger, For the three-child family, 49.81% of | 


the males are first born; 33.40% are second 
born; 16.80% are third out of three. The ratios 
are roughly 1/2 for the first born, 1/3 for the 
second born, and 1/6 for the third born. For 
women from three-child families, the percent- 
ages are 50.96 for the first born, 29.07 for the 
second born, and 19.97 for the third born or 
youngest. Again, there is a marked aberration 
from the theoretical expectancy, which would be 
1/3 or 33.33% for each of three birth orders, 
The four-child family continues the same trend: 
When men and women are considered as a total, 
about 1/2 of all representatives of the four- 


| ated with size of 
2 of the same size 


f two; 2/2 is the younger in a family of two; 1/3 is the first born 


child family are first born; about 1/4 are seconf 
born; the remaining 1/4 are divided between the 
last two birth orders, with the younger of the 
two getting a slightly smaller percentage. 

The percentage of first borns for all entering 
students regardless of sex and family size is 61.34 
when the only child is considered a first born and 
55.05 when the first born is disregarded in the 
computation. For women, the figures are’ 62.29 
and 55.80; for men, 59.87 and 53.49—the per- 
centages varying with the inclusion or exclusion 
of the only child. 


Discussion 


Tt would appear that to the extent the data 
reported in this paper are representative, the 
) probability of attending college is closely associ- 
family and that within families 

the first born has the cards 
stacked in his favor, Over 90% of the local stt- 
dent population comes from families of one, two, 
three, or four children. Of the entering students, 
over 60% 
sidered as a first born without any siblings; if 
the only child is left out of the computations, 
then about 55% are still among the -first-born 
group. 

Some evidence which has come to light since 
this program of studies was begun tends to cor- 
roborate the findings here noted. Capra and 


are first born if the only child,is con- + 


E s 
si 


x 
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Dittes (1962) reported that 61% of two samples 
of Yale undergraduates were either only children 
or® first borns. This figure corresponds almost 
‘exactly to the 61.34% found locally. The figure 
for a sample of 133 students at Reed College * 
in 1962 showed 73% to be either only children 
or first borns; when the only child was left out 
of the equation, the percentage was 66. Stewart 
(1962) in a study of 7,000 boys and girls in 
grammar and modern secondary schools in a 
London borough found that the first born tended 
to be signjficently overrepresented, a tendency 
which Became moreemarked in the upper grades 
since the first born*tended to persist in school 
longer than did the later born. The phenomenon 
of academic primogeniture, which has been noted 
among American colleges for the past 40 years, 
appeays therefore to have its analog when the 
geographical locus is shifted to England and the 
educational level to the state-supported secondary 
school. * 

Schachter’s (1963) publication belatedly came 
to the writer’s attentioa as he was in process of 
final revision of the present paper subsequent to 
its acceptance for publication. Schachter stresses, 
in this significant recent paper of his, the linkage 
af eminence to order of birth, citing many previ- 
ous reports beginning with Galton. He then 
proceeds to show that at the University of Min- 
nesota some %,013 students taking introductory 
psychology in 1959 and 1961 give similar evi- 
dence for a preponderance of the first born. To 
make his findings comparable to those reported 
in the present study, the percentages of first-born 
studenés from families of two, three, and four 
children were calculated from his data. Some 
55.43% of the representatives of the two-child 
family at the University of Minnesota are first 
born; the comparable figure at Santa Barbara 


* is 62.96%. For representatives of the three-child 


family, the first born at Minnesota comprise 
45.97%; at Santa Barabara the figure is 50.50%. 
Fog the four-child family, the figures for the 
first born are 41.07% at Minnesota and 50.56% 
at Santa Barbara. Some 48.94% of the two-, 
thfee-, and four-child family representatives 


- among University of Minnesota students are first 
born; at Santa Barbara, the figure is 56.70%, 


higher by some 7.76 points. This somewhat larger 
percentage of first-born students at Santa 


+ Barbara may be due to the relatively high en- 


trance standards at the University of California, 
where only the top 10-15% of graduating high- 
‘school seniors are found admissible on the basis 
of secondary grade averages. The data on first 
borns at Yale and at Reed previously noted 


* 1 Personål communication, 1964. 
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would lend some credence to the linkage of 
higher entrance standards to a greater ratio of 
first-born students. If this inference is a correct 
one—the writer has no data to corroborate it— 
then public junior colleges which must accepf any 
high-school graduate should show a greater pro- 
portion of later-born students than obtains in 
collegiate institutions with comparatively stringent 
entrance standards. 

Schachter also furnished data on two-child 
family representatives matriculating at Columbia 
University between 1943 and 1962; here, like- 
wise, the first born is overrepresented in every 
corfparison. Among graduate students in the 
Department of Psychology and the Institute of 
Child Welfare at the University of Minnesota, 
Schachter again found the first born overrepre- 
sented. He calculates that 57.79% of this latter 
group is first born but this is done by including 
the only child in the first-born category. If the 
only child is removed from the computations, 
the figure drops to 46.84%, which is slightly under 
the 48.94% found for the first-born representa- 
tives from families of two, three, and four chil- 
dren matriculated at Minnesota as undergradu- 
ates. One reason for this drop is that the pro- 
portion of only children who are graduate stu- 
dents in Schachter’s study is approximately 
double that for the local undergraduate popula- 
tion, who take introductory psychology courses. 

In the same article, Schachter reports the re- 
lation of birth order to grade averages for 628 


first born is shown to have a higher grade average 


N 


students from a Minneapolis high school. The 4 


which is consistent though small. In discussing 
this finding, Schachter disclaims any belief that; 
it may be caused by differences in tested apti- | 
tude between first and later borns, saying that ) 
“Sanity certainly would suggest a conclusion of 
no association.” The explanation for the grade 
differentials he believes to reside in motivational 
factors, He goes on to suggest that since he has, 
found (Schachter, 1964) the first born to be less 
popular than the later born, “Perhaps with little. 
else to do, first borns simply spend more time at/ 
their books.” 

Despite Schachter’s belief that aptitude is not 


a parameter relating to order of birth, there is ' 


some evidence that for high levels of aptitude 
there may well be linkages between birth order 


and measures of aptitude. It will be remembered / 


that Terman et al. (1925) found the first born 
overrepresented among his gifted. Altus (1965) 
reports the first-born students at the Santa 
Barbara campus of the University of California’ 
earn higher scores than do later-born students 
on the Verbal Aptitude section of the Scholastic 
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Aptitude Test. In a later study (unpublished) he 
found it possible to demonstrate significant 
mean differences, for both men and women, 
between first-born (older) and second-born 
(youhger) students from two-child families at 
the Santa Barbara campus while employing an 
aptitude test of only 10 items, 8 verbal and 2 
quantitative. The mean differences on the short 
ptitude test favored the first born, of course. 
Nichols (1964) has found in an 84% sample of 
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control of the first born, assuming that he is, as 
Adler (1928) once wrote, a “power-hungry 
conservative.” No one has corroborated Adlet’s 


dictum and it may be that he was wrong. It can’ 


(be safely asserted, however, that to the extent 
) the first born differs as a person from the later 
born, our society will increasingly feel that dif- 
| ference as college and professional training be- 
come more and more a sine qua non for achiey- 
ing a managerial position in our power structure. 


1,618 finalists in the National Merit Scholarship / 


Qualifying Test that the first born is markedly 
overrepresented: 66% of the two-child family 
representatives were first born; 52% is the ĉor- 
responding figure for the three-child family; 59% 
for the four-child family; and 52% for the five- 
child family. In all, he found slightly over 59% 
of the representatives of two-, three-, four-, and 
five-child families were first born. Since the merit 
finalists represent a sample some three standard 
deviations above the average of the high-school 
population from which they come, they are 
probably superior in aptitude to the mean of any 
college student body in the United States. The 
fact that primogeniture effects stand out so 
strikingly when the criterion of evaluation is 
aptitude test scores rather than performance ‘in 
the classroom lends significant corroboration to 
the general implied thesis of this paper that 
order of birth is a parameter which future studies 
—At least of the very able—must reckon with. 
The data in the present paper relating order 
( of birth to college attendance are in accord with 
| what Schachter (1963) has found and what 
| many others have occasionally reported since the 
| 1920s about college attendance in the United 
States, The documentation of the case for aca- 
| demic primogeniture seems fairly adequate and 
| complete. The parts which motivation and apti- 
tude—if they can be separately dealt with—play 
\ in determining the composition of college student 
| bodies are, of course, beyond the scope of this 
{ paper. The mere fact of the Overrepresentation 
| of the first born does have a peculiar social rele- 
vance which deserves comment, however, 
; The most significant aspects of birth-order 
linkage to college attendance are the loss to 
educable, later barman the ct Presumptive 
$ $ ya upon our rap- 
idly changing society, with its increasing em- 
phasis upon technology, if the society and its 
technology come more and more under the 
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ETHNOCENTRISM AND ATTITUDES TOWARD THE 
PHYSICALLY DISABLED 


MARK A. CHESLER 1 


University of Michigan 


Much of the psychological literature in the field of intergroup relations gen- 


eralizes from investigations conducted with a limited number of 


ethnic 


e minorities, This study attempts to derive a measure of general ethnocentrism, 
and compares this with individuals’ expressed attitudes toward a nonethnic 


minority group, the physically 


nationflity, and socioeconomic’ class, 


disabled. Physically disabled persons are often 
‘targets of interpersonal and intergroup prejudice and constitute a minority 
outgroup rarely investigated. 4 dimensions of ethnocentrism: race, religion, 
as weil as 13 specific groups, were the 


targets of similar attitudinal expressions. Indiviguals who expressed ethnocen- 


trism toward 1 specific minority group, 


or on 1 intergroup dimension, ex- 


pressed similar attitudes toward other groups and dimensions. Moreover, in- 
diyiduals who manifested high ethnocentrism, or high rejection of outgroups, 
y also expressed rejection of the physically disabled. 


The psychological disciplines have long been 
interested in the role of personality factors in 
ethnic prejudice and intergroup relations (eg., 
Allport, 1958; Doob *& Sears, 1939; Frenkel- 
Brunswik & Sanford, 1945; Harding, Kutner, 
Proshansky, & Chein, 1954). Part of the evi- 
dence supporting the psychologists’ emphasis on 


. Personality variables has been the consistent dis- 


covery of positive and statistically significant 
relationships, between attitudes directed toward 
one specific minority group, and attitudes ex- 
pressed toward various other minorities (Adorno, 
Frenkel-Brunswik, Levinson, & Sanford, 1950; 
Lindzey & Rogolsky, 1950). This phenomenon 
suggests that* the source of prejudice lies more 
within the organism than in external and objec- 
tive conditions, and that attitudes toward various 
minority groups tend to be organized into a co- 
herent pattern and then expressed in a mutually 
supportive and consistent manner. The purpose 
of this study is to explore some of these conten- 
tions by assessing individuals’ attitudes toward 
a variety of ethnic, as well as nonethnic, minor- 
ity groups. 

Several contemporary theorists suggest that 
beliéfs, attitudes, and behavior tend to be inte- 
grated in meaningful and sensible ways (Heider, 
1958; Osgood & Tannenbaum, 1955), and serve 
aguseful function in aiding the organism to pur- 
sue and attain certain goals (Katz, 1960; Peak, 
1960). Attitudes toward ethnic groups may be 

1 This research was carried out while the author 
was a Research Assistant at the Human Resources 
Foundation, Albertson, New York, and a graduate 
student at Hofstra University. Appreciation is ex- 
tended to H. E. Yuker and J. R. Block for their 


„advice and assistance. 
é 


treated in much the same manner as other clus- 
ters of attitudes. Allport and Kramer (1946), for 
instance, suggest that a person’s prejudices are 
deeply ingrained in his personal philosophy, are 
reflections of a broader system of belief, and are 
integrated into the very fabric of his personality. 
More specifically, Sullivan and Adelson (1954) 
and Rose (1951) observe that intolerance toward 
one minority group is usually accompanied by 
intolerance towards other minority groups. 
Frenkel-Brunswik and Sanford (1945) report 
a correlation of +.75 between anti-Semitic senti- 
ment and the rejection of other minority groups. 
These findings were later supported by the more 
extensive research enterprise conducted by 
Adorno and his colleagues (1950). In the latter 
study, anti-Semitic sentiment, anti-Negro senti- 
ment, patriotism or nationalism, and general 
attitudes toward minorities yielded intercorrela- 
tions ranging from +.74 to +.83. These findings 
led the authors to state that there was evidence 
for the “generality of the ethnocentric approach 
to group relations [p. 122].” Campbell and Mc- 
Candless (1951) conclude, from a correlational 
analysis of the subtests of the California E Scale 
and several other personality and attitude tests, 
that there is “a general factor among most if not 
all attitudes toward other ethnic groups [p. 90].” 
The ethnic groups so considered—Jews, Mexi- 
cans, Negroes, English, Japanese—all elicited 
attitudes that were significantly positively cor- 
related. Both Bray (1950), utilizing the Levin- 
son-Sanford Anti-Semitism Scale and the Likert 
Anti-Negro Scale, and Prothro (1952), using the 
Grice-Remmers generalized attitude scale, report 
significant positive correlations between the ex- 
pression of anti-Semitic-sentiment and anti-Negro 
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sentiment. Bettelheim and Janowitz (1950) also 
find a pattern of similarity between the corre- 
lates of anti-Semitic and anti-Negro attitudes. 
The latter authors conclude that not only were 
persqnality factors that are related to the ex- 
pression of one attitude related to the expression 
of the other, but personality characteristics un- 
related to one of these attitudes are also unre- 
lated to the other. Finally, Rokeach (1960) cor- 
roborates these conclusions and further states 
that “rejection of the Negro is but a special case 
of a wider rejection of all people, ‘regardless of 
race, creed, or color’ [p. 145].” These findings 
are consistently upheld in the literature, except 
in a few studies where strong social norms and 
standards modify the relationship. Prothro and 
Pettigrew (1959), for instance, report finding 
Southerners anti-Negro, but not necessarily anti- 
Semitic or ethnocentric in general. In these 
Southern studies, anti-Negro sentiment can be 
seen more as a function of sociocultural influ- 
ences than personality factors, Despite these 
influences, psychological reports clearly suggest 
that personality differences with regard to ethno- 
centrism exist in all situations. In general, the 
ethnocentric individual tends to hold his own 
ethnic group in high esteem, and to reject most 
other groups. 

In addition to the problem of generalizing to 
a wide range of specific ethnic minority groups, 
a well-developed theory of intergroup relations 
must handle problems of ingroup-outgroup rela- 
tions that are not primarily ethnic in character. 
Therefore, a principal concern here is the evalu- 
ation of attitudes toward identifiable individuals 
who are members of outgroups other than ethnic 
outgroups. 


Prejudice and the Physically Handicapped 


One such outgroup consists of physically handi- 
capped persons who are often subject to the same 
intolerant behavior as are members of ethnic 
minorities. However, the physically handicapped 
are not generally considered as an outgroup, and 
comparatively little research has been done 
treating them as such. Wright (1960), in her 
summary of the psychological aspects and effects 
of physical disability, remarks that disabled 
persons have much in common with members of 
other minority groups, Barker's (1948) early 
work in this area is even more explicit: “the 
physically disabled person is in a Position not 
unlike that of the Negro, the Jew, and other 
under-privileged racial and religious minorities: 
he is a member of an under-privileged minority 
[p._31].” Barker and his colleagues (Barker, 
Wright, Meyerson, & Gonich, 1953) further re. 


BRIEF ARTICLES 


port that persons with various disabilities were 
unable to find adequate employment and were 
afforded an inferior legal status and rights as.a 
result of “irrational prejudice.” Thus it seems 
that the handicapped person experiences some of 
the same social rejection and ostracism that is 
experienced by members of ethnic minority 
groups. 

Recent empirical studies on attitudes toward 
the disabled support these general contentions, 
Handel (1960) studied attitudes toward blind 
persons and concluded that common stereotypes 
of the blind person usually place him in‘ an in- 
ferior ‘social role. The authoz further reflects on 
the tone of his own analysis and remarks that 
this report “sounded as though we were consid- 
ering a problem of race relations instead of dis- 
ability [p. 363].” A similar study was under- 
taken by Himes (1960), who adapted the Bo- 
gardus social distance scale for use with blind 
students and collegiate normals (Bogardus; 1927). 
His findings indicate that the more intimate the 
proposed relationship with a blind person, the 
more clearly and intensely is he rejected by the 
physically normal. What Bogardus finds for eth- 
nic outgroups Himes finds with regard to blind 
persons, o 

Cowen, Underberg, and Verrillo (1958) present 
further evidence for a common factor in atti- 
tudes toward both ethnic and disability groups. 
Starting with their locally developed scale meas- 
uring attitudes towards blindness, Cowen and his 
colleagues tested several groups of individuals 
with a series of items selected from the Cali- 
fornia E Scale and F Scale. They find that, nega- 
tive attitudes toward the blind correlate positively 
with negative attitudes toward Negroes, toward 
other minority groups, and with positive authori- 
tarian attitudes. Attitudes toward religious and 
racial minorities are seen to have determinants 
similar to those of attitudes toward the blind. 

These studies (Cowen et al., 1958: Handel, 
1960; Himes, 1960) support the view that atti- 
tudes expressed toward physically disabled indi- 
viduals have much in common with attitudes ex- 
pressed toward ethnic minorities. Some of the 
studies have compared several disability groups, 
or generalized attitudes toward the disabled, with 
a single ethnic minority sentiment. Others have 
compared attitudes toward single dicability 
groups with generalized ethnic group attitudes. 


Those using a single disability group suffer from — 


the same lack of generalizability noted earlier 
with regard to ethnic groups. Himes himself sug- _ 
gests that “each physical disability—dea fness 


and crippleness as well as blindness—is signif- 5 


cantly, though differently, stereotyped [p. 551-2 


Thus, the use of attitudes toward the blind as 
evidence for a common expression of attitudes 


* © toward all disabled groups needs to be further 


substantiated. 
The literature indicates that attitudes toward 
o ethnic minorities may be general in character 
and should be expressed in a relatively common 
fashion towards a variety of outgroups. Hypothe- 
sis 1, therefore, states that: Individuals who ex- 
hfbit ethnocentric attitudes toward one particu- 
lar minority group, will express similar attitudes 
toward a variety of other outgroups—teligious, 
racial, $ocideconomig, or nationality groups. 
The literature furéher, suggests that physically 
disabled persons represent a minority group, an 
outgroup, in the American culture. If ethno- 
Ta centrism is general in character, the outgroup of 
_ physically disablgd persons should be subject to 
a the game interpersonal rejection and negative 
sentiment as other, ethnically determined, out- 
groups. *Therefore, Hypothesis 2 states that: 
Individuals who express ethnocentric attitudes 
toward various outgroups will express similar 
attitfides toward physically disabled persons. 


0 


METHOD 


©The Intergroup Relations Scale (IRS) was de- 
veloped by the author as a measure of an individual’s 
attitudes toward a variety of minority groups. It is 
a Likert-type ‘scale consisting of 34 items. The sub- 
ject indicates the degree of his agreement or dis- 
agreement with each statement on a 6-point scale. 
The score for the entire scale, or for each of four 
subscales, indicates the degree of acceptance or 
rejection of mifority groups. 

‘The items in the IRS are slight modifications of 
statements selected from the Levinson-Sanford Anti- 


w Semitism scale (Levinson & Sanford, 1944) and the 


Negro and Minority subscales of the California 
Ethnocentrism Scale (Adorno et al., 1950). Three 
judges, research psychologists, selected the items 
used in the IRS from the total of 76 items available 
from these original sources. The criteria for selection 
included specification that the items: were worded 
so'as to be amenable to modification for use in the 
current study, and had discriminated well in previous 
studiés. Modification in the current study involves 
the deletion of the named minority group and the 
random insertion of a new group. 
Some statements could not be reworded and 
médified for purposes of this study and were there- 
« fore eliminated; for example, “The true Christian 
. can never forgive the Jews for their crucifixion of 
Christ.” If the two italicized groups were deleted 
no other minorities could be randomly substituted 
sand still have the full statement make any sense. An 
example of a stem that was used is the following: 
“The Jewish problem is so deep and general that 
one often doubts that democratic methods can ever 
Solve it.”°In this example, Jewish could be deleted 
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and Irish, Negro, or Catholic randomly assigned, 
and the full statement would still be meaningful. 

Groups were randomly assigned to these nega- 
tively valued stems in random order to insure that 
there was no “necessary” relation between the group 
characteristic, and the specific group with Which 
it was to be associated. Thirteen groups were utilized 
as targets in the IRS. These groups represented the 
major racial, religious, social class divisions, and 
several of the most prominent nationality groups in 
the United States. Specifically, the targets were: 
Catholic, Protestant, and Jewish religious preferences; 
Oriental and Negro racial groups; upper, middle, and 
lower socioeconomic classes; and German, Ttalian, 
Trish, Mexican, and Puerto Rican nationalities. Each 
of fhe three religious groups was randomly assigned 
to three statements, resulting in a religious subscale 
of 9 items. The two racial groupings were randomly 
assigned to three statements each, making a racial 
subscale of 6 items, Each social class was randomly 
assigned to three items, making a subscale of 9 
items, Finally, each of the five nationalities was 
randomly assigned to two statements, resulting in a 
nationality subscale of 10 items. The establishment 
of four separate subscales permits an analysis of the 
generality of ethnocentrism as it is manifest in these 
different outgroup dimensions, and a test of the 
hypothesis that persons who are prejudiced with 
respect to religious groups will also manifest na- 
tional origin, social class, and racial prejudice, 

The Attitudes Toward Disabled Persons Scale 
(ATDP) was designed to measure the way indi- 
viduals view persons who are physically disabled 
(Yuker, Block, & Campbell, 1960). This scale is 
represented as measuring the extent to which the 
disabled person is seen as different from the physi- 
cally normal person. Wright (1960) explains that, in 
the language of interpersonal relations, to be differ- 
ent or set apart often signifies rejection. The ATDP, 
then, should provide a “general measure of preju- 
dice toward the physically disabled [Yuker et al, 
1960, p. 13].” The ATDP consists of 20 Likert-type 
statements, and the subject indicates the degree of 
his agreement or disagreement with each item. Each 
statement suggests that disabled persons are either 
the same as, or different from, nondisabled persons 
in their personality or need for special social rela- 
tions. For instance, one such item is: “Disabled 
people are more easily upset than nondisabled peo- 
ple.” Agreement with this item suggests an emo- 
tional difference between physically disabled people 
and nondisabled people. 

The ATDP has been found to be highly corre- 
lated with the degree of contact between subjects 
and disabled persons. Subjects who have had con- 
tact with disabled persons in their family or work 
place generally manifest lower scores than those 
persons who have not had such intimate contact 
(Yuker et al., 1960). Unpublished studies at Hof- 
stra University indicate that students change their 
attitudes toward the disabled after they have been 
exposed to a semester’s instruction by a well-liked 
and competent instructor who is physically handi- 
capped. 
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TABLE 1 


PRODUCT-MOMENT CORRELATIONS BETWEEN THE SUB- 
SCALES OF THE IRS AND BETWEEN THE 
SUBSCALES AND THE FuLL IRS 
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TABLE 3 


PRODUCT-MOMENT CORRELATIONS BETWEEN THE 
ATDP, AND THE IRS AND THE 
SUBSCALES OF THE IRS 


Stale IRS Race | Religion |Nationality ATDP 
+ 

IRS —.52** 

Reve +.82%* IRS-Race | — A450 
Religion +.82** | -4.59 IRS-Religion — .40** 
Nationality | +.88** | -4.69 | --.64** IRS-Nationality — 43%" 
Social Class| +.81%°* | -+.51* | -+.58** | -++,53%* IRS-Social Class —.46* 

** p <01. Dp <01. 


Both the IRS and the ATDP were administgred 
to samples of 77 Hofstra University students en- 
rolled in introductory psychology courses, and 243 
high-school students attending a week long con- 
ference on human relations. The National Conference 
of Christians and Jews (1960) sponsors this human 
relations program in the expectation that some 
individuals’ attitudes will change and other indi- 
viduals can be provided with the means for the ex- 
pression and fruition of their values, Therefore, the 
high-school students are probably atypical of the 
adolescent school population in terms of their in- 
terests in intergroup problems, and their leadership 
activities in school or community organizations, The 
high-school population was asked to respond to 
the instruments immediately upon their arrival at 
the conference site, prior to their exposure to the 
NCCJ’s education program. 


RESULTS AND Discussion 


The high school and college populations did not 
differ significantly in their scores on the two 
scales. The mean for the college population on 
the IRS was 39.64, for the high-school popula- 
tion it was 41.64 (ms). On the ATDP scale the 
college mean was 75.44 and the high school mean, 
77.21 (ns). We have already noted that these 
high-school students are atypical of the adoles- 
cent school population and are probably more 
like the collegiate sample than a random sample 
of high-school-age youngsters. In view of these 
findings, all further operations combine these 


TABLE 2 


ATDP Means AND CRITICAL Ratios BY SEX AND 
Contact WITH DISABLED PERSONS 


N M t 
Sa i 
ale 
Female in Wage 2.54% 
Co with disabled : 
now disabled persons 103 
Don’t know disabled persons | 217 EY 2.59 


subpopulations into a single, ‘population with an 
N of 320. 
An index of the reliability of the IRS was 


sity students. The uncorrected Spearman-Brown 
split-half reliability coefficient is +.84, “when 
corrected for attenuation, +.91. This reliability 
coefficient is comparable with the similar cor- 
rected coefficients for the California Anti-Semi- 
tism Scale (+.92) and the California E Scale 
(+.91) (Adorno et al., 1950, pp. 73, 112). Cron- 
bach (1951) has suggested the coefficient alpha 
as a homogeneity or internal consistency meas- 
ure. The coefficient alpha can be applied to sub- 
tests in the following manner: k 


n [ PAA | 
a= 1- 
n—1 


V test’ 

where V is the variance and » is the number of 
subtests. The coefficient alpha obtained for the 
IRS is +.66, indicating, roughly, the amount of 
variance attributable to a single factor. ,These 
manipulations suggest there is considerable homo- 
geneity among responses to the items in the IRS. 
Persons who respond a certain way to one out- 
group are likely to respond in a similar fashion 
to other outgroups. 

The correlation matrix in Table 1 presents the 
product-moment correlations among the four sub- 
scales of the IRS, and between each of the sub- 
scales and the full scale. Intercorrelations among 
the four subscales range from +.51 to +.69, 
exhibiting moderate to high degrees of positive 
relationship. The correlations between the sub- 
scales and the full scale range from +.81, to 
+.88, representing a high degree of relationship. 


All of these correlation coefficients are statisti- , 


cally different from zero (p < .01). 
The indication that measures of all four di- 
mensions of ethnocentrism—racial, religious, na- 


tionality, and social class—correlate positively’ 


significantly with each other and with the total 
scale is a direct confirmation of the initial hy- 
pothesis. These findings suggest that individuals 


obtained for the subsample of Hofstra Univer- p” 


1 


who express ethnocentrism toward racial groups, 
are also likely to express such attitudes toward 
refigious groups, toward nationality groups, and 
‘towards social class divisions. Both interitem and 
jnterscale correlations indicate that ethnocen- 
trism is expressed to outgroups in general, rather 
than to only specific types of outgroups. 

In all the above cases, the social class sub- 
sgale showed the lowest intercorrelations. The 
reason may be that social class is probably the 
least obvious and most fluid interpersonal cate- 
gory. Races can be distinguished by their physi- 
cal characteristics, ‘and religious and nationality 
groups are usually identéfiable by distinctive cus- 
toms and traditions. The social class groups are 
probably not so easily identifiable, and therefore 
not as pinpointed as more distinctive outgroups. 
In the American*culture, it is probably easier for 
a person to change his social class than to change 
on any one of the other three dimensions. Indi- 
viduals can shift from one class to another, and 
therefore the membership of a given class can- 
not be identified quite as consistently as can 
members of other outgroups. 

Yuker et al. (1960) report that females score 
higher on the ATDP than do males and persons 
who have had contact with disabled persons tend 
to score higher than persons who have not had 
such contact. Data from the present study which 
are relevant *to these findings are presented in 
Table 2. 

Table 2 substantiates the findings of Yuker et 
al. (1960). Females scored significantly higher 
on the ATDE than males (p < .05), indicating 
greatet acceptance of the disabled outgroup mem- 
bers. Further, people who have had contact with 
disabled individuals scored significantly higher, 
that is, showed greater acceptance, than persons 
without such contacts (p< .01). 

Table 3 presents product-moment correlations 
between the IRS and the ATDP. All correlations 
are in the hypothesized direction, since high 
scores on the IRS indicate ethnocentrism and 
low scores on the ATDP indicate rejection of the 
disabled. The full IRS manifests the greatest co- 
efficient of correlation with the ATDP, —.52. 
The subscale correlations range from —.40 to 
—.46. All these correlation coefficients are sta- 
tistically different from zero beyond the .01 


te level, and thus support Hypothesis 2. 


These results suggest that for some purposes 
the physically disabled can be conceptualized as 
a minority group subject to many of the same 
attitudinal and behavioral predispositions as are 
ethnic minorities. Our findings regarding general 
categories of disabled persons support Himes’ 
(1960) dnd Handel's (1960) results regarding 
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particular types of physical handicaps. The re- 
sults also support those authors who suggest that 
ethnocentrism, or prejudice, is a general phe- 
nomenon expressed towards a wide variety of 
outgroups and is not narrowly focused on ofie or 
another particular minority group. * 
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DEMAND CHARACTERISTICS VERSUS DISSONANCE REDUC- 
TION AS DETERMINANTS OF FAILURE- 
SEEKING BEHAVIOR 


IRWIN SILVERMAN ano CHARLES MARCANTONIO a 
State University of New York at Buffalo 


Based on Ward and Sandvold’s reinterpretation of Aronson and Carlsmith’s 
findings, the failure-expectancy condition of the latter study was replicated 
with reliability-validity emphasis included for one group and excluded for 
another. In contrast to Ward and Sandvold’s Position, the present data sug- 
gested that the reliability-validity emphasis increased success-seeking behavior 
at the expense of consistency-seeking behavior, An alternate explanation of Di 


the differences between Ward 
findings is offered. 


Aronson and Carlsmith (1962) performed an 
experiment, based on dissonance theory, in which 
they demonstrated that subjects who were led to 
expect that they would either succeed or fail on 
a given test altered their performance to conform 
to these expectancies. Ward and Sandvold 
(1963) contended that consistency-seeking be- 
havior on the part of the subjects in the Aronson 
and Carlsmith study may be interpreted as a 
response to demand characteristics (Orne, 1962) 
rather than a mechanism of dissonance reduction, 
inasmuch as these authors emphasized to their 
subjects the reliability and validity of the test. 
Ward and Sandvold performed a partial replica- 
tion of the study in which experimenter demands 
were excluded by eliminating that portion of 
Aronson and Carlsmith’s introduction to the 
subjects which stressed reliability and validity. 
Subjects in the Ward and Sandvold study altered 


and Sandvold’s and Aronson and Carlsmith’s 


their performance in the direction of achieving 
greater success, regardless of their expectancy. 

It is considered that though these findings are 
Suggestive of a demand-characteristic bias in 
the Aronson and Carlsmith study, more con- 
clusive support for Ward and Sandvold’s inter- 
pretation requires a replication in which ‘it, is 
demonstrated that consistency-seeking behavior 


is obtained with reliability and validity emphasis ~ 


included and success-seeking behavior is obtained 


with this emphasis removed. The need for this _ 


type of replication, which was attempted in the 
present study, is particularly salient in light of 
the fact that the paradigm used by Ward and 
Sandvold differed from Aronson and Carlsmith’s 
in a number of aspects other than those consid- 
ered relevant to the demand-characteristics 
interpretation, 2 


a 


| before th 
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METHOD 


Forty female students, randomly selected from the 
four sections of the introductory psychology course 
“at the investigators’ institution, served as subjects 
for this investigation to meet the course requirement 
for participation in ongoing research. One-half of 
these subjects were arbitrarily assigned to the “reli- 
ability-validity” condition. Here the attempt was 
made to duplicate precisely the procedure of Aron- 
son and Carlsmith’s failure-expectancy treatment, in 
which success-oriented and consistency-oriented be- 
haviors are rendered mutually exclusive. Subjects 
were igformed*that they were to take a test of 
“social sensitivity” ' which they were to (choose 
which one of three®phetographs was that of a 
schizophrenic, After each of the first four series of 
20 trials, subjects were told that they had obtained 

@ scores of 5, 4, 4, and 5 correct, respectively. For the 
fifth series, one-hglf of the group were given scores 
of 5 “consistent feedback), and the others received 
scores of 17 (inconsistent feedback). All subjects 
were required to take the fifth trial again on the 
pretense that the experimenter had forgotten to 
time their performance. The remaining 20 subjects 
were, similarly divided into consistent and incon- 
sistent groups and administered the Aronson and 
Carlsmith procedure, with the exception that the 
statement by these authors dealing with the relia- 
bility and validity of the test, which is reproduced 
Below, was excluded from the introduction. 


This test has been widely used with remarkable 
success by psychologists for several years. More- 
over, in my own work, thus far, it has proved to 
be the most useful of all the short, objective tests 
Ive tried. It is an excellent measure of how 
sensitive an individual is to other people; i.e. the 
subjects whoe score high on this test are the same 
people who, when interviewed, express a good 
deal of understanding and insight into other 
people. Subjects who score low on this test, on 
the other hand, tend to express a very superficial 
understanding of other people when interviewed 


oe [p. 179]. 


If Aronson and Carlsmith’s interpretation was 
supported, it was expected that the criterion score, 
that is, number of judgments changed on the retest 
of the fifth series, would be higher for the incon- 
sistent feedback group than for the consistent feed- 
back “group, independent of the reliability-validity 
cgnditfons. Ward and Sandvold’s hypothesis, on the 
other hand, would lead to the prediction that a con- 
verse effect would be obtained with the reliability 
and validity statement excluded. 

% The second author conducted all of the experi- 


* mental sessions, alternating, one subject at a time, 


among the four conditions. At the conclusion of the 
session, the subject was informed of the actual pur- 
“pose of the experiment, which none had suspected, 
and was requested not to divulge this information 
for a prescribed period of time. When questioned 
session, none of the subjects indicated 
knowledge about the study. 
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TABLE 1 
Mean CHANGE Scores 

Feedback on fifth trial 

Reliability-validity o 
statement 

Inconsistent Cénsistent 

Included 7.0 9.2 

Excluded 7.2 5:5 


RESULTS AND DISCUSSION 


The mean change scores for the four condi- 
tions of this study appear in Table 1, and the 
summary of the analysis of variance is presented 
in Table 2. In terms of the simple effects, sta- 
tistical significance is obtained solely for the 
difference between reliability-validity groups 
under the consistent feedback condition (¢= 
3.43, p< 01). 

These results do not conform to either a dis- 
sonance-reduction or a demand-characteristics 
explanation and are directly opposed to predic- 
tions based on the latter. The interaction effect 
obtained here suggests that subjects in this in- 
vestigation exhibited success-seeking behavior 
with the reliability-validity statement in the in- 
troduction, and consistency-seeking behavior when 
this statement was removed. These data strongly 
indicate that Ward and Sandvold’s findings were 
not due to the exclusion of reliability-validity 
emphasis. In contrast, in the present study this 
emphasis appeared to increase the ego involve- 
ment of the subjects in the task and thus in- 
creased the strength of the motive to succeed at 
the expense of the motive to maintain consist- 
ency. 

The problem remains, however, of how to 
account for the fact that Aronson and Carlsmith 
observed consistency-seeking behavior with the 
reliability-validity statement included, while 
Ward and Sandvold obtained success-seeking 
behavior with this statement absent. Here we 
consider that there were differences between 
these studies and between these studies and the 
present study, which may have affected the rela- 
tive strengths of the two opposing motives and 


TABLE 2 
SUMMARY OF ANALYSIS OF VARIANCE 


Source SS df MS F 
Statement (A)| 30.63 1 30.63 5.76* 
Feedback (B) 63 1 63 
AXB 38.02 1 38.02 7.15* 
Within 191.70 36 5.32 

Total 260.98 39 
+p <.05. 


884 


which may have outweighed the factor of relia- 
bility-validity emphasis. It may be feasibly 
contended that the nursing students of the Ward 
and Sandvold study were more concerned with 
their level of social sensitivity than the random 
selection ef undergraduates employed by Aron- 
son and Carlsmith. In addition, the subjects of 
the former investigation were recruited from a 
psychology course, which may have further en- 
hanced the value of demonstrating high sensi- 
tivity. The absence of these components in the 
Aronson and Carlsmith situation may have low- 
ered the motive to succeed to the extent that the 
motive to maintain consistency became nore 
salient, Further, the possibility remains that 
consistency seeking is based on the subject’s per- 
ception of experimenter demands rather than 
dissonance reduction, though the present findings 
indicate that these demands are not defined by 
the reliability-validity statement. On the basis of 
this possibility, the point raised by Ward and 
Sandvold (p. 203) is relevant, that Aronson and 
Carlsmith’s subjects were paid for their partici- 
pation and thus were more highly motivated to 
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maintain consistency by a sense of obligation to 
the experimenters. Subjects of the present in- 
vestigation were recruited from a psychology 
course, but were not nursing students, nor were 
they paid for their participation. The balance of 
these factors may have rendered the relative 
strengths of the consistency motive and success 
motive more equivalent, and thus the effect of 
the reliability-validity statement became ap- 
parent. 
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MAGNITUDE OF POSITIVE AND NEGATIVE REINFORCEMENTS 
AS A DETERMINANT OF ATTRACTION * ; 


DONN BYRNE anp RAY RHAMEY 


University of Texas 


It was hypothesized that positive and negative reinforcements concerning per- 
sonal attributes are of greater magnitude and hence exert a greater effect on € € 
attraction than reinforcements involving attitude similarity-dissimilarity with 
respect to impersonal topics, In a 4 X 3 factorial design, 4 levels of attitude 
similarity (1.00, 67, 33, and .00) and 3 types of evaluation conditions (posi- 
tive, negative, and control) were employed. Each of 180 Ss was asked to 
read an attitude scale purportedly filled out by an anonymous stranger and to 
evaluate him ona number of variables, including attraction. In the positive and 
negative conditions, Ss also received information concerning the stranger’s 
evaluation of them. Analysis of variance indicated that attraction was sig- 
nificantly influenced by both independent variables (p < .001) and by their 
interaction (p < .01). The effect of the personal evaluation items was found 
to be greater than that of the impersonal items (p <.001). It was suggested that e 

the equation for the law of attraction should include a weighting factor to A 
designate the effect of reinforcement magnitude. 


It has recently been shown (Golightly & Byrne, 
1964) that similar and dissimilar attitude 
statements may effectively be used as positive 


1This research was supported in 
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261-64 from the Air Force Office of Scientific Re- 


search of the Air Research and Development Com- 
mand. 


and negative reinforcements in a discrifnination < 
learning task. This finding lends strong support < 
to the conceptualization of the reinforcement 
function of attitude similarity-dissimilarity with 
respect to the learned drive to be logical and to 
interpret the world correctly (Byrne, 1962; 
Byrne & Nelson, 1965; Newcomb, 1956). 

It is further proposed that attraction is â 


kz 


oe 


` 


function of reinforcement magnitude and that 
the magnitude of the reinforcement of the drive 
te be logical and correct should vary with topic 
»importance. In experimental investigations of 
the effect of attitude similarity-dissimilarity on 
attraction, the assumption is often made that 
topic importance is a crucial parameter. New- 
comb (1961), for example, restricts his index 
of agreement between two individuals to those 
topics which are important to both. As Newcomb 

(1956) suggests, “The discovery of agreement 
between oneself and a new acquaintance regard- 
ing sOme®* matter „of only casual interest will 
probably be less mewarding than the discovery 
of agreement concerning one’s own pet preju- 
dices [p. 578].” 

.As q test of this proposition, Byrne and 
Nelson (1964) ascaled a group of 56 attitudinal 
item$ in terms of importance and then divided 
them into four scales ranging from the 14 least 
important (e.g., gardening, modern art) to the 
14 most important (e.g., God, American way 
of life) topics. When subjects responded to a 
stranger who was either completely similar to or 
completely dissimilar from them on 14 topics, it 
was found that attraction was influenced only by 
similarity-dissimilarity and not by topic impor- 
‘tance or the interaction between the two inde- 
pendent variables. One of the suggested reasons 
for the ineffectiveness of the importance (and 
hence magnitude of reinforcement) variable was 
the possibly restricted range between the least 
and most important topics. 

In an attempt to find a type of reinforcement 
whick would clearly be of greater magnitude 
than that represented by agreements on imper- 
sonal topics, a different sort of stimulus was 
selected. There is considerable support for the 
proposition that the motive to maintain and 
enhance one’s self-concept is a powerful one. 
Thus, when an individual receives positive or 
negative evaluations of himself from another 
person, it seems safe to assume that rewards 
and punishments of a relatively large magnitude 
are being administered. Even if the self is 
simply conceptualized as another possible topic 
about which attitudes may differ, this particular 
topic should be of greater importance than im- 
personal oñes such as politics; for the vast ma- 
jority,of subjects positive and negative evalua- 
tions would constitute, respectively, attitudes 
similar and dissimilar from his own. This would 
assume, of course, that extremely low self-esteem 

“ is rare or that positive and negative evaluations 
by others have the same effect on low and high 
self-esteem individuals. It should be noted that 

« experimental investigations of attraction have 


\ 
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utilized as positive reinforcements both attitude 
similarity regarding impersonal topics (eg., 
Byrne, 1961; Jones & Daugherty, 1959; Rokeach, 
1960, 1961; Schachter, 1951; Smith, 1957; Stein, 
Hardyck, & Smith, 1965; Triandis, 1961) and 
positive evaluations of personal attributes (e.g., 
‘Aronson & Linder, 1965; Keislar, £961; Wal- 
ster & Walster, 1963), but the two types of rein- 
forcements have not been systematically com- 
pared. 

The present investigation is designed to test 
the hypothesis that positive and negative rein- 
forcements concerning personal attributes exert 
a greater effect on attraction than do reinforce- 
ments involving attitude similarity-dissimilarity 
with respect to impersonal topics. 


METHOD 


The procedure for obtaining the subjects’ attitudes 
on a series of topics and later presenting them with 
the attitudes of a stranger on the same topics has 
been described previously (Byrne, 1961, 1962). 
Briefly, a 12-item scale labeled Survey of Attitudes 
was administered during a regular class period. Re- 
sponses to each item were made on a 6-point scale, 
and the topics were of varying degrees of impor- 
tance, including integration, politics, gardening, birth 
control, dating, and the advisability of freshmen 
having cars on campus. After taking this scale, the 
180 subjects (89 male and 91 female introductory 
psychology students) were assigned on a random 
basis to one of four groups with respect to the 
proportion of similar attitudes held by the stranger 
(1.00, .67, 33, and .00) and to one of three groups 
with respect to the stranger’s evaluations of their 
personal attributes (positive, negative, and control). 
Thus, there was a 4X3 factorial design with 15 
subjects per cell, divided approximately evenly with 
respect to sex. 

About a week after taking the Survey of Atti- 
tudes, the subjects were seen in small groups for the 
experiment itself. As in previous studies, subjects 
were told that the experiment concerned the accu- 
racy with which individuals can make interpersonal 
judgments on the basis of limited information. They 
were supposed to receive an attitude scale filled out 
by an anonymous stranger (a student of the same 
sex as themselves), read that person’s responses to 
the 12 attitude scale items, and then make several 
judgments about him or her. Each subject received 
a scale which was purported to be that of a stranger 
but which was actually a bogus one constructed 
by the experimenter so as to constitute the desired 
similarity-dissimilarity ratio for each subject. After 
reading the “stranger's” responses, each subject rated 
him on the Interpersonal Judgment Scale (Byrne, 
1961, 1962) which consists of six 7-point scales 
concerning intelligence, knowledge of current events, 
morality, adjustment, and the two attraction scales 
(liking and desirability as a work partner) which 
are combined to yield a measure of the dependent 
variable (Byrne & Nelson, 1965). 
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TABLE 1 
MEANS AND STANDARD DEVIATIONS OF ATTRACTION SCORES 
= 
Evaluation conditions 
cnet Positive Negative Control Total 
M SD M SD M SD M SD 
i -88 8.40 2.75 12.60 1.20 11.38 2.78 
“or 1300 89 5.87 3.03 12.00 1.46 10.29 3.74 
33 10.73 1.98 4.87 2.63 7.53 2.58 7.71 3.40 
‘00 10.33 1,99 3.47 1.78 6.27 2.72 6.69 3.58 
Total 11,80 2.00 5.65 3.16 9.60 3.46 fi pa? 
D s; z? 


For the subjects responding to the four levels of 
similarity in the control condition, no further in- 
structions were given. For those assigned to the 
positive and negative evaluation conditions, how- 
ever, the following additional information was 
provided before they were given the stranger’s atti- 
tude scale: 


The attitude scale which you filled out, with all 
background information including your name re- 
moved, was given to another student who was 
asked to give these judgments about you. We 
thought it might be interesting for you to know 
how he or she judged you and if this additional 
information would affect the accuracy of your 
judgments about him, 

Each of you will receive the attitude scale of 
one of these people, and that person’s opinion of 
you, For both the attitude scale and the judgment 
scale, the background information has, of course, 
been removed. As far as I know, you do not 
know this person, he or she is the same sex as 
yourself, and he is not in your psychology class. 

First, look over his or her judgments about you 
on the Interpersonal Judgment Scale, and then 
read the answers to the attitude items carefully 


and try to form an opinion of this person from 
this information. . . . 


much aboye average in knowledge of current events, 
someone who would be 


TABLE 2 
ANALYSIS OF VARIANCE OF ÅTTRACTION SCORES 


Source af MS F 
faaea E R 
Attitudes (A) 3 206.60 41.074% 
Evaluation (B) 2 549.54 109.25 
AXB 6 19.02 3.78 
Within 168 5.03 
*p <.01. 
**p <.001. 


liked very much and with whom he would very 
much enjoy working, Those in the negative evalua- 
tion group were informed that the stranger consid- 
ered them to be below average in intelligence, slightly 
below average in knowledge of current events, im- 
moral to a slight degree, maladjusted to a slight 
degree, and as someone who would be «disliked 
very much and with whom he would very much dis- 
like working. In the small experimental groups the 
subjects in the positive and negative evaluation zon- 
ditions were randomly mixed together, but those in 
the control condition had to be run separately 
because of the differences in the instructions. 

» 


RESULTS 

The means and standard deviations of the 
attraction scores are given in Table’l. It should 
be noted that 2 is the most Negative score pos- 
sible and 14 the most positive score. Analysis of 
variance was employed on both raw and trans- 
formed data? with no difference in the signifi- 
cance levels of the Fs, The summary shown in 
Table 2 is based on the raw data, As may be 
seen, both independent variables and the inter- 
action between them exerted a significant effect 
on attraction, 

Even though the evaluation conditions exerted 
a significant effect on attraction, this does not 
test the hypothesis specifically. Each of the six 
items of information about personal attributes -is 
conceptualized as a positive or negative rein- 
forcement as is each of the 12 attitude items 
dealing with impersonal topics. Thus, each, sub- 
Ject in the evaluation conditions was exposed tó 
18 reinforcements, and each subject in the 
control condition received 12 reinforcements. The 


?Even though nonhomogeneity of yariance has 
been shown to have little effect on both the alpha 
level and the power of the F test (Young & Veld- 
man, 1963), Hartley’s maximum F ratio test for 
heterogeneity of variance (Walker & Lev, 1953) was 
computed and Fax = 11.86, p < .01. With each score 
transformed into its Square, the departure from 
homogeneity is not Statistically significant, » 


TABLE 3 


PROPORTION OF PosiTIVE REINFORCEMENTS BASED ON 
« a COMBINATION OF EVALUATION ITEMS 
AND ÅTTITUDE ITEMS 


a Evaluation conditions 


Proportion of 
similar attitudes 


es | i e 
180 (12+,0—)| 1.00 67 1.00 
(67 (8+,4—) 78 ‘Ad ‘67 
"33 (4+4-,8—) ‘56 22 ‘33 
00 O+4,12—-)}o -33 00 ‘00 


co 
o 


proportion of positive "reinforcements received 

by each subject is shown in Table 3. With a sam- 
@ple of 790 subjects, Byrne and Nelson (1965) 
found that the dunctional relationship between 
aa proportion of similar attitudes and attraction is 
described by the formula Y=5.44 X + 6.62. 
Presumably, that formula may be used to pre- 
dict attraction in the present investigation. In 
the control condition, the attitude items were of 
the same type as used in the samples from which 
the formula was derived. If the hypothesis is 
correct concerning the greater effect of the per- 
» sgnal evaluation items on attraction, however, 
the obtained attraction scores would be higher 
than predicted in the positive condition and 
lower than predicted in the negative condition. 
The reason, of course, is that the proportions 
shown in Table 3 are based on the assumption 
that each item is of equal weight. 

Departure from predicted values was tested in 
the following anner. The 60 subjects in each 
condition were divided into those whose attrac- 
tiom responses were above and below the pre- 
dicted value. In the control condition, there 
_ should be a 50-50 split; there were 28 individuals 

' responding below the predicted value and 32 
responding above (z= .52, ns). Departures from 
the 50-50 split in the predicted directions in the 
evaluation groups would confirm the hypothesis. 
In the positive condition, 13 responded below the 
predicted value and 47 above (z= 4.39, p< 
» 004), and in the negative condition, the fre- 
i“ , quencits were 50 below and 10 above (z= 5.17, 

«  p<,001), As hypothesized, the reinforcements 

. provided by “the evaluation items had a signifi- 


i «cantly greater effect on attraction than the rein- 
«+ fortements provided by the impersonal attitude 
4q-— items. 


E A DISCUSSION 


il = Within the limits of the present research de- 
sign, evaluation of personal attributes may be 
æ conceptualized as a variable which influences 


9 
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attraction more strongly than attitude similarity- 
dissimilarity concerning impersonal topics. It is 
proposed that the reason for these differential 
effects lies in differences in reinforcement mag- 
nitude. Is it possible to include the magnitude 
variable within a more general framework? 

The law of attraction (Byrne & Nelson, 1965) 
has been tentatively given as 


pe PR: 
” (PR, + NR) 


or attraction toward X is a positive linear func- 
tion of the proportion of positive reinforcements 
A 2 > . 
received from X. Since reinforcement magnitude 
is hypothesized to have an effect on attraction 
over and above that of number of reinforce- 
ments, and since the data support that hypothe- 
sis, it would be logical to include a weighting 
factor in the equation to designate the effect of 
reinforcement magnitude. It is highly probable 
that in attraction research other reinforcements 
will be found to be of smaller value than those 
involved in the impersonal topics and still others 
of greater value than ratings on personal at- 
tributes. If so and if such weights can be em- 
pirically established, the attraction law should be 
rewritten as 


G = m( D(PRz X M) )4 7 
i (PR. X M) + 3(NRz X M)/ 


or attraction toward X is a positive linear func- 
tion of the sum of the weighted positive rein- 
forcements (Number X Magnitude) received 
from X divided by the total number of weighted 
positive and negative reinforcements received 
from X. 

It would, of course, be preferable to obtain 
the weighting coefficients in some manner inde- 
pendent of their effect on the dependent varia- 
ble. Until an appropriate methodology is devised 
to deal with this problem, we must work back- 
ward from the data in order to arrive at a post 


Az +k 


TABLE 4 


PROPORTION OF POSITIVE REINFORCEMENTS BASED ON 
WEIGHTED (M = 44) COMBINATIONS OF EVALUA- 
TION ITEMS AND ATTITUDE ITEMS 


Evaluation conditions 


Proportion of 


similar attitudes 


iti 
aton | opsie | Conte! 
1.00 (12+,0-)| 1.00 04 1.00 
‘67 BLA) | 99 ‘03 ‘67 
33 448—) | 97 ‘01 133 
400 (O+12-)| 96 ‘00 ‘00 
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TABLE 5 
PROPORTION OF POSITIVE REINFORCEMENTS BASED ON 
WEIGHTED (M = 3) COMBINATIONS OF 
EVALUATION Irems AND ATTITUDE 
ITEMS 


x Evaluation conditions 


Proportion of 
similar attitudes 


easy LR Control 

1.00 (12+,0—)| 1.00 40 1.00 
.67 (8+,4—) 87 27 67 
.33 (4+,8—) 73 13 33 
.00 (0+,12—) .60 .00 .00 
-> 


hoc approximation. In the present study, two 
approaches were explored. The most obvious is to 
begin with the empirically derived Byrne-Nelson 
formula and apply weights to the six personal 
evaluation items prior to determining propor- 
tion of similar attitudes until the obtained at- 
traction scores do not differ significantly from 
the predicted attraction scores. Thus, each im- 
personal topic item was assigned a value of 1 
and the six evaluation items were multiplied by 
2, by 3, by 4, etc., until the two types of items 
were equated in terms of their effect on attrac- 
tion. Following this procedure, it was found that 
44 is the smallest whole number that can be ap- 


ATTRACTION 
o 
o 
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PROPORTION OF WEIGHTED 


Fic. 1. Attraction toward strangi 
positive reinforcements based 
and attitude items. 
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plied as a weighting coefficient and yield non- 
significant departures from the predicted values. 
The proportion of weighted positive reinfozce- 
ments received by each subject is shown in Table 
4. With predictions based on these new propor- 
tions, in the positive condition there are 25 sub- 
jects falling below the predicted value and 35 
above (z= 1.29, ms), and in the negative condi- 
tion there are 36 below and 24 above (z= 1.55, 
ns). Obviously, the control condition remaths 
the same. 

There is reason, however, to take a different 
approach to the weighting problem. The original 
analysis of variance yielded, in addition to sig- 
nificant main effects, a significant interaction. 
Among other possibilities, this finding suggests 


” 
that different constants may describe the rela- 


tionship between proportion »f positive, rein- 
forcements and attraction when the reinforce- 
ments include personal evaluation items rather 
than just impersonal items. The task, thén, would 
be to find a weighting coefficient which would 
yield a linear function. I$ was found that a fairly 
straight line could be achieved by weighting the 
evaluation items on a three to one basis com- 
pared with the attitude items. The recomputa- 
tion of proportions based on this weighted basis 
is shown in Table 5. Attraction was plotted as a 


Y=10.17X+364 


60 70 


A -80 
POSITIVE REINFORCEMENTS 


nger as a linear function of Proportion of 
on weighted combination of evaluation items 


-90~ 1.00 


ir 


s 


4 
a 


BRIEF ARTICLES 


function of these weighted proportions, as shown 
in Figure 1, and a straight-line function was 
fitted to the data by the least squares method. 


It'is obvious by inspection that the data fit the 


curve very closely. The solution yielded the 
formula Y = 10.17 X + 3.64. 

Another aspect of the findings should be noted. 
There are now three sets of data which have 
yielded a linear relationship between proportion 
of positive reinforcements and attraction. With 
the same dependent variable, three somewhat 
different types of positive and negative reinforce- 
ment stimuli have yielded somewhat different 
constants: with ratings of creativity of ,TAT 
stories Y= 2.98 X°+8.47 (McDonald, 1962), 
with attitude similarity-dissimilarity on imper- 
sonal topics Y = 5.44 X + 6.62 (Byrne & Nel- 
son, 196%), and with a combination of similarity- 
dissimilarity on ‘impersonal topics plus positive 
and negative evaluations on personal attributes 
in the present study Y= 10.17 X + 3.64. The 
increasing steepness of the function across these 
three types of conditions could represent merely 
randem variations across samples. It is also 
possible that the difference is due to some other 
variable which is operating. It does not seem un- 
reasonable to propose that the progression from 
@eativity ratings to attitude similarity to ratings 
of personal attributes is one of increasing drive 
level. If this is an accurate characterization, the 
experimental ‘manipulation of motivation (inde- 
pendently of the content of the positive and 
negative reinforcements) should have an effect 
on the attraction function. That is, as drive 
level increases, the slope of the line should in- 
crease “and the Y intercept should decrease. 


© e 
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COMMITMENT, ATTRACTION, AND CONFORMITY 


CHARLES A. KIESLER? 


Yale University 


AND 


LEE H. CORBIN o 
Ohio State University 


“This experiment was designed to test 2 hypotheses: (a) when Ss are not com- 
mitted to continue in a group, there is a monotonic relationship between at- 
traction to the group and conformity to group norms; the less S is attracted, 


the less he will conform; (b) when Ss are committed to coii/aue_ gs member 
of a group, there is a nonmonotonic relationship between attr 


ahd con- 


formity. The less S is attracted, the less he will conform “74n “to a crucial 


point in attraction. Beyond this point decreased attraction “will result in in- 
creased conformity. Both hypotheses were confirmed. In addition it was 


ascertained that the low attractjon, committed Ss, after conforming, sub- 
? 


sequently increased their attraction to the group. 


Generally one finds that the ‘less attracted an 
individual is to a group, the less susceptible he 
is to group pressures to conform to group norms 
(see Hare, 1962). Recently, however, Kiesler 
(1963) demonstrated that, under some condi- 
tions at least, a nonmonotonic relationship exists 
between attraction to the group and conformity 
to group norms, That is, he found that highly 
attracted people conformed the most, and less 
attraction produced less conformity down to a 
critical point in attraction. When attraction to 
the group was less than this critical value, the 
result was relatively heightened conformity. 

Kiesler’s hypothesis was empirically derived, 
based on relatively unnoticed occurrences of this 
phenomenon in the literature (e.g., Hughes, 
1946; Sherif, 1951). The experiment was really 
a demonstration and, as such, offers no firm evi- 
dence either why the nonmonotonic relationship 
would occur, or under what conditions it would 
occur. The present experiment is an 2'‘empt to 
provide some evidence on the latter p 4; 

As Kiesler has suggested, the question ù? Jhe 
parameters underlying the effect breaks down to 
one of the psychological state of the people in 
the low-attraction groups. If the less attracted 
groups conform, the relationship is nonmono- 
tonic. Tf they do not conform, the relationship 
is monotonic, The problem for analysis then is 
the low-attraction subjects, 

One variable stands out in those studies in 
which the nonmonotonic effect has occurred. In 
all of these studies, subjects were committed, or 
expected, to continue as a member of the group 
for some time. From this, one might hypothesize 
that low-attraction subjects will conform only 
when there are strong barriers against leaving 
the group. One might describe the psychological 


1 This earch was carried out while the first 
author wa¥%%eOhio State University. 


state of the subject in the following way: the w 


subject must continue as a member of tile group, 
but not only is he unattracted to the other*mem- 
bers of the group, but he subsequently--© orte“ys 
that he is in strong disagreeys/ RA is ae 
on an important topic 7 iod n isisi ##OWl- 
edge that the subjes tsyg q,teced to continue 
as a member of the grdup is inconsisteptyyvisd) 
the knowledge that he does not likes". di 
and that he disagrees withsit «tiiri: asilieoriwe 
that this “inconsiste@ > oe Yeduiced in two 
ways (eg. Festingéi,“1957). The subject cout 
psychologically reject the group, or he could 
come to “agree” with the group. Either act pre- 
sumably could reduce the inconsistency, How- 
ever, if the subj ommitted to continue as 
a member of tl for some time it would 
be rather diffic Wue or reject the group. 
The alternative ax the subject to reduce 
the inconsistency is to conform, in an effort to 
be more like the members of the group, and 
perhaps even convince himself that he is really 
attracted to the group. However, for a subject 
who is not so committe" continue in the 
group, it would be easier 0. psychologically re- 


ject the group as a method of, reducinet ite in- 
consistency. Therefore, gsf 34, neg sie 
attraction subjects who dre Nevins co" eon- 


tinue in the group to be gif!) = jo'to conform 
than subjects not so com, eu. 4 

Thus, from this preliminary and rather un- 
sophisticated type of analysis, one may now 
State the hypotheses for the present experiment. 
L When subjects are not committed to con- 
tinue in a group, a monotonic relationship exists 
between attraction to the'group and conformity 
to group norms. The less the subject is attracted 
to the group, the less he. will conform to its 
norms. € 

2. When subjects are committed to continue 


ma group, a nonmonotonic relationship exists 
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between attraction to the group and conformity 
to group norms. Decreasing attraction results in 
degreasing conformity down to a crucial point 
in attraction. Below this crucial value, decreased 
«attraction will result in relatively heightened 
« conformity. 


METHOD 
Subjects 


Subjects, numberifg ` volunteered to partici- 
pate in six-man discussion as a part of the 
À requirements for the introducto:, psychology course. 
° The subjects it each group did not know each 
other personally. The advertised purposes of the 
study were to study %rosp process (how strangers 
work out different group tasks), and to study 
* decision-making processes of groups. 
a . 
Procedure ° 


o 
ao Generally, the procedure was very similar to that 
pas ~"*4063), The subjects were told of the 
\ bd Tr-titute for Small Group Re- 
search 5 u of strangers worked 
out certain tasks. `a.. ..oup's function was to rate 
™ abstract paintings’ placed around the room. 
-yere told that prior to these discussion 
gioups az hed been rated by a panel of 
experts and it sas purpose to arrive at 
aerating as close as possiu. to the one given by 
this panel. 

On request, all subjects committed themselves for 
3 additional heurs for which they had a chance to 
Wi 9% ner oscsion and a bonus of $25 for the 
group Vin. rr posite score on the four 
tasks, The suvyects ‘i. ie groups were told that 
they had a 5% or 1 in zv uance of changing groups 
if dissatisfied, while the other half was told they 
had a 95% or 19 in 20 chance of changing groups. 
The intent was to have half the subjects feel they 
had*committed themselves to continue in the same 
group. 

Then each suhte- wa asked to rise, introduce 
P himself to the group, a | tell a little about himself. 
f „Until names could be learned, each subject could 
* be idem," hy, ‘se number on his desk. Next, the 


4 


N 


sui‘) discuss o oi five paintings. The dis- 
cussion - arounu objective aspects of the 
paintings, su. ‘-+, form, and so forth, The 


r “ee tola. . soid any statements indi- 
ca relative preferences among the paintings. 
i. The experimenter made certain each of the subjects 
y participated in the discussion. It was then an- 

nougced that Since they were competing with other 

groups for the prizes, it was felt that each subject 
i should h&ve a say about who was to be included 

in the final analysis of the group. For this reason, 
each subject was asked to indicate his opinion on 
two 11-point rating scales about each other subject 
ih the group. One scale asked, “How much do you 
like this person?” and the other, “How much do you 
think this person would contribute to the group in 
arriving at,the correct decision?” 


891 


After the scales were completed, and while the 
subjects were discussing Paintings 6 and 7, the ex- 
perimenter leafed through these ratings. After Paint- 
ing 7 was analyzed, the experimenter mentioned 
casually that no one had been rejected from the 
group and threw the ratings in the wastepaper 
basket. The subjects continued their discussion of 
the remaining paintings. 

The subjects then ranked the paintings, with 
Rank 1 indicating the painting they felt to be the 
best. It was mentioned that the group consensus 
based on the rankings would later provide a basis 
for further group discussion. Ranking forms were 
passed out and collected. The subjects were told 
that in order to maintain controlled conditions be- 
twefn groups it would be necessary to have silence 
during ¢his part of the experiment, and therefore 
each subject was instructed to sit facing the wall. 
While the experimenter was supposedly compiling 
this consensus, he mentioned that the subjects might 
be interested in looking at the anonymous ratings 
others in the group had given them. 


Manipulation of the Attraction Variable 


Fictitious ratings had been made up in advance 
and were in the same basket into which the subjects’ 
real ratings had been thrown. The experimenter took 
the fictitious ratings from the basket and passed 
them out according to number. It was mentioned 
that the average ratings seemed to be “like a little” 
and “contribute a little.” One third of the groups 
received ratings considerably below average, one 
third slightly below average, and one third were 
quite above the average. This has been shown to be 
a reliable method (e.g., Kiesler, 1963) for sys- 
tematically varying attraction to the group. 


Discrepancy of Opinion 


After the experimenter finished compiling the 
consensus,,each subject was given a ranking of the 
paintinr chich supposedly represented the opinion 
ol woe in the group, excluding himself. To 
insure’ a‘ constant discrepancy across subjects, the 
rankings were systematically manipulated. Regard- 
less of how the subject had ranked the paintings, 
his sixth choice was made the group’s first choice 
and his Ranks 1-5 were made 2-6, while Ranks 
7-10 remained unchanged. Thus, regardless of how 
the subjects had initially ranked the paintings, the 
difference between each subject’s opinion and that 
which represented the group consensus was constant. 

The rankings were collected and the experimenter 
said that in order to aid the group in arriving at a 
good decision, a “guide to modern painting” would 
be distributed. The “guide,” written specifically for 
this experiment, was “fact-free” and intended only 
to provide an excuse for a second ranking of the 
paintings. After the guide had been read and col- 
lected, the experimenter said that the Institute was 
also interested in studying possibl. change over 
time, and wanted the subjects now ‘o rerank the 
paintings since they now possessed ne information, 
and had further considered the paintings. It was 
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stressed repeatedly that this was a private ranking 
and that no one in the group would see it. In fact, 
the experimenter showed an envelope into which he 
was going to put the rankings and immediately send 
them to the Institute, The dependent variable be- 
haviér, then, was the extent to which the subject 
moved his original sixth choice up the scale on this 
second, private ranking. 


A Check on Attraction to the Group 


Each subject was then given a questionnaire with 
five items dealing with his opinion of the group: 
How much would you like to continue with this 
group? How much do you like the other members 
of the group? How much of a chance do you give 
this group to win the prize? How well does this 
group compare with other groups you have come 
into contact with? How good a job will this group 
do in arriving at a good decision? Each item had 
five choices, with five indicating the highest attrac- 
tion, Scores on the five questions were averaged to 
give a total attraction score for each subject. Then, 
to test the subjects’ perceived chance of changing 
groups, each subject was given the following scale, 
ranging from 0 to 100%: “According to the instruc- 
tions given at the outset of this session, what proba- 
bility would you have to get into another group?” 


Termination 


At the end of each experimental session, the 
experimenter explained that there was no reward 
and no necessity for coming back. The entire 
experimental procedure was explained with special 
emphasis on the acceptance variable, 


Summary 


There were three levels of attraction: high (H), 
lower than average (LA), and extremely low (L). 
In addition, half of the subjects thought they could 
change groups easily (U), while the other half 
thought there was little chance for this (C). The 
result is a 2X3 experimental design. Since the pro- 
cedure is quite complicated, a brief sequential sum- 
mary of the steps is as follows: introduction, com- 
mitment manipulated, subjects discuss Paintings 1-5, 
subjects rate others in group, subjects discuss Paint- 
ings 6-10, subjects rank the paintings, subjects given 
fictitious ratings from the basket, subjects given 
fictitious group consensus, subjects given meaning- 
less communication, subjects rerank the paintings 
(private), subjects rate the group, subjects answer 
commitment question, experiment explained, 


RESULTS 


The mean conformity or attitude-change 
scores are represented in Figure 1. An analysis 
of variance of these scores revealed a significant 
difference among attraction levels (F= 6.38, df 
= 2/174, p < .01), due mainly to the relatively 
high conformity for both high-attraction condi- 


tions, and an interaction term that approaches a 
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conventional level of significance (F = 2.64, df 
= 2/174, pœ .07). One would not expect the 
interaction to be strong, inasmuch as one fx- 
pected the commitment variable to affect con, 
formity differentially only in the low-attraction 
condition. 

Thus, the curves appear to be as expected. For 
uncommitted subjects, the relationship between 
attraction and conformity may be seen as mono- 
tonic. The difference in conformity between the 
uncommitted-high attraction (UH) subjects and 
UL subjects is very significant (t= 3.33, df 
= .58, p < .01),? with the differerice Between ad- 
joining points somewhat weaker as one would 
expect (UH versus ULA, t=1.60, df=58, 
p<.15; ULA versus UL, ¢=1.43, df= 58, 


b < .20). For committed subjects, the relation- p” 


ship appears to be nonmonotopic as predicted. 
The difference in conformity scores betwetn the 
CH subjects and the CLA subjects is very sig- 
nificant (č = 2.68, df= 58, p<.01). Although 
the difference in conformity between CLA sub- 
jects and CL subjects does not quite reach an 
adequate level of significance (ż = 1.99, df = 58, 
pœ 05), the difference between the two low- 
attraction conditions is significant (¿= 2.55, df 
= 58, p < .02). 

Thus, the two main predictions for this expert- 
ment appear to be borne out by the data. For 
uncommitted subjects, there was a monotonic 
relationship between attraction and conformity; 
for committed subjects this relationship was not 
monotonic. Of course, the effect of the commit- 
ment variable was concentrated in the low- 
attraction condition as expected, »In this condi- 
tion, subjects who were committed to continue 
in the group conformed; uncommitted, subjects 
did not, Let us now move to the attraction data. 

At the end of the experiment, the subjects 
completed five rating scales related to their at- 
traction to the group. The means of these five 
scales are presented in Table 1, For these data 
the difference among attraction conditions is 
highly significant (F= 26.60, df=2/174? p 
< .001), and the interaction term is also signifi- 
cant (F= 5.08, df= 2/174, p<.01). For each 
question taken separately the effects are similar 
and significant, with only one exception. On 
Question 3 (chance of winning a prize), „the 
interaction term is not significant (F = 1.22), 
but attraction is significant (F = 4.27, dj=2/t749 
$ < 02). On all other questions both the at- ` 
traction and interaction terms are significant. 
The difference between commitment conditions 
18 not significant on any attraction question. 
Finally, on the check on the subjetts’ perceived 


* All t tests are two-tailed unless otherwise noted. ~ 
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Q in the group to conformity to group norms. 


chance of changing groups, C subjects indicated 
they had less chance than U subjects (¢ = 16.57, 
df = 178). 

The interesting point of this is that the rela- 
tionship among the obtained attraction scores is 
yéry Similar to that among the conformity 
scores. The interaction is due to the relatively 
heightened attraction scores for the CL subjects. 
This casts some doubt on the hypothesized non- 
monotonic relationship between attraction and 
conformity for committed subjects, and leads 
ta two possible explanations of the data, both 
of which assume a monotonic relationship be- 
tween attraction and conformity. 

*One explanation pertains to a possible sampling 
bias*in attraction. That is, one could assume: 
there is a, monotonic relationship between at- 
traction and conformity; as a result of a 
sampling bias (for some unspecifiable reason) 
CL subjects were more attracted to the group 
than UL subjects; resulting in relatively height- 
enened conformity on the part of CL subjects. 
The central assumption in this argument is that 
of a monotonic relationship between attraction 
and conformity, irrelevant to the experimental 
«manipulation of attraction. This argument may be 


checked by an inspection of the relationship 
between attraction and conformity within con- 
ditions, where presumably the relationship is 
undisturbed by the experimental manipulation. 
However, the average correlation within con- 
ditions (McNemar, 1955), is —.148, which, 
although it approaches statistical significance 


6 
(t= 1.90, > j (N; — 3) = 162, p < .10), does not 


Jæ 
account for much of the variance, and further 


TABLE 1 


MEAN OBTAINED ATTRACTION SCORES (AVERAGED OVER 
FivE SCALES) AS A FUNCTION OF COMMITMENT 
AND MANIPULATED ATTRACTION (LARGER 
NUMBER EQUALS HIGHER ATTRACTION) 


Attraction condition 


High Below Low 

average 
Committed 3.00 2.16 2.52 
(30) (30) (30) 
Uncommitted 3.22 2.53 2.19 
(30) (30) (0) 


Note.—Ns appear in parentheses. 
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is not in the expected direction. Therefore, one vidual on an important topic. The less attracted 
might conclude that an explanation of the data the subject was to the group, the less the group 
on the basis of a sampling bias is not very could influence him. On the other hand, when the 
powerful, subjects were committed to continue as a mem- “| ' 


Th2 second explanation has to do with the ber o a eo ae agg eleten 
effect of commitment for CL subjects. We have was oman tween CARSEAT E 
assumed that commitment to continue in the the subject’s Susceptibility t+; auiuënce. 
group induces these subjects to conform. How- That is, the less the subject was attracted, the 
ever, on the basis of the present data, one could less the group could influence him down to a 
alternatively assert that the effect of commit- point. Beyond that point, decreased attractiott 
ment is to induce subjects to seek out attractive Produced relatively heightened conformity, 
aspects of the group, producing greater attraction It is interesting to note that in the latter case, 
to the group, and hence, assuming a monotonic conformity by low-attracted, subjects later re- 
relationship between attraction and conformidy, Sulted»in increased attrgction. Apparently the 
producing greater conformity, This question is subject, after changing his attitude in the direc- 
essentially a sequential one: which came first: tion “advocated” by the group, is still in a 
increased attraction or increased conformity? In psychologically uncomfortable state. He js com- on 
the present data this is impossible to ascertain, mitted to continue in, and how agrees with, a 
since the attraction measure came after the group to which he is relatively unattracted. The | 


conformity measure in time, group then becomes more attractive to the sub- 
Consequently, to test this “sequential” hy- ject, one might say, to bring the three cognitions 
pothesis, an extra “control” condition was run, involved into a consistent relationship. Alterna- 
This condition was identical to the CL condi- tively, one might note Heiger’s (1958) statement 
tion up to the point when the subjects were that P tends to like O when P agrees with O, 
presented with the discrepancy of opinion be- However, a precise explanation of the effect re- 
tween themselves and the group. At that time, mains for further research. In the previous study | 
instead of the upportunity to conform, subjects (Kiesler, 1963) no such effect was found. Ones | 
(N = 30) were given the attraction scales and Of the differences between that study and the | 
were then dismissed, By comparing this condi- Present one is the age of the subjects. In the 
tion with the CL condition, one may determine earlier study, subjects were seventh graders. One 
whether the subjects conformed first (and later might posit that older, better educated subjects 
became more attracted to the group), or in- Would be more Sensitive to cognitive incon- 
creased their attraction first (and later con. sistencies, 
pea The Te attraction score for the CL At least one theoretical ambiguity remains. 
j was 2.52. The mean attraction score The derivation for the Present experimen’ was 
for the control condition was 2.24, which is sig- primarily oriented toward th i 
nificantly | Hee ah t e low-attraction | 
cantly lower than CL (t= 1.87, df= 58, condition, However, in the strictest senge, oñe 
i ps ae sealed) but very similar to UL has difficulty with this type of theoretical orien- $ 
chat’ conditas. r duced gone may conclude tation accounting for the relatively decreased $ 
en uced low-attraction sub- conformity on the part of the CLA subjects. * 
jects to conform, and as a result of that con- Theoretically, the CLA subj asics 


ie { 
they had! gontoeneg Tn ¢ fi ), one must note that it is 
d med, ypothesis of a non- not in the expected directi ist i 
mena relationship between attraction and left in the meee m Title 
aa = pg teres to continue that the inconsisten 
group appears to be confirmed, ar “tolerable” level before the, increased 
conformit ; Ba 
Discussion Finall: aa 
i wy, it should be noted that the conftrmity + 
The two hypotheses for the Present experi- OF attitude-change measure here is a private 
ment seem to be supported by the data. When one. Every effort was made to insure that th 
the subjects were not committed to continue in subject believed that the 
the group, a monotonic relationship was found the Second index, Had this 


between attraction to the group and the extent a ae nee 
p 3 Curviling elati: h oe 
to which the group could influence the indi- confo ia abe ee 
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EXPECTANCY, CHOICE, AND THE OTHER PERSON * 


g PHILIP A. GOLDBERG 


Connecticut College 


Aronson, Carlsmith, & Darley (1963) found that Ss may be led to choose an 


unpleasant task in preference to a neutral task through the manipulation of 


expectangy. A replication failed to corroborate these findings. A variation of 
the original design using confederates was effective in inducing Ss to choose 


an objectively less pleasant task. The condition employing confederates was only 
successful when the confederate’s choice coincided with S’s expectancy. These 


results raised questions concerning exp’ 


ectancy and cognitive change. It was 


° concluded that the changes induced were tied exclusively to the behavior and 


were without concomitant phenomenal 


l or cognitive change. A dissonance- 


theory explanation of the data was rejected, 


In a study by Aronson, Carlsmith, and Darley 
(1963), it was found that subjects may be led 
to choose to perform an unpleasant task in pref- 
ereńce to a peutral task, In their experiment: 

e 


Ss performed 2 tasks, one unpleasant, the other 
ntutraf They were then led to expect that they 
would have to perform one of the tasks again. 
When subsequently presented with a choice, there 
was an increase in the probability of their choosing 
the expected task, even when it was objectively less 
pleasant [p. 220]. 


e Implicit in the authors’ discussion of expect- 
ancy is the belief that expectancy serves as an 
internalized regulatory mechanism. The subjects 
are get to behave in a prescribed manner and 
“their cognitive structures are modified to accom- 
modate the future experience. The choice of the 
léss pleasant experience reflects an alteration in 
the initial hierarchy; the subjects prefer the ob- 
jectively less pleasant experience to the state of 
dissonance inherent in the disconfirmation of a 
, Strong expectancy, and they choose accordingly. 


h ‘4 modification of a paper pre- 
otern Psychological Association 
hia, April 1964. 


It is the point of this paper that such a view 
of expectancy and cognitive change is gratuitous. 
It is suggested instead that the ordering of pref- 
erences remains unchanged and tied to the objec- 
tive qualities of the stimuli. Further, it is sug- 
gested that the subjects’ apparent cognitive 
change represents an act of compliance with the 
perceived demands of the experimental situation. 

The general hypothesis of this study was that: 
where subjects are presented with a model of 
noncompliance to assumed experimental demands 
there would be no demonstration of the expect- 
ancy effect, and consequently no evidence to 
support a view of induced cognitive change, Fur- 
ther, it was hypothesized that where a model of 
noncompliance was not supplied, there would 
be an apparent corroboration of the original 
findings. 


METHOD 
Subjects 


Subjects were 105 college students. The subjects 
were volunteers for an experiment in psychophysics 
and, as in the original study, all subjects were female. 
Subjects were assigned to one of eight groups; four 
of the groups constituted a replication of the Aron- 
son et al. (1963) study. 
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Procedure 


Subjects were all run individually and were re- 
quired to take part in two psychophysical tasks. One 
task involved weight discriminations, the other in- 
volved, taste discriminations. 

In the weight-discrimination task, subjects were 
required to” make comparative judgments of the 
heaviness of successive pairs of weights. In the taste- 
discrimination task, subjects were required to make 
comparative judgments of the bitterness of succes- 
sive pairs of solutions. Three solutions were used: 
tap water, a quinine sulfate solution of 14 gram 
per liter of water, a quinine sulfate solution of .23 
gram per liter of water, Each task lasted for 
approximately 8 minutes, and the order of the tasks 
was randomized. 

After the subjects completed the tasks they were 
told that they would have to repeat one of the tasks. 
The particular task to be repeated was specified 
by the experimenter. Subsequently, some subjects 
were called back after waiting 20 minutes; others 
were called back after 7 seconds. The 20-minute 
interval represented a strong-expectancy condition 

and the 7-second interval represented a weak- 
expectancy condition. After the waiting period the 
subjects were told that the experimenter had made 
an error when he had specified the task the subject 
would have to repeat. The subject was told that she 
could choose to repeat either one of the two psycho- 
physical tasks and that her choice made no difference 
to the experimenter, When the subject made her 
choice the experiment was concluded. 

As in the original study, there were four experi- 
mental conditions: expectation of taste for 20 min- 
utes, expectation of taste for 7 seconds, expectation 
of weight for 20 minutes, and expectation of weight 
for 7 seconds. 

The other four experimental groups differed from 
the four replication groups in that subjects were 
run in pairs, one member of each pair being a 
confederate of the experimenter. At the time the 
subjects were called back and asked to choose 
the psychophysical task they would repeat, the con- 


TABLE 1 


CHoice BEHAVIOR AS A FUNCTION or EXPECTANCY 
AND STRENGTH OF EXPECTANCY IN ORIGINAL 
STUDY AND REPLICATION 


Choice 
Expected Taste Weight 
Original | Replication Original | Replication 

Taste 

Strong 9 0 12 20 

Weak 2 1 19 9 
Weight 

Strong 0 0 12 5 

Weak 0 0 5 5 
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TABLE 2 


CHOICE BEHAVIOR AS A FUNCTION OF EXPECTANCY 
AND CHOICE OF CONFEDERATE 


Choice 
Expected 
Taste Weight 
Taste S 
Confederate-taste 5 15 
Confederate-weight 0 20 4 
Weight 
Confederate-taste 0 20 
Confederate-weight 0 ‘ar 5 
o xig 


4 

federate expressed her choice first, and her choice 
conformed to prearranged instructions given to her 
by the experimenter, In one group the subjects were 
led to expect the tastes and the confederates chose 
tastes; in the second group subjects expected tastes 
and the confederates chose the weights; in the third 
group subjects expected weights and the confederates 
chose tastes; in the fourth group subjects expected $ 
weights and the confederates chose the weights. 

Aside from running subjects in pairs and the use 
of confederates, these subjects received exactly the $ 
same treatment as the other four groups in the 
replication and as the subjects in the original study. 
In the light of theory and the previous findings, a 
7-second wait condition was not used with subjects” 
run in pairs. 


RESULTS 


Inspection of Table 1 reveals that the results 
of the replication are strikingly at variance with 
the original study. This study found, as did the 
original study, that none of the subjects who 
expected the weights chose to taste the “solu- 
tions. However, for subjects who expected to 
taste the solutions only 1 of 30 subjects chose 
the solutions in preference to the weights and 
this subject was in the 7-second, or weak- 
expectancy condition, The difference between the 
two groups is nonsignificant and thus the earlier 

dings are not corroborated. 

„~ Under the conditions employing confederate’, E 
it was similarly found that no subject who ex- Eo 
pected the weights chose to taste the solutłons 5 
(N = 25). This was true irrespective of what the, & 
confederate chose, 

Where the subject had been led to» expect the + 
solutions, what the confederate chose did appar- 3 
ently make a difference. An inspection of Table 2 » 
reveals that of the 40 subjects who expected 
the solutions 20 had a confederate who chose 
the weights and 20 had a confederate who chose 
the solutions, Five of the 40 subjects chose the y 
solutions ; all of these subjects had confederates a P 
who similarly had chosen the solutions. This dif- ml E 

s > 


ference is significant at the .05 level (Fisher 
exact test, two-tailed). 


DISCUSSION 


The data lend themselves to a partial recon- 
ciliation of the two contrasted views giving rise 
to the experiment. The power of expectancy was 
ngt sufficient in this study to induce changes in 
e choice behavior. Similarly, the effect of the model 
established by the confederate on choice be- 
havior was not general. Neither expectancy, nor 
a madel,of moncompliance was in itself sufficient 
to induce choicesof the less pleasant task. 
Subjects chose® the objectively less pleasant 
task only when the expectancy for the solutions 
was reinforced by the choice of the confederate. 
Appasently, expectancy had a subthreshold effect 
thag was incréised by the confederate’s choice. 
Interpretation of the data yielded by this 
study is complicated by reference to the findings 
obtained in the original study by Aronson et al. 
(1963). In the original study, expectancy was 
sufficient to producea choice of the less pleasant 
task, In this study it was not. The original find- 
ings suggest that the stronger the expectancy, as 
defined by the length of time spent expecting a 
particular task, the more pronounced the effect. 

Aronson et al. suggest that this finding indi- 
cates that the subjects were not in fact respond- 
ing to pereeived desires of the experimenter. In 
the light of the present findings it might be 
suggested that the subjects spent the 20 minutes 
in the strong-expectancy condition literally get- 
ting the bad taste of the experiment out of their 
mouths. Having achieved this, they were more 
acquiescent, more docile in their behavior. Orne 
(1962) has amusingly and convincingly reported 
on the docility of college students as subjects 
for psychological experimentation. He was unable 
to find a psychological task sufficiently tedious 
and repellent to lead subjects to refuse par- 
ticipation. 
« The data seem to suggest that subjects who 
chose a less pleasant task in preference to a 
neutral task, were, in fact, responding to per- 
‘ceived obligation. This sense of obligation is 
e strengthened by the availability of an interper- 

sonal model of compliance. It is, of course, pos- 

sible thaf the subjects were responding to per- 

ceived experimental demands, and that this was 

so irrespective of the presence and behavior of 
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the confederates. Differences in choice behavior 
between the confederate groups could have 
occurred as a function of an unwitting, but 
nonetheless systematic bias in the manner of 
presenting the instructions. ? 

In any event, there is little in these data to 
suggest that cognitive change may be induced 
through the methods described in these studies. 
The choice of the less pleasant task does not 
seem to represent a cognitive alteration, or re- 
interpretation of pleasantness. The competition, 
thus, does not seem to be between potential 
sources of dissonance, as such; but rather, what 
changes do occur seem to be tied to the behav- 
ioral level. It would seem unparsimonious and 
unjustified on the basis of the data to assume a 
concomitant change in the cognitive structure. 

The data do, of course, support in part the 
notion that expectancy as manipulated and de- 
fined by the procedures used does effect changes 
in the choice behavior. Additionally, the objec- 
tive, empirical fact is that subjects may be 
manipulated to choose to taste the foul-tasting 
solution in preference to a relatively innocuous 
choice behavior. Is there a phenomenal change 
in preference so that behavior and cognition 
are consonant? 

On the basis of these data we do not think so. 
An anecdotal report may serve to emphasize the 
danger involved in inferring cognitive or phe- 
nomenal change from behavior. The one subject 
in the replication who chose the tastes in pref- 
ence to the weights was questioned at length 
following the experiment. Her report is of some 
interest. The young lady, a sophomore art major, 
told the experimenter that she did not like psy- 
chology or psychologists. She resented, she said, 
brainwashing and attempts by psychologists to 
control the minds of other people. Consequently, 
“T chose the tastes, because I knew you wanted 
me to pick the weights.” 
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CHILDREN’S ACQUISITION OF SKILL IN PERFORMING A 
GROUP TASK UNDER TWO CONDITIONS OF i 

GROUP FORMATION ; 

> x MOSES H. GOLDBERG anp ELEANOR E. MACCOBY k | 
Stanford University 
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had all Previously worked with other groups on 
the task and thus had an Opportunity to acquire 


to pull cones out of a single 
usually de- 


ever, there was 
ance. The group 
turns and cones 
a relatively short 


We are interested in the situation of individual 
rewards ituati 


of taking 
were withdrawn smoothly in 


the development of these Cooperative strategies, 
how efficiently a group 
would perform on such a task if, although the 
sroup had not previously worked together, they 
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certain skills in getting and Maintaining group 

cooperation. a 
Specifically, we wished to discover whether 

the acquisition of such skill would be facilitated 0 

or hindered by constantly changing group mem- Í 

bership during the initial phases of experience 

with the task. It could be argued that if an € 

individual is to be brought together with a 

group of strangers to work with them on an 

individual-reward task with which all are famil- 

jar, successful y 

likely if the individuals involved have all had Pee 


fairly extensive Previous experience in adapting 
themselves 


Contrast, which does not change personnel during a 
e series of training trials, should develop a set Er 
of stable role relationships, and the members of ee, 
Such groups may find that the role they have ° _ 
acquired does not transfer easily to a new group he 


Setting. 
Our hypothesis is: when tested with a group". = 


a” Ce ie. 


? 


vs 
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of strangers on a task requiring group coopera- 
tion for achievement of individual goals, the 
ipdividuals who will be most successful will be 
the ones who have had previous experience in 
groups of changing composition; those least 
successful will be those whose previous experi- 
ence had been with a single unchanging group. 


. METHOD 
Design 


Changing groups. Groups of children performed 
a cooperative task for eight consecutive trials—the 
“trainihg “period.” After each pair of trials, the 
composition of the, groups was changed, so that 
each of the subjects irf these groups worked with 
four different sets of co-workers during the training 
trials. The subjects were reassorted once more, after 
which hey performed the task eight more times 
(test, trials) włthout further shuffling of group 
membership. 

Stable groups. Subjects assigned to this condition 
had their eight training trials with the same group 
of co-workers. They were then shifted to new 
groups, and performed eight test trials. 

S e 


Training trials Test trials 


Changing S Aes Alien rte 
Trial No. 1234567 8/9 10 11 12 13 14 15 16 
e 
Stable v 
Check marks indicate changes in 
° group composition. 
Subjects 


The subjects were 64 second-grade children, 32 
boys and 32 girls, from two schools in Palo Alto, 
California, Four groups of 16 were used, and from 
these pools of 16, four subgroups of four children 
eagh were formed. The groups were then designated 
as follows: 16 boys, stable-membership condition 
(SB); 16 girls, stable-membership. condition (SG); 
16 boys, changing-membership condition (CB); 16 
girls, changing-membership condition (CG). Groups 
CG and SB were from one school, SG and CB 
from the other. An attempt was made to form 
groups from children who knew each other as little 
as possible. Cases were eS, assigned to the 
EWON conditions. 


y Pade 


We wished to devise a task which would meet 
the followirfg criteria: 

1. It must require close group cooperation for 
successful performance, with an uncooperative mem- 
ber able to jeopardize the group’s success. 

2. It must permit individual differences in per- 
formance, and permit reward to be given to the 


* individual on the basis of his own performance, 


independent of the total group achievement. 
+3. It should provide a range of scores, rather than 


simply a,“pass” or “fail” score. 
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We chose a tower-building task, in which the 
group of four children working together had to 
build a single tower of blocks, and build it as high 
as they could within the 15-second time period 
allowed for each trial. Each child was given a pile 
of eight 3-inch blocks of a distinctive color, 4nd his 
score at the end of the trial was the number of 
his own blocks that were on the jointly built tower 
when time was called. No two children could place 
blocks at the same moment. Even with perfect co- 
operation (eg., regularly taking turns placing a 
block) it was not possible for all the children to 
place all their blocks. The time allotted was too 
short, and there was an upper limit to the height 
of the tower due to the height of the children and 
the stability of the structure they could build; these 
factorg combined in such a way that the highest 
tower built during the experiment was 19 blocks. 
An individual child could reduce everyone’s score by 
knocking over the tower, through carelessness, de- 
liberate negativism, or overzealous addition of blocks 
to an already wobbly structure. On this task, the 
potential range of scores for individual children 
was 0-8 on each trial. 

The experimental setting consisted of a large 
multipurpose room located at the respective schools. 
Four “stations” were established in the four corners 
of the room, and a judge was assigned to each 
station, the senior author serving as coordinator and 
timer. Also placed at each station was a set of 32 
3-inch cubes, 8 each of green, yellow, blue, and 
red. The 16 children who were to participate in 
the session were then assembled in the center of the 
room and the following instructions were read: 


This morning we are going to play a little 
game—and if you listen carefully and follow the 
rules, you can win some prizes. Does everybody 
like M & Ms?...OK.... Here’s what you 
have to do. Do you see the numbers 1, 2, 3, and 
4 in the corners? In a minute I’m going to call 
all of your names and tell you each one of the 
numbers, Then you will all go to the number 
I tell you. But first PI tell you what you are 
going to do. When I say “Go” I want everybody 
at each number to build one tower with blocks. 
There will be four boys [girls] at each number 
and all four of you have to build the same tower. 
I want you to build it as high as you can, There’s 
one thing, though, I want you each to use dif- 
ferent color blocks. That way we can see how 
many blocks each one of you put on the tower. 
We are going to give you one M & M for each 
block of your own color that’s on the tower. So 
the more blocks of your color you put on, the 
more M & Ms you'll get. Be careful, though, if 
the tower gets knocked down, it doesn’t count 
and you have to start over. No fair holding the 
tower, either; it has to stand up by itself. You 
will have to work quickly because I’m only going 
to give you a short time. Watch, and I'll show 
you what I mean. Suppose you build a tower and 
when I say stop, it looks like this: [blue, red, 
green; red, blue]. The boy [girl] with the red 
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blocks would get two M & Ms, because there are 
two red blocks on the tower, the one with blue 
would get two, the one with green would get one, 
and the one with yellow wouldn’t get anything, 
because there aren’t any yellow blocks on the 
tower. . . . Does everybody understand? .. . 


After this instruction the groups were divided into 
four subgroups of four children each and these were 
sent to the four stations. Each group had 16 trials 
of tower building. The judges distributed rewards 
(as per instructions to the children) after each trial, 
and enforced the rules. Each trial was 15 seconds in 
duration, and there were no rewards for blocks 
placed after the signal to stop. Towers which col- 
lapsed between the signal to stop and the time 
necessary for the judge to count the blocks were 
treated as if they had fallen during the trial, and 
only those blocks still standing erect in the original 
spot were rewarded. 

After the second, fourth, sixth, and eighth trial, 
the following instructions were given: 


Now before we try it again, I’m going to give 
you all another number, and I want you to move 
to the new place. Some of you will stay at the 
same number, but most of you will have to move. 


The subgroups were then reconstituted. 

In the changing-membership condition, complete 
reconstitution took place. No child ever worked with 
the same other child again when he moved to a new 
“station.” In the stable-membership condition, all 
of the children moved, but the subgroups were 
identical, that is, they moved as a unit of four, 
except for the last move, the one after Trial 8. At 
this time they, too, were reconstituted with dif- 
ferent partners, The last eight trials (9-16) are 
regarded as a testing period, and no movements or 
reconstitutions took place, 


RESULTS AND Discussion 


We predicted that children who had been 
trained in subgroups of changing composition 
would be more successful when placed with still 
another group in the rest trials than would sub- 
Jects whose training had occurred in groups of 
constant composition, The reverse proved to be 
the case. The mean total number of blocks placed 
per trial by the 32 subjects in the changing com- 
Position groups during their eight test trials was 
45.4, That of the constant composition groups 
was 72. This means that each of the eight four- 
person changing subgroups built towers aver- 
aging 5.7 blocks in height during the eight test 
trials, while the stable subgroups built towers 
which averaged 9 blocks in height. This differ- 
ence is significant, p < .01 (¢=3.02, df= 14). 
During the eight training trials, the perform- 
ance of the groups under the two training con- 
ditions was not significantly different. The aver- 
age number of blocks placed per trial by four- 
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man subgroups during training was 7.4 for the 
stable groups, 6.9 for the changing groups. 
Figure 1 shows the trial-by-trial totals for all of 
the 32 subjects in each condition combined, 
Dividing the totals in this graph by 8 will yield 
the mean performance of four-man subgroups, 
and dividing by 32 will give the individual mean 
performance for each trial, 

How can we account for the superiority’ of 
the constant-composition groups during the test 
trials? We first entertained the hypothesis that 
in the constant-membership condition, there was 
time for the formation of stable’ role rélation- 
ships—-that some subjects emerged as leaders, 
others as followers, and that this role acquisition 
somehow permitted subjects to function more 


effectively in a new group. It is true that the F 


task permitted the establishment of leadership 
and followership roles. The situation frequently 
occurred in which more than one child would 
place a first block and insist that the tower 
should be built upon his starting point. The child 
who succeeded in persuading the group to build 
on his first-placed block” had at least a ne- 
block advantage, of course, but his dominance 
gave his a greater advantage than this. The 
number of first blocks placed was significantly. 
correlated with total additional blocks placed? 
That is, the child who succeeded in establishing 
the starting point for the tower wag able to get 
more of his blocks on the tower, not counting 
the first block, It is Meaningful, then, to speak 
of dominance of some individuals in the per- 
formance of this task, and we may ask whether 
leadership or dominance became more polarized, 
more stably characteristic of specific individuals, 
in the constant than in the changinng condition. 

We can obtain information relevant to this 
question by examining the variability of scores 
in each group. If stable dominance patterns were 
emerging in one of the experimental conditions, 
this should mean that some individuals would 
earn consistently high scores and others con- 
sistently low scores over a series of trials, "If 
groups were functioning with little polarization, 


ni variance among individual scores shoulð be, 
ess. ?: 


. 


„During the training trials, the scores of indi- 
viduals were more variable under tHe changing- 
composition condition than under the constant- 
composition condition (F = 1.97, p<.05). The 7 
difference in variability was even more marked 
during the test trials. For each child, we com- 
puted a ratio score—the ratio of the number of* 
blocks he placed during the eight test trials 
to the total number placed by his group, The 
average of these ratios was, of coyrse, .25., 
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TRIALS 


Fic. 1. Changing (C) versus stable (S) groupings. (Total number of blocks placed in each condition 
trial by trial by 32 subjects in eight subgroups. Break in line indicates reconstitution of membership 


> of four-person subgroups.) 


But in the groups which had been trained under 
changing-membership conditions, the variability 
was considerably greater (F= 2.42, p<.01). 
There were two children in these groups who 
clearly exercised coercive dominance over the 
other children in their groups—one placed 59% 
of All the blocks placed by his group, the other 
50% + there was one child in a group dominated 
by a high-scoring child who placed no blocks 
during the test trials, and two others who placed 
only 7% each of the total being placed by their 
group. No such extremes were found among the 
children trained under constant-group condition. 
Observers’ reports on the changing groups indi- 
tated that in some instances the dominant child 
was shouldering others away from the tower. 
Sémetimes he would have a confederate who 


_ ` helped exclude the other two children and in 


return be allowed to place some of his own 
locks. e 
Several of the groups trained under constant- 


a ‘composition conditions, by contrast, developed 


smooth cooperative techniques including all group 
members during the test trials. For example, the 
. four members of a group would each pick up four 
blocks and stand poised for the starting signal— 
‘one child on his knees to place the bottom four 


{ 
Y ” 
| <, Deg blocks, the next child stooped to place the next 


7 


four just above, etc. At the starting signal, the 
four sets of blocks would be quickly superim- 
posed; out-of-line blocks would be straightened 
to stabilize the structure, and then the group 
would stand back and restrain one another from 
placing additional blocks so as not to topple the 
tower. Such a procedure resulted in equal scores 
for all participants and a high group score. The 
more equal distribution of scores among the 
constant-composition group suggests that coop- 
erative techniques of this kind may have been 
responsible for their high scores during the test 
trials. 

If equalitarian rather than dominance tech- 
niques are more effective in maximizing group 
scores, and constant-composition groups more 
often employ equalitarian techniques, how does 
it happen that these groups did not excel during 
the training trials? We have already seen that in 
the constant-group condition, different group 
members obtained more similar scores during 
this period as well as during the test trials, yet 
they did not consistently build higher towers at 
this time than the changing-membership group. 
Our observations suggest that it takes time to 
develop genuinely cooperative processes, because 
the group must first devise means of controlling 
would-be coercive individuals. It frequently 
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happened in our control groups that an individual 
would obtain a high score on early trials by 
using coercive techniques. But on subsequent 
trials, he would obtain a score of 0, when other 
members of the group took measures against him, 
sometimes by deliberately knocking down the 
tower. In the groups with constantly changing 
membership, this mobilization of the group 
against the bully does not appear to have taken 
place so readily. We suspect it may have begun 
to occur during the test trials when the experi- 
mental group children continued with the same 
group of co-workers for a longer period of time 
than they had done during their training trials, 
and that the countermeasures which the low- 
scoring children began to take may be in’ part 
responsible for the changing groups’ failure to 
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improve their performance during the test trials 
as much as the stable groups did. It is likely that 
the course of development of cooperative tech; 
niques would be greatly influenced by the char- 
acteristic behavior dispositions of group members, 
for example, whether one was habitually domi- 
nant and the others submissive. We suggest that 
experimental variation of group composition 
would be a fruitful line of further research, as 
well as more systematic observation of groupe 
process during training and test trials, 
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BODY SIZE, SELF-EVALUATION, AND MILITARY S 
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avy men dissatisfaction with height and ° 


weight varied directly with deviation from Preferred (“ideal”) height and 
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Physical deviation or weakness and of attempts 
to compensate for such handicaps through excep- 
tional mental or physical efforts. In view of the 


Proposition directly, 

Jourard and 
studies dem: 
ings toward 
also offered 


with implications for mental health. They found, 


for example, that college students who had posi- | 


vo 
Secord (1955) in a series of > “/ 
onstrated a connection between feel- 
body and feelings toward self, and 
evidence that attitudes toward the 
may be a significant personality variable 


tive feelings toward the body also felt more 
secure, self-confident, and freer of inferiority 
féelings than those who had negative body feel- 
“ings (Secord & Jourard, 1953). Johnson (1956) 
tended to support this hypothesis in finding nega- 
tive attitudes toward body to be related to 
somatic symptoms on the Cornell Medical Index 
Heglth Questionnaire. 

e Im an earlier paper (Gunderson & Johnson, 
1965). unfavorable personal experiences including 
family instability and conflict, inconsistent or 
extreme disciplinary practices by parents, and 
persistent difficulties in school were shown to 
correlate negatively with self-satisfaction in 
young Navy men. It seems highly plausible that 
Rid self-evaluations also are influenced by real char- 
i è acteristics of the person, such as body size and 
intelligence. The present study is concerned with 
** relationships between the attributes of height, 
~weight, and intelligence and positive or negative 
feelings toward those specific aspects of self 
* as well as pervasive effects of height and weight 
upon self-regard generally. Finally, the possible 
influence of height and weight upon military 
adjustment is examined. 


. METHOD 


Subjects 


Subjects were Navy enlisted men assigned to a 
medium-sized’ aircraft carrier. A personal data 
questionnaire and a test inventory were adminis- 
tered to these crewmen by medical personnel while 
the carrier was at sea. The age range of the sample 
tested was 17-21, with a mean of 18.9 years. The 
mean gducatioh was 11.1 years, and the mean Navy 
General Classification Test (GCT) score was 52.2. 
s The Navy GCT score is a measure of general intelli- 
3 ake gence expressed as a T score (M = 50, SD = 10); an 
* , IQ equivalent for a T score of 52 would be 103. 
The mean GCT score for all men entering the Navy 
is 50. 


Personal Data and Performance Variables 


The personal data questionnaire provided a num- 
ber of items of biographical and personal informa- 
tion from the subject, including height and weight, 
„as well as age, education, and so on. Navy GCT 
. scores and performance data, consisting of semi- 
_ annual Professional Performance and Military Be- 
hayior ratings, disciplinary offense records, and 
medical records were obtained from individual service 
r records sand ship’s records. Performance data were 

not available on the entire sample tested due to 
transfers, releases from service, or insufficient time 
in the service to receive performance evaluations. 
All items of performance data were available for 392 
men. 

‘Five items of criterion information, Professional 
Performance rating, Military Behavior rating, num- 
-ber of distiplinary offenses, number of sick calls, and 


.. 
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pay grade attained, were given coded values, guided 
by knowledge of their distributions and intercorre- 
lations. For each of the criterion items except num- 
ber of sick calls approximately the lower (least 
favorable) 25% of the distribution was given a 
value of 0, the middle 50% a value of 1, and the 
upper 25% a value of 2. For number of sick calls a 
smaller percentage of the total distribution was 
assigned a value of 2, thus giving this variable less 
weight in determining overall criterion scores. The 
justification for this procedure was the relatively 
low correlation present between the sick call item 
and the other criterion items, While a more sophisti- 
cated method of weighting these items may even- 
tuglly be devised, the method utilized was consid- 
ered a, reasonable and adequate procedure for dif- 
ferentiating performance for the purposes of this 
study. The five criterion values were combined into 
a composite score which provided an overall esti- 
mate of relative effectiveness of performance in the 
Navy. Performance scores ranged from 0 to 10 with 
a mean of 5.7 and a standard deviation of 1.8. 


Test Variables 


Attitudes toward body and self were obtained from 
a 52-item multiple-choice inventory, the self-evalua- 
tion index, developed by Secord and Jourard 
(1953) and modified by Johnson (1956). For each 
body or self-characteristic the subject rated his satis- 
faction or dissatisfaction with that particular aspect 
of himself on a 5-point scale. Two items, height and 
weight, were selected for study from among the body 
characteristics included in the index and one item, 
intellectual level, from among the self-characteristics. 
In addition, a self-evaluation (SE) score, called 
“self-cathexis” by Jourard and Secord, was obtained 
by summing ratings over the 30 items referring to 
various aspects of self. SE scores were available for 
485 men. 
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Statistical Analyses 


Heights, weights, and intelligence scores were 
grouped, and each variable was plotted against per- 
centages expressing dissatisfaction with that at- 
tribute, The criterion for being classed “dissatisfied” 
Was an extreme response on the satisfaction scale 
(“strongly dislike” or “don’t like”). Second, a joint 
distribution for the entire sample was plotted using 
6 categories of height and 8 categories of weight. The 
distributions for each of the 6 height categories 
were further partitioned into three subcategories by 
weight: underweight, medium-weight, and over- 
weight, with approximately 25%, 50%, and 25% 
falling in these subgroups, respectively. Means and 
standard deviations were computed for SE and per- 
formance scores in each of the resulting 18 height- 
weight categories, Various combinations of the 18 
height-weight categories were compared in terms .of 


levels (means) of self-regard and military effective- 
ness. 


RESULTS 


i D expressing dissatisfaction 
rises sharply with over half of those under 67 
inches expressing dissatisfaction. 


with weight is shown in Figure 2. Least dissatis- 


faction with weight was expressed in the range 
from 151 to 180 


satisfaction occurred outside these limits, reach- 
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ing a maximum (71%) in those weighing less 
than 131 pounds. Over half of the subjects 


weighing over 180 pounds, expressed dissatis- , 


faction with their weights. 

The relationship of intelligence (Navy GCT 
score) to expressed dissatisfaction with intel- 
lectual level, shown in Figure 3, is generally 
linear but discontinuous. Within a wide range 
of GCT scores, 35-54, no relationship appears 
with satisfaction, but outside these limits the 
relationship is inverse and linear, Percentages 
of expressed dissatisfaction ranged from 9% in 
the 65-69 GCT interval to ‘82% in the 32-34 
interval. Ši 


The “ideal” height for this population (72 


f 


inches) was about 2 inches taller than the actual ™ 


mean height (approximately 70, inches). Simi- 
larly the optimum weight range in ternis of 
satisfaction (171-180 pounds) was several 
pounds heavier than the mean weight for this 


population (approximately 159 pounds). Overall , 


dissatisfaction with weight was much more preva- 
lent than dissatisfaction With height (46%? for 
weight versus 27% for height). 

The separate and combined effects of height 
and weight upon self-evaluation generally and 


upon effectiveness of military performance arf 


summarized in Table 1. Mean standard scores 
(T scores) for the self-evaluation and perform- 
ance scales are shown in the table for each of 
the major height-weight combinations. In addi- 
tion to the combinations shown, differences 
among means for various other subgroups are 
discussed below without reference to Table i 
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TABLE 1 
` RELATIONSHIPS OF BODY SIZE TO SELF-EVALUATION AND PERFORMANCE 
= 
M and SD (T scores) 
m 
Self-evaluation Performaace 
M SD N M SD N 

. 

sight only 
Short (less than 69 inches) 48.95 9.88 139 50.36 9.60 110 
Average (69-71 inches) 51.23 9.32 192 50.18 10.26 154 
Tall (more than 71 inches) 50.52 9.63 142 49.61 9.89 119 

Weight ony ° 
Light (less than 141 pounds) a 48.01 9,54 71 49.51 9.80 53 
Medium (141-180 founds) 50.95 “| 9.57 332 50.14 9.99 279 
Heavy (more than 180 pounds) 49.81 9.71 70 50.17 10.00 51 

Combinations of height and weight 
Short-tinderweight* 48.70 8.99 27 48.56 9.55 25 
Averde-underweight 51.02 9.68 54 49.00 9.47 38 
Tadl-underweight 50.20 9.51 39 48.74 10.46 35 
Short-medium weight 49.34 10.46 88 51,69 9.36 67 
Average-medium weight 50.64 9,54 103 50.98 11.10 88 
Tall-medium weight 50.78 11.20 65 49.30 9.30 57 
Short-overweight 47.79 8.85 24 47.94 10.24 18 

e` Average-overweight 53.23 7.99 35 49.28 8.44 28 

Mroverweight A 49.10 10.35 39 51.41 10.49 27 

Extreme deviations of height and weight 
Very short (less than 67 inches)-underweight 48.92 3.88 12 51.40 8.08 10. 
Very tall (more than 73 inches)-underweight 48.00 12.46 6 51.80 10.42 5 
Very tall-overweight 46.00 11.66 10 57.33 9.01 9 

« Very short-overweight 46.27 9.72 15 47.20 9.33 10 


each height category. 
. 


An analysis of variance of SE scores over 
six categories of height (less than 67, 67-68, 69, 
70-71, 72-73, and more than 73 inches) indi- 
cated, a highly significant relationship between 
self-evaluation and height (F = 4.1, p < .001). 
Very short individuals (less than 67 inches) were 
~ Most dissatisfied with themselves (M T score 
© = 46.7), and very tall individuals (more than 

73 inches) also tended to be dissatisfied with 
themselves (M = 47.8). Individuals in the 70-71 
inch category were most satisfied with themselves 
(“= 52.2). Variations in mean SE scores over 
three combined height categories (less than 69, 
69-71, and more than 71 inches) are shown in 

a, Balle 1. - 5 
“e TNe differences in SE scores over eight cate- 
` gosies of weight (less than 131, 131-140, 141- 
> 180, 151-10, 161-170, 171-180, 181-190, and 
4 more than 190 pounds) were not significant by 
‘,dhalysis of variance technique, although as can 
be seen in Table 1, individuals classified as 
“light” (less than 141 pounds) tended to regard 
r ~ themselves unfavorably. Performance scores were 
_ “not significantly related to either height or weight 
categories when these variables were tested 


- Separately. 


a Underweight and overweight refer to approximately the lower and upper 25% of the weight distribution, respectively, for 


An analysis of variance of SE scores over the 
nine height-weight combinations shown in the 
center of Table 1 (short-underweight, average- 
underweight, and so on) revealed highly signifi- 
cant differences among these groups (F=5.1, 
p <.001). The short-overweight category scored 
lowest on self-esteem (M = 47.8) while the 
average-overweight group scored highest (M 
= 53.2). An analysis of variance of performance 
scores over the same nine height-weight cate- 
gories revealed no significant differences among 
these groups on the military effectiveness vari- 
able (F = 0.6, p>.05). 

In the last section of Table 1 results for the 
four most deviant height-weight combinations are 
shown. Among very deviant individuals, self- 
evaluations tended to be consistently negative; 
however, in three of these deviant subgroups 
(very short-underweight, very tall-underweight, 
and very tall-overweight) performance tended 
to be above the norm. The mean performance 
score for these three subgroups combined dif- 
fered significantly from that for other indi- 
viduals (z= 2.04, p< .05). The very short- 
overweight category was an exception in that 
members of this group were not only dissatisfied 
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with themselves, but low on the military per- 
formance criterion as well. 


Discussion 


‘The form of the relationships of height and 
weight with satisfaction about height and weight 
was found to be curvilinear in contrast to the 
implication by Jourard and Secord (1954) that 
the relationship for height was linear while no 
relationship existed for weight. The influence of 
social norms upon self-evaluation was evident in 
the present results in that deviations from the 
generally preferred height (72 inches) or weight 
(151-180 pounds) in either direction resulted7in 
increased dissatisfaction with those attributes. 
The cultural ideal for body size appears to be 
slightly larger than actual body size for this 
population, Many young adult males apparently 
find small body size a threat to self-esteem and 
tend to depreciate their own personal worth 
based upon this perception. The common ob- 
servation that there are wide individual dif- 
ferences in reactions to such perceived physical 
inadequacies appears to be borne out in that 
low self-evaluation seems to have no consistent 
relation to military performance in this study 
(r=.07), There is a suggestion that many of 
those who fall at the extremes of height and 
weight, though dissatisfied with themselves, 
nevertheless perform at least as well as others 
and perhaps better. The very short-overweight 
group appeared to be an exception, The dif- 
ficulties of this group, however, could be entirely 
those of poor physical mobility in work situa- 
tions aboard an aircraft carrier where physical 
quickness, agility, and stamina are often required. 

The relationship between intelligence and 
satisfaction with intellectual level was sharply 
linear in the upper portion of the distribution 
(GCT score over 50), but flat in the range from 
35 to 50, Below 35 a high proportion expressed 
dissatisfaction with their intellectual level. The 
absence of an increase of expressed dissatis- 
faction in the 35-46 range suggests that GCT 
scores in the low 40s and even below are a 
matter of indifference to many in this popula- 
tion; however, it is plausible that scores above 
the norm are of some importance since admis- 
sion to most Navy technical schools depends 
heavily upon above average GCT scores, 

The present study strongly suggests that phys- 
ical characteristics play a significant part in 
self-evaluation and that research in the area of 
self-concept or self-regard should take the “real” 
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characteristics of persons into account, Results 
would probably be different for college popula- 
tions and for female populations, especially før 
the weight variable. An obvious extension ofo 
the present research would be to test the same 
relationships between real characteristics and 
self-regard in a maladjusted or abnormal popula- 
tion. Friedman (1955) reported no more dis- 
crepancy between self-image and ideal-imagw in 
schizophrenics than in normals and interpreted 
this result as a tendency for severely disturbed 
individuals to report self-characteristics in- 
accurately, While there is considerable support 
for this view in clinical Sache precise docu- 
mentation is lacking. Some direct evidence was 
adduced in the present study for an Adlerian 
overcompensation effect within a normal) healthy 
male population; this finding particularly ‘appears L 
worthy of further research in other populations. 

One limitation of the present study that should” 
be kept in mind is the use of self-reported 
rather than actual height and weight measures, 
Jourard and Secord (1955) found virtually 
identical relationships of dctual measurements of 
height and weight to satisfaction and self- 
reported height and weight to satisfaction. It 
seems highly probable that individuals in the 
present sample also knew their heights and} 
weights and reported them accurately. 
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RELATIONSHIP OF AGE, SEX, AND INTELLIGENCE LEVEL 
c TO EXTREME RESPONSE STYLE 


CAROLE S. LIGHT, MELVIN ZAX ¢ 
University of Rochester i? 
anp DWIGHT H. GARDINER 


Hobart and William Smith Colleges 


4 groups of 20 children each at each of 3 grade levels (4th, 8th, and 12th) 


age, sex, intelligence, 


scales excessively. The 4 group? were high šQ males, high IQ females, 


were used as Ss in a study to determine the existence of a relationship between 
and the tendency to use the extreme points of rating 


low IQ 


males, and low IQ females. Ss rated 10 Rorschach inkblots on 15 semantic 


differential scales. Each child was scored for the number of extreme (E), 


jp . intermediate (I), or neutral (N) ratings made. Brighter and older Ss made 
` e significantly lower E scores and higher I scores. No significant differences at- 
e trifutable to sex were found in these scores. 


In a,study on word meaning, Osgood, Suci, 

and Tannenbaum (1957) using the semantic dif- 

„° ferential, noted differences among different popu- 

lations in their choice*of extreme, intermediate, 

or neutral points along the 7-point scale. Com- 

pared to laymen, college students used the inter- 

mediary positions relatively more frequently and 

>- he extremes and neutral positions less. Amer- 

„ican Legionnaires tended to make exclusive use 

of the extremes or the extremes and neutral 

point. Osgood added that from their experience 

the frequency of extreme ratings might be 
expected to vary with sex and age. 

Berg and Collier (1953) found a sex difference 
for extreme tatings on the Perceptual Reaction 
Test. 4n an important addition, they also found 
that high-anxiety males made more such ratings 

| thfn low-anxiety males. That this extreme re- 
sponse set of high-anxiety subjects was a func- 
tion of the actual content of the alternatives 
rather than of their spatial position was shown 
by Lewis and Taylor (1955). 

In a series of studies, Zax, Gardiner, and 
Lowy (1964) substantiated the hypothesis that 
more extreme ratings would be given by the 

_ mors “immature” or poorly adjusted individual 
“Où the basis of their findings and their review 
ni Bf developmental theories they suggested that 
! the#tendency to respond in the extreme was a 
fulktion of developmental level and recom- 
mendedea controlled study to establish this. 


° Now at Teachers College, Columbia University. 
»'2Special credit is due Joan Mestrovich and 
Mary-Jo Bryan of William Smith College who ran 
¢he majority of subjects as part of a senior research 
= project and Louise Kraft Myers, school psycholo- 
gist for the Waterloo Central Schools, who sched- 
ged all of the classes for testing. 


r 


The notion that the developmental level of 
an individual would play a role in the making 
of extreme responses finds theoretical support 
in the work of Werner (1948), Piaget (see 
Flavell, 1963), and Lewin (1951). Common to 
all three writers is the idea of progression from 
homogeneity to heterogeneity, from a less to a 
more differentiated psychic structure. As catego- 
ries of experience multiply, there is more “free- 
dom of choice” for making judgments. From this 
it follows that in dealing with situations, the 
young child, or the pathologically regressed indi- 
vidual, would be more likely to give all or nothing 
reactions and to show more rigid modes of behav- 
ior than would the older child or normal indi- 
vidual. Both Werner and Piaget view this differ- 
ence between adult and childhood intellect as due 
to the gradual development of the “abstract fac- 
ulty.” This suggests that measures of IQ, insofar 
as they involve abstract concepts, would be re- 
lated to response judgments. (Note the similarity 
to the association drawn between Lewin’s “differ- 
entiation of the life space” and the measurement 
of mental age—Kounin, 1943). We might expect, 
therefore, that extreme judgments would de- 
crease not only with increasing chronological age 
but also with increasing intellectual maturity 
within given age groups. 

With this theoretical background, and the re- 
sults of the studies reported above, a develop- 
mental study was undertaken in which children 
of different grade levels were compared on the 
number of extreme, intermediate, and neutral 
ratings made on a semantic differential rating 
scale. Rorschach inkblots were used as the stim- 
uli. It was expected that younger children would 
rate more extremes and fewer intermediates than 
older children and, based on Osgood’s findings, 
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TABLE 1 scales as devised by Osgood et al. (1957) which were E 
indicated by the Thorndike-Lorge (1944) word 


Mean IQ Lever ror THE 12 Groves frequency list to be appropriate for a Grade»3 oh 
Í 


=O ey vocabulary level. The scales used were: 7 from the? 
» evaluative factor—clean-dirty, beautiful-ugly, kind- 
19 cruel, fair-unfair, good-bad, wise-foolish, happy-sad ; 
‘i 5 from the potency factor—strong-weak, rough- 
High Low smooth, heavy-light, large-small, hard-soft; and 3 
Gadd: from the activity factor—sharp-dull, hot-cold, fast- 
ae 116.15 92.05 slow. ‘The scales were arranged in different orders foy 
Females 127.00 96.40 each inkblot. The data were collected in a classroom 
rade 8 containing 20-30 subjects. Each subject was given a 
G 4 4 
Males 122.25 92.15 booklet which contained 10 sheets on, each of which 
Females 126.55 95.30 appeared the 15 adjective pairs, the orler“of the 
Grade 12 er 9200 @ scales being different from one sheet to another. The 
Melee +e 45 100.50 booklet also contained the Standard semantic differ- 
comes 4 i ential instructions adapted for use with inkblots. An © 
image of the Rorschach cards was projected on a i 
that females would show more extremes than A w 2 
; pen TABLE 3 >» 4 
males. A third hypothesis was that within grade S v. nh k 
levels the more intelligent children would rate ral eae Scones oe 
fewer extremes than the less intelligent. ® 
METHOD Source sS af “us| F > 
The subjects for this study were 240 students in G Between Grades 4, 2 $ 
the public school systems of Waterloo, New York. (usp 12 () Bia 6,717.12] 2 | 3,358.56] 3.89% 
They were divided into four groups at each of three ‘iow (B) oie 5,245.36 | 1 | 5,245.36 | 5.29% 
different grade levels: Grade 12 (approximate ages; Between sexes, male- 
17-18), Grade 8 (13-14), and Grade 4 (9-10). The g Xe“ © Pa E ES 0 0.00 gy 
four groups each contained 20 subjects; they were: AXB 5,215.70| 2 | 2,607.85| 2.63 
high IQ males, low IQ males, high IQ females, and R Sis xB abla Eaa a 
low IQ females. The mean IQs for each Within cell: , 4 } á 
4 umasa each group are I in cells 225,900.40 | 228 3990.79 
given in Table 1. A 
Tt will be noted that high TQ males and females È 10,375.40 1 1037840 11.47m 
are not too well matched across grade levels or at i f y: 
times even pi grade levels (Grade 4). At the AXE 25269] 2 | tare | ons 
same time, however, average differences between 4 Pedant ATE AEE e 
Pi 5 XCXB l.. a j. K 
ae ae ey moa ae Senne everywhere Within cells 206,321.20 228 i Soag b pe 
rade emales. 
Data in the form of semantic differential ratings * p <.05. z OTI 
were collected using the 10 Rorschach inkblots as sere ye 


stimuli and a set of 15 7-poi i jecti 
of 15 7-point bipolar adjectival screen in the front of the classroom and it was this ` 


TABLE 2 projected image that was rated. There was, inci- 


dentally, no difficulty in administering the scales 
MEANE, I, > y in administering the sc 
eae OO A COFLEIN even at the youngest grade levels. (Contrary, thank- 


fully, to expectations!) The only caution was to 


E I N read the instructions carefully with the students 
Gide and to give (one or two examples on the blackboard. | 
High IQ male 48,60 65.45 35.95 The total time required for the Grade 4, lova JQ 
Low IQ male 69,15 58.85 22.00 children, was 45-50 minutes. 2, p 
High IQ female | 43.90 70.65 35.45 
Lam dQ female | 6530 | 5330 | 3140 ResurTs 
2 TA, 
High 10 male 46.30 75.95 TE Analyses were carried out by tallying for eich: 


Low IQ male 42.80 61.00 46.20 subject the number of times he made an æxtteme 
High TQ female | 4625 | 7850 | 2525 (E) rating of 1 or 7, an intermediate (I) rating” 


Low IQ fi oe 
Grade if carl [deans 64.95 39.00 of 2, 3, 5, or 6, or a neutral (N) rating a4, 


High IQ male 37.70 92. on the semantic differential scales. Subjects made >“ 
Low IQ male 52.10 oes aoe a total of 150 ratings: Scores for ET and RA 
High IQ female} 44.75 79.85 25.40 were determined for each subject. The mean E 
Low IQ female | 48.20 66.50 35.30 I, and N scores for each group are iven in 
Table 2. Seas 


L -2 The figures in Table 2 indicate that there is 
the predicted drop in E score, at least direction- 
Wly, as one moves from fourth graders to 

. twelfth graders. This is not always, however, a 

* smoothly progressive drop as one moves from 
fourth graders to eighth graders and then to 
twelfth graders. Also, older high IQ females fail 
to demonstrate E score differences from younger 
high IQ females. Differences in I scores between 

eyounger and older subjects seem to be smoother 
with older subjects consistently displaying higher 
mean I scores. Both older males and females 

of nih $Q Rave lower N scores than younger 

: high IQ subjects, while the reverse is true for 

the low IQ groups wlfere younger subjects have 

the high N scores. 

In. ofder to test for the significance of the 
mean “differences observed in Table 2, three 
$ .2 x2 X 3 analyses of variance were carried out, 
* ythe three variables under consideration being 
grade tevel, IQ, and sex. The relevant F tests 
for E and I scores are presented in Table 3. 
Separate analyses were done for E, N, and I 
‘scdres despite the likelihood that these three 
scores are not jndependent. 

Tt can be seen in Table 3 that differences in 
both E and I scores are significant between grade 
levels and IQ levels. The analysis of variance 
for. the N scores revealed only a significant 
interaction between grade level and IQ, indi- 
‘cating that the. greater number of N ratings 
given by high IQ children of a younger age and 
by older, low IQ children was significant. 


jf a S DISCUSSION 

These results support the earlier stated hy- 
igerse regarding the effects of development 

d IQ on the tendency to use extreme or inter- 
Mediate points on rating scales. The fact that 
fewer E and more I ratings are used by older chil- 
‘fren and by those of high IQ also supports theo- 
_tetical notions of the gradual development of the 
thild’s abstract powers, especially those of Heinz 
Warner (1948). Such development would indi- 
te the possibility of greater control and com- 


pi 


giepaint i 
A found as a function of IQ, since the mea- 

Rement of IQ is often related to one’s ability 
© use abstractions, Incidentally, this finding is 
'£ 
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similar to that reported by Osgood et al. (1957) 
and attributed to Kerrick. 

Somewhat surprising is the failure to find sig- 
nificant differences in E ratings between males 
and females as has been reported elsewhere 
(Berg & Collier, 1953; Osgood et, al., 1957), 
usually among college students. An examination 
of Table 2 reveals that the one pair of groups 
in our sample which most resembles such sub- 
jects (high IQ males and females in Grade 12) 
show the expected directional differences between 
sexes, although not to a significant degree. Our 
failure to find differences among twelfth graders 
may be related to the fact that the females were 
slightly brighter than the males and that neither 
group was as bright as an average college group. 


- We might point out that our Grade 12 males 


and females made 25% and 30% of their ratings, 
respectively, in the extreme category, and Zax, 
Loiselle, and Karras (1960) report that their 
male college subjects used these categories for 
only 16% of their ratings. 
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